Take a purist SRE approach to shared multi-tenant infrastructure for a resilient SaaS microservice-based containerized systems in addition to customer-centric application environments
Oversee and automate the teamβs growing presence in AWS
Contribute to core infrastructure systems development with features, bug fixes, reliability improvements, etc
Platform reliability engineering of a complex single sign-on SAML/OAuth-based central authentication platform
Creatively build and develop tooling to aid in driving 24x7x365 follow-the-sun operations of critical production systems
Automate deployment tasks for core product and infrastructure tools and maintain automation infrastructure
Create system documentation and training materials to empower and educate our fellow team members
Build and maintain observability tooling, metrics, and dashboarding for a global platform produc...