Location
toronto, on, Canada
Posted
June 27, 2026
Job Description
Responsibilities
- Managing the reliability of critical infrastructure and application platforms
- Improve and maintain site availability, scalability, service and system performance
- Investigate system errors and problems, bottleneck analysis of the system at scale, etc.
- Provide solutions for performance management, disaster recovery, monitoring and access management
- Participate in planning and retrospective sessions, attending standโups, etc.
- Build and operate highly available and scalable software and infrastructure.
- Supporting application teams on the use of the platform including providing guidance on design patterns, best practices, and security considerations.
- Our teams are flexible and fast โ you will be asked to provide peer review and quality control on a daily basis.
- Own dayโtoโday operations, including incident response, problem management, change management, and operational rea...