Location
Toronto, ON, Canada
Posted
June 08, 2026

Job Description

Elevate reliability standards at Confluent as a Senior Site Reliability Engineer. Focus on proactive reliability improvements within a multi-cloud streaming platform while managing incident response practices.

In this senior role, you'll devote 75% of your time to engineering: improving tooling, analyzing failure patterns, and designing solutions. The remaining 25% involves teaching and coordinating incident response improvements, coaching teams, and driving organizational changes in reliability practices. Your expertise will help minimize incidents across Confluent Cloud's dynamic environment.

Key Responsibilities:
• Analyze failure patterns for proactive reliability design
• Own configuration of Rootly and integrations with key tools
• Define and maintain SLO/SLA frameworks
• Edit customer-facing incident documents for quality
• Develop training programs and coach teams through post-mortems

Requirements:
• 10+...