Location
london, england, United-Kingdom
Posted
June 06, 2026
Job Description
Neo4j's Site Reliability Engineering teamβs mission is to improve the reliability of Neo4jβs DBaaS product: Neo4j Aura. Operating at a global scale across all three major cloud providers, Aura runs hundreds of Kubernetes clusters and hosts thousands of Neo4j instances in production at any given time.
The Role
- Automate for insight and scale: Build systems that make troubleshooting fast, safe, and scalable across thousands of Neo4j instances. From internal tools that surface clear insights to canaries that support safe rollouts, you'll focus on automation that elevates reliability engineering.
- Treat operations as a software problem: Replace tribal knowledge and ad-hoc scripts with tools and systems that codify best practices - making operations predictable, scalable, and repeatable.
- Design for resilience, learn from failure: Own and evolve the tooling and processes behind incident response. From clear alerts to bla...