Software Engineer, Productivity - Inference Runtime

OpenAI

Full time Computer Occupations
Apply Now
Location
San Francisco, California, United States
Posted
July 02, 2026

Job Description

About the Team

We’re hiring a Developer Productivity engineer to support OpenAI’s Inference Runtime teams. These teams own the systems responsible for serving models reliably, efficiently, and safely across Codex, ChatGPT, API, and internal research workloads. We’re hiring a Developer Productivity Engineer to help scale the engineering systems, safeguards, and developer workflows that enable our teams to move quickly without compromising reliability or performance.

This role sits at the intersection of developer experience, CI/CD infrastructure, release engineering, production readiness, and inference systems reliability. You’ll work on the tooling and operational foundations that support model launches, inference optimizations, cloud provider integrations, and large-scale deployments across a rapidly evolving inference stack.

About the Role

We’re looking for an autonomous, high-ownership engineer who cares deeply abo...