Software Engineer, Productivity - Inference Runtime

OpenAI

Full time Computer Occupations

Location

San Francisco, California, United States

Posted

July 02, 2026

Job Description

About the Team
We’re hiring a Developer Productivity Engineer to support OpenAI’s Inference Runtime teams. These teams own the systems responsible for serving models reliably, efficiently, and safely across Codex, ChatGPT, API, and internal research workloads. In this role, you’ll help scale the engineering systems, safeguards, and developer workflows that enable these teams to move quickly without compromising reliability or performance.
This role sits at the intersection of developer experience, CI/CD infrastructure, release engineering, production readiness, and inference systems reliability. You’ll work on the tooling and operational foundations that support model launches, inference optimizations, cloud provider integrations, and large-scale deployments across a rapidly evolving inference stack.
About the Role
We’re looking for an autonomous, high-ownership engineer who cares deeply abo...
                    

Job Details

Job Type

Full time

Category

Computer Occupations

Date Posted

July 02, 2026

Application Deadline

August 11, 2026