Manager, Software Engineering - Production AI Inference

NVIDIA

Full-time other-general

Apply Now

Location

Santa Clara, CA, United States

Posted

July 04, 2026

Job Description

                        NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a deeply technical software manager to lead production AI inference for NVIDIA Inference Microservices (NIM), the production runtime through which customers deploy optimized, enterprise-supported AI inference across cloud, data center, and edge environments. NIM makes state-of-the-art AI models available as production-ready software stack, combining optimized inference engines, model profiles/recipes, validated runtime configurations, and security hardening. This role leads the team accountable for turning fast-moving model and inference engine work into reliable NIM releases that customers can operate with confidence. 
  
 This is a hands-on engineering management role for someone who can run production execution without managing from a distance. You will lead engineers working across model onboarding, serving stack integration, performance profiling/optimization, release quality, s...

Apply Now Similar Jobs

Job Details

Job Type

Full-time
Category

other-general
Date Posted

July 04, 2026
Application Deadline

July 09, 2026