AI Computing Development Engineer, TensorRT-LLM

NVIDIA

Full-time other-general
Location
Shanghai, China
Posted
May 28, 2026

Job Description

NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep-learning-powered AI, enabling breakthroughs in areas such as LLMs, ChatGPT, and generative AI that have put deep learning at the “iPhone moment” for AI. Join the team that is building the inferencing software that will be used across our product lines! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.




What you'll be doing:
+ Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
+ Performance analysis, optimization and tuning
+ Closely follow academic developments in the field of artificial intelligence and update TensorRT-LLM features accordingly
+ Provide feedback into the architecture and hardware design and development
+ Collaborate across the company to guide the direction of machine ...