Apply Now
Location
Taipei, Taiwan, Taiwan
Posted
June 03, 2026

Job Description

We are now looking for a Software Development Engineer for LLM inference!


NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT, and GenerativeAI that have put DL at the β€œiPhone moment” for AI. Join the team which is building the inference software which will be used across our product lines! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.


What you'll be doing:
+ Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance
+ Performance analysis, optimization, and tuning for Large Language Models (LLMs)
+ Closely follow academic developments in the field of artificial intelligence and feature update TensorRT-LLM
+ Provide feedback into the architec...