Location
toronto, on, Canada
Posted
June 27, 2026
Job Description
Drive innovation in AI as a Deep Learning Optimization Engineer at RedHat. Focus on enhancing open-source LLM technologies and streamlining GenAI performance across enterprises.
In this senior position, you will collaborate with the RedHat AI Inference team to develop cutting-edge optimization algorithms for deep learning applications. Your role will emphasize collaborating with research scientists and mentoring junior engineers, while supporting impactful open-source projects that shape the AI landscape.
Key Responsibilities: • Design and implement inference optimization algorithms • Enhance model compression using quantization methods • Profile LLM end-to-end performance for optimization • Collaborate to translate experimental solutions into production • Guide teams and participate in open-source contributions
Requirements: • Deep understanding of machine learning fundamentals • Experience with PyTorch and advanced programming skills • Strong background in ...
In this senior position, you will collaborate with the RedHat AI Inference team to develop cutting-edge optimization algorithms for deep learning applications. Your role will emphasize collaborating with research scientists and mentoring junior engineers, while supporting impactful open-source projects that shape the AI landscape.
Key Responsibilities: • Design and implement inference optimization algorithms • Enhance model compression using quantization methods • Profile LLM end-to-end performance for optimization • Collaborate to translate experimental solutions into production • Guide teams and participate in open-source contributions
Requirements: • Deep understanding of machine learning fundamentals • Experience with PyTorch and advanced programming skills • Strong background in ...