Location
toronto, on, Canada
Posted
June 27, 2026
Job Description
Drive innovation in AI as a Deep Learning Optimization Engineer at RedHat. Focus on enhancing open-source LLM technologies and streamlining GenAI performance across enterprises.
In this senior position, you will collaborate with the RedHat AI Inference team to develop cutting-edge optimization algorithms for deep learning applications. Your role will emphasize collaborating with research scientists and mentoring junior engineers, while supporting impactful open-source projects that shape the AI landscape.
Key Responsibilities:
• Design and implement inference optimization algorithms
• Enhance model compression using quantization methods
• Profile LLM end-to-end performance for optimization
• Collaborate to translate experimental solutions into production
• Guide teams and participate in open-source contributions
Requirements:
• Deep understanding of machine learning fundamentals
• Experience with PyTorch and advanced programming skills
• Strong backgrou...
In this senior position, you will collaborate with the RedHat AI Inference team to develop cutting-edge optimization algorithms for deep learning applications. Your role will emphasize collaborating with research scientists and mentoring junior engineers, while supporting impactful open-source projects that shape the AI landscape.
Key Responsibilities:
• Design and implement inference optimization algorithms
• Enhance model compression using quantization methods
• Profile LLM end-to-end performance for optimization
• Collaborate to translate experimental solutions into production
• Guide teams and participate in open-source contributions
Requirements:
• Deep understanding of machine learning fundamentals
• Experience with PyTorch and advanced programming skills
• Strong backgrou...