Location
toronto, on, Canada
Posted
June 03, 2026
Job Description
Drive advanced AI techniques as a Machine Learning Engineer with Red Hat, focusing on model optimization and deployment. This role involves collaborating with diverse teams to implement cutting-edge deep learning algorithms.
As a key member of the Red Hat AI Inference team, you will develop state-of-the-art software for model optimization algorithms, particularly focusing on LLM projects. Your collaboration with product and research teams is crucial as you create effective pipelines and productize deep learning research while ensuring that you're at the forefront of AI innovation.
Key Responsibilities: • Contribute to design and development of inference optimizations • Optimize model compression pipelines using quantization techniques • Develop speculative decoding frameworks to enhance inference speed • Collaborate with researchers to translate ideas into production systems • Optimize end-to-end LLM performance for various hardware
Requirements: • Strong mac...
As a key member of the Red Hat AI Inference team, you will develop state-of-the-art software for model optimization algorithms, particularly focusing on LLM projects. Your collaboration with product and research teams is crucial as you create effective pipelines and productize deep learning research while ensuring that you're at the forefront of AI innovation.
Key Responsibilities: • Contribute to design and development of inference optimizations • Optimize model compression pipelines using quantization techniques • Develop speculative decoding frameworks to enhance inference speed • Collaborate with researchers to translate ideas into production systems • Optimize end-to-end LLM performance for various hardware
Requirements: • Strong mac...