Machine Learning Optimization Engineer Red Hat

Red Hat

Full-time Other-General

Apply Now

Location

toronto, on, Canada

Posted

June 03, 2026

Job Description

                        Drive advanced AI techniques as a Machine Learning Engineer with Red Hat, focusing on model optimization and deployment. This role involves collaborating with diverse teams to implement cutting-edge deep learning algorithms.

As a key member of the Red Hat AI Inference team, you will develop state-of-the-art software for model optimization algorithms, particularly focusing on LLM projects. Your collaboration with product and research teams is crucial as you create effective pipelines and productize deep learning research while ensuring that you're at the forefront of AI innovation.

Key Responsibilities: • Contribute to design and development of inference optimizations • Optimize model compression pipelines using quantization techniques • Develop speculative decoding frameworks to enhance inference speed • Collaborate with researchers to translate ideas into production systems • Optimize end-to-end LLM performance for various hardware

Requirements: • Strong mac...

Apply Now Similar Jobs

Job Details

Job Type

Full-time
Category

Other-General
Date Posted

June 03, 2026
Application Deadline

July 13, 2026