New Grad - ML Stack Optimization Engineer

Cerebras Systems

Full-time Other-General
Apply Now
Location
toronto, on, Canada
Posted
May 27, 2026

Job Description

New Grad - ML Stack Optimization Engineer

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPUโ€‘based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking realโ€‘time iteration and increasing intelligence via additional agentic computation.

Job Overview

We are seeking a highly skilled Com...