Staff Software Engineer, GPU Infrastructure (HPC)

Cohere Inc.

Full-time Other-General
Apply Now
Location
toronto, on, Canada
Posted
June 15, 2026

Job Description

Why this team?

The internal infrastructure team is responsible for building world‑class infrastructure and tools used to train, evaluate and serve Cohere's foundational models. By joining our team, you will work in close collaboration with AI researchers to support their AI workload needs on the cutting edge, with a strong focus on stability, scalability, and observability. You will be responsible for building and operating superclusters across multiple clouds. Your work will directly accelerate the development of industry‑leading AI models that power Cohere's platform North.

All of our infrastructure roles require participating in a 24x7 on‑call rotation, where you are compensated for your on‑call schedule.

As a Staff Software Engineer, You Will

  • Build and scale ML‑optimized HPC infrastructure: Deploy and manage Kubernetes‑based GPU/TPU superclusters across multiple clouds, ensuring high throughput and low‑latency performance for AI workl...