Location
Manhattan, New York, United States
Posted
June 01, 2026
Job Description
What you ’ll doDesign and implement high-performance infrastructure to support large-scale generative AI and machine learning workloads, enabling faster model iteration and real business impact Design and operate distributed systems for model training, hyperparameter tuning, inference, and data preprocessing pipelines to deliver reliable end-to-end machine learning (ML) workflows Collaborate with ML researchers and engineers to produce models, optimizing compute utilization, training throughput, and inference latency Develop and automate deployment, orchestration, and CI/CD pipelines for models and data workflows using container orchestration and infrastructure-as-code (IaC) Implement observability, monitoring, and cost-management strategies for GPU and accelerator compute environments to maintain predictable performance and spend Evaluate, integrate, and benchmark emerging hardware and software technologies across cloud and on-prem environme...