Job Description
We are partnered with a fast-growing European NeoCloud / AI infrastructure company building high-performance GPU cloud platforms for AI, machine learning and large-scale compute workloads.
The company is deploying GPU infrastructure across multiple regions and building the next generation of containerised and serverless platforms for AI workloads. Their customers include teams running demanding workloads across inference, training, model serving, HPC and distributed compute.
This is an opportunity to work close to the infrastructure layer of AI: containers, storage, networking, Linux, Kubernetes, model runtimes and GPU execution.
About the Role
We are looking for a GPU Infrastructure Performance Engineer to help make AI workloads start faster, load faster and run faster across large-scale GPU infrastructure.
This is not a generic SRE role, and it is not a pure ML research position. The role sits between infrastructure p...