Location
Markham, York Region, Canada
Posted
May 24, 2026
Job Description
Drive AI model development, focusing on large-scale training optimization and high-performance inference. Improve GPU performance to achieve world-class computational efficiency and reliability.
This senior engineering role is designed for candidates who excel in developing AI infrastructure and optimizing GPU performance. You will manage comprehensive training environments while addressing issues that arise during distributed processing. Experience with LLMs and GPU kernel development is essential.
Key Responsibilities:
• Ensure efficient large-scale model training on GPUs
• Architect solutions for complex inference serving frameworks
• Optimize pipeline reliability and performance monitoring
• Debug training issues across GPU generations
• Collaborate with architecture teams on performance enhancements
Requirements:
• Significant experience in AI/ML technologies and infrastructure
• Proven expertise in GPU kernel development and optimization
• Familia...