Machine Learning Data Engineer for Replica

Parallel Domain

Full-time Engineering
Apply Now
Location
vancouver, metro vancouver regional district, Canada
Posted
June 01, 2026

Job Description

Help shape the future of autonomous systems at Parallel Domain as a Machine Learning Data Engineer. Focus on building scalable data pipelines to support the Replica simulation platform.

This position entails overseeing data ingestion processes and ensuring reliable data pipelines that facilitate the training and validation of machine learning models. You will collaborate with technical teams to establish data standards and implement curation tools that enhance data quality. Your role is key in advancing our photorealistic digital twin technology.

Key Responsibilities:
• Develop ingestion pipelines for customer and synthetic data
• Set data quality standards and validation protocols
• Create annotation and dataset management tools
• Ensure high-quality data for ML models and evaluations

Requirements:
• Solid background in data engineering
• Knowledge of ML training data requirements
• Experience with 3D geo...