Machine Learning Data Engineer, Replica Pipelines

Parallel Domain

Full-time Engineering
Apply Now
Location
vancouver, bc, Canada
Posted
June 01, 2026

Job Description

Parallel Domain is building the worldโ€™s most advanced simulation and digital twin platform for autonomy, robotics, and computer vision. Our Replica product creates large-scale, photorealistic digital twins of real-world environments used for testing, validation, and development of autonomous systems.

About the role

We are hiring a Machine Learning Data Engineer responsible for building and scaling the data pipelines that support Replica and ML model development. You will ensure that data flows efficiently from raw customer inputs through validated, structured formats suitable for training, evaluation, and production systems.

What youโ€™ll do

Own data ingestion: Build reliable pipelines to normalize and validate customer and synthetic data.

Define data standards: Create schemas, validation checks, and quality metrics for Replica datasets.

Build curation tooling: Implement tools for dataset filtering, versioning, and annotation support....