Our robots generate massive multi-modal data streams spanning video, audio, proprioception, and control trajectories. To learn from this at scale, we're building a robot data engine that turns real-world experiences into structured training data for our foundation models.
This role sits at the core of that system, creating the data and compute infrastructure that makes large-scale embodied learning possible.
Role Overview
You will architect and maintain the data platform powering our robot learning stack, ensuring high-quality fleet data is captured, synchronized, labeled, and available for large-scale training. You will work across edge devices, on-prem clusters, and cloud infrastructure to build robust, automated, and scalable data flows.
Responsibilities
Design and maintain ETL pipelines to collect, synchronize, and process data from distributed robot fleets.
Implement intelligent triggers to capture the most informative episodes for learning (e.g., manipulation failures, locomotion drift).
Develop multi-modal data storage and query systems for video, audio, proprioception, and action data.
Automate annotation and labeling pipelines using AI-assisted tools.
Integrate on-device logging with cloud pipelines for seamless dataset creation.
Provide training-ready datasets to autonomy teams and monitor data quality at scale.
Preferred Qualifications
Strong background in distributed systems, data infrastructure, or robotics data pipelines.
Proficiency in Python, Go, or C++.
Experience with Kubernetes, Airflow, or NATS.
Understanding of multi-modal data handling and large-scale dataset design.
Familiarity with robotics telemetry, on-robot logging, and cloud integration (S3, NFS, gRPC).
Experience designing metrics dashboards and automating feedback loops between data and model performance.
Bonus Skills
Built or contributed to robotic fleet data systems.
Experience with foundation model data curation (tokenization, sharding, filtering).
Strong interest in enabling embodied AI through scalable data infrastructure.