Research Engineer, Data Engine

Singapore, S00, SG, Singapore

Job Description

About Us

Our robots generate massive multimodal data streams, from video and audio to proprioception and control trajectories. To learn from this data at scale, we're building a robot data engine that turns real-world experience into structured training data for our foundation models.

This role sits at the core of that system, creating the data and compute infrastructure that makes large-scale embodied learning possible.

Role Overview

You will architect and maintain the data platform powering our robot learning stack, ensuring high-quality fleet data is captured, synchronized, labeled, and available for large-scale training. You will work across edge devices, on-prem clusters, and cloud infrastructure to build robust, automated, and scalable data flows.

Responsibilities

  • Design and maintain ETL pipelines to collect, synchronize, and process data from distributed robot fleets.
  • Implement intelligent triggers to capture the most informative episodes for learning (e.g., manipulation failures, locomotion drift).
  • Develop multimodal data storage and query systems for video, audio, proprioception, and action data.
  • Automate annotation and labeling pipelines using AI-assisted tools.
  • Integrate on-device logging with cloud pipelines for seamless dataset creation.
  • Provide training-ready datasets to autonomy teams and monitor data quality at scale.



Preferred Qualifications

  • Strong background in distributed systems, data infrastructure, or robotics data pipelines.
  • Proficiency in Python, Go, or C++.
  • Experience with Kubernetes, Airflow, or NATS.
  • Understanding of multimodal data handling and large-scale dataset design.
  • Familiarity with robotics telemetry, on-robot logging, and cloud integration (S3, NFS, gRPC).
  • Experience designing metrics dashboards and automating feedback loops between data and model performance.



Bonus Skills

  • Built or contributed to robotic fleet data systems.
  • Experience with foundation model data curation (tokenization, sharding, filtering).
  • Strong interest in enabling embodied AI through scalable data infrastructure.



Job Detail

  • Job Id
    JD1670354
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Singapore, S00, SG, Singapore
  • Education
    Not mentioned