Collaborate with research scientists to advance methods in:
+ Planning and RL for computer use (e.g. behavioral cloning, RL on model weights, RAG-based domain knowledge)
+ Multimodal grounding (e.g. vision-only models, tree search, hybrid methods with large models)
+ Reward/judge modeling (e.g. error analysis, human evaluation, training judge models)
+ User intent understanding (e.g. modeling vague queries, preference learning) Contribute to building datasets, running experiments, and benchmarking results
Explore novel approaches and help derisk Simular's long-term technical roadmap
Document and communicate findings through internal reports or academic-style writing
You might be a fit if
Currently pursuing a PhD in Computer Science, Machine Learning, or related field
Research background in at least one of: Reinforcement learning, Large language/vision-language models, Computer vision and multimodal perception, Representation learning
Experience conducting experiments and publishing or preparing papers in top-tier conferences (NeurIPS, ICLR, ICML, CVPR, ACL, etc.)
Strong coding and prototyping skills in Python and ML frameworks (PyTorch/JAX)
* Curiosity, initiative, and interest in bridging fundamental research with applied AI
Beware of fraud agents! do not pay money to get a job
MNCJobz.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.