Research Engineer – Multimodal Foundation Models | Robotics | Axioma Search

Research Engineer (Robotics – Model Training)

About

Early-stage robotics and foundational AI company building universal robotics foundation models for general-purpose mobile robots. The team focuses on simulation-first data generation, proprietary multimodal models, tight hardware integration, and large-scale training systems powering real-world robotics.

You will join the team responsible for the core model training stack.

What you'll do

  • Build and optimise training infrastructure for large-scale vision-language and multimodal foundation models
  • Design systems for long-context video training, including sequence parallelism at scale
  • Support autoregressive and diffusion-based models for actions and video
  • Implement sampling during training (self-forcing), so models learn from their own generated outputs, to reduce distribution drift between training and inference
  • Enable RL post-training for multimodal models
  • Own data flow, memory movement, and GPU utilisation across complex training loops

What you'll need

  • Extensive experience in ML infrastructure, distributed systems, or high-performance computing
  • Direct experience training large vision-language or multimodal foundation models
  • Strong background in large-scale distributed training and GPU performance tuning
  • Experience from top AI labs, frontier model teams, or elite infrastructure groups

Shortlisted candidates will be contacted within 48 hours.

  • Location: London, Paris, San Francisco
  • Salary / Compensation: Up to £220k + equity
  • Work Setup: Permanent, on-site
  • Sectors: Robotics, Deep Tech, Frontier AI / Foundation Models
  • Skills: ML infrastructure, Distributed Training, CUDA, Triton, GPU optimisation, Robotics

Role Contact

Alex Jouatte
