Senior Machine Learning Engineer, RL / Locomotion

Anduril Industries•Costa Mesa, CA

15d

About The Position

The Anduril Frontier Systems team is building the next generation of robotic platforms for defense and industrial applications. We are a small, high-performing team of roboticists, ML engineers, and systems engineers delivering real-world capability to operators in the field. Our systems operate in unstructured, contested environments where robustness and reliability are non-negotiable. We seek to deliver validated, integrated capabilities to the Department of Defense and the Intelligence Community. We are seeking a Senior Machine Learning Engineer who specializes in RL / Locomotion to develop and deploy locomotion policies for legged robotic platforms. You will own the full pipeline from simulation training to real-world deployment, building systems that enable robust mobility across challenging terrain - rubble, stairs, slopes, and degraded environments. Your work will directly determine whether our platforms can operate where warfighters need them.

Requirements

3 - 8+ years of experience with reinforcement learning for legged or mobile robots
Strong background in dynamics, control, and robot locomotion
Experience with RL algorithms (PPO, SAC) and training frameworks (rsl_rl, Stable Baselines, rl_games)
Hands-on experience with physics simulation (Isaac Gym, MuJoCo, PyBullet)
Demonstrated sim-to-real transfer on physical robotic systems
Proficiency in Python and PyTorch
Eligible to obtain and maintain a U.S. security clearance

Nice To Haves

Experience with bipedal or multi-limbed robotic platforms
Experience with NVIDIA Isaac Lab or Omniverse
Publications in top robotics venues (RSS, CoRL, ICRA, IROS)
Prior work in defense technology or startups

Responsibilities

Design, train, and deploy reinforcement learning policies for legged robot locomotion using GPU-parallelized simulation (Isaac Gym / Isaac Lab)
Develop terrain curriculum and domain randomization strategies that produce policies robust to real-world conditions
Own the sim-to-real transfer pipeline, identifying and closing reality gaps
Train policies for stair climbing, rough terrain traversal, payload carry, push recovery, and fall recovery
Define and evaluate performance metrics for locomotion robustness
Collaborate with manipulation and perception engineers to integrate locomotion into a full autonomy stack

Benefits

Highly competitive equity grants are included in the majority of full time offers; and are considered part of Anduril's total compensation package.
Comprehensive, competitive benefits package (available at little to no cost to employees) ensures you’re supported in health, recovery, and whatever comes next.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume