Senior Machine Learning Engineer, RL / Locomotion
Job Description
Anduril Industries seeks a Senior Machine Learning Engineer with a specialization in reinforcement learning and locomotion to advance the mobility of legged robotic platforms. This onsite role in Costa Mesa, CA, oversees the full pipeline from simulation-based training to real-world deployment, building robust locomotion policies for challenging terrain. The position offers a salary range of USD 220,000 to 336,000 per year and requires a minimum of 3 years of RL experience with robots.
Responsibilities
- Design, train, and deploy reinforcement learning policies for legged robot locomotion, leveraging GPU-accelerated simulation environments such as Isaac Gym and Isaac Lab.
- Develop terrain curricula and domain randomization strategies to produce policies robust to real-world conditions.
- Lead the sim-to-real transfer workflow, diagnosing and bridging reality gaps.
- Train locomotion policies for stairs, rough terrain traversal, payload carrying, push recovery, and fall recovery.
- Define and assess metrics that measure locomotion robustness.
- Collaborate with manipulation and perception engineers to integrate locomotion into the autonomy stack.
Requirements
- 3 to 8+ years of reinforcement learning experience for legged or mobile robots.
- Strong background in dynamics, control, and robot locomotion.
- Experience with RL algorithms such as PPO and SAC and training frameworks including rsl_rl, Stable Baselines, and rl_games.
- Hands-on experience with physics simulation tools such as Isaac Gym, MuJoCo, and PyBullet.
- Proven track record of sim-to-real transfer on physical robotic systems.
- Proficiency in Python and PyTorch.
- Eligibility to obtain and maintain a U.S. security clearance.
Technologies
- Isaac Gym
- Isaac Lab
- MuJoCo
- PyBullet
- Python
- PyTorch
- PPO
- SAC
- rsl_rl
- Stable Baselines
- rl_games
- NVIDIA Isaac Lab
- Omniverse
Benefits
- Equity grants
- Comprehensive benefits package for full-time employees
- At Anduril, we invest in our people. Our comprehensive, competitive benefits package (available at little to no cost to employees) ensures you're supported in health, recovery, and whatever comes next.
About the Team
The Anduril Frontier Systems team is building the next generation of robotic platforms for defense and industrial applications. We are a small, high-performing team of roboticists, ML engineers, and systems engineers delivering real-world capability to operators in the field. Our systems operate in unstructured, contested environments where robustness and reliability are non-negotiable. We seek to deliver validated, integrated capabilities to the Department of Defense and the Intelligence Community.
About the Job
We are seeking a Senior Machine Learning Engineer who specializes in RL and Locomotion to develop and deploy locomotion policies for legged robotic platforms. You will own the full pipeline from simulation training to real-world deployment, building systems that enable robust mobility across challenging terrain— rubble, stairs, slopes, and degraded environments. Your work will directly determine whether our platforms can operate where warfighters need them.
Protecting Yourself From Recruitment Scams
Anduril is committed to maintaining the integrity of our talent acquisition process and the security of our candidates. We have observed a rise in sophisticated phishing and fraudulent schemes where individuals impersonate Anduril representatives, luring job seekers with false interviews or job offers. These scammers often attempt to extract payment or sensitive personal information.
Data Privacy
To view Anduril's candidate data privacy policy, please visit https://anduril.com/applicant-privacy-notice/.
By submitting your application, you consent to Anduril Industries using a third-party service provider to conduct pre-employment risk, integrity, and due diligence screening and assessing potential risks as part of your application process. This third-party service provider provides risk-intelligence services that may include analysis of sanctions and watchlists, adverse media, public-record information, and other lawful open-source or commercial data sources.