BA
Senior ML/RL Engineer, Behavior Planning
Published 2 days ago
About the company
Bot Auto is revolutionizing the transportation of goods with cutting-edge autonomous trucks. With the agility of a startup and the wisdom of seasoned experts, the team has achieved numerous world-firsts including the industry's first fully humanless commercial truckload.
About the role
We are seeking a Senior ML/RL Engineer to join the Algo team and drive development of a unified behavioral architecture. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure autonomous semi-trucks navigate highways with superhuman safety and precision.
Responsibilities
- Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate the autonomous driving stack
- Lead research and implementation of advanced RL algorithms with safety metrics as primary constraints
- Collaborate cross-functionally to design robust reward functions and evaluation metrics balancing safety, progress, and comfort
- Contribute to optimization of large-scale, high-throughput training environments for complex multi-agent scenarios
- Advance neural architectures for spatial reasoning, long-horizon planning, and interaction modeling
- Work closely with Simulation and Planning teams to integrate research-grade models into production-quality software
Requirements
- Proven track record training and deploying deep RL algorithms (e.g., PPO, SAC) for real-world robotic or autonomous systems
- Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques
- MS or PhD in Computer Science, Robotics, or a related quantitative field
- Ability to diagnose and solve fundamental RL training challenges such as variance management and distribution shift
Nice to have
- Experience with constrained optimization or safety-critical learning frameworks
- Background in MARL training stability, including self-play and decentralized execution strategies
- Familiarity with vehicle dynamics and behavior planning for long-haul highway environments
Benefits & perks
- Comprehensive health insurance and paid time off
- Opportunity to work at the forefront of the autonomous trucking industry
Compensation
Competitive salary based on experience, with opportunities for performance bonuses and equity.