Senior ML/RL Engineer, Behavior Planning

Bot Auto | Houston, TX or San Francisco Bay Area, US | Full-time

Published 2 days ago

About the company

Bot Auto is revolutionizing the transportation of goods with cutting-edge autonomous trucks. With the agility of a startup and the wisdom of seasoned experts, the team has achieved numerous world-firsts including the industry's first fully humanless commercial truckload.

About the role

We are seeking a Senior ML/RL Engineer to join the Algo team and drive development of a unified behavioral architecture. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure autonomous semi-trucks navigate highways with superhuman safety and precision.

Responsibilities

- Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate the autonomous driving stack
- Lead research and implementation of advanced RL algorithms with safety metrics as primary constraints
- Collaborate cross-functionally to design robust reward functions and evaluation metrics balancing safety, progress, and comfort
- Contribute to optimization of large-scale, high-throughput training environments for complex multi-agent scenarios
- Advance neural architectures for spatial reasoning, long-horizon planning, and interaction modeling
- Work closely with Simulation and Planning teams to integrate research-grade models into production-quality software

Requirements

- Proven track record training and deploying deep RL algorithms (e.g., PPO, SAC) for real-world robotic or autonomous systems
- Expertise in Python and PyTorch; strong understanding of modern deep learning architectures and optimization techniques
- MS or PhD in Computer Science, Robotics, or a related quantitative field
- Ability to diagnose and solve fundamental RL training challenges such as variance management and distribution shift

Nice to have

- Experience with constrained optimization or safety-critical learning frameworks
- Background in MARL training stability, including self-play and decentralized execution strategies
- Familiarity with vehicle dynamics and behavior planning for long-haul highway environments

Benefits & perks

- Comprehensive health insurance and paid time off
- Opportunity to work at the forefront of the autonomous trucking industry

Compensation

Competitive salary based on experience, with opportunities for performance bonuses and equity.