10h ago
ML Runtime Optimization Engineer
Sunnyvale, California, United States
$159.1k-$199.3k / year
full-timesenior
๐ Tech Stack
๐ผ About This Role
You'll drive ML performance optimization for on-road and off-road ADAS/AD stacks, targeting deployment on embedded compute platforms. Your work will directly impact the efficiency and latency of model inference for real-world autonomous systems. This role offers deep technical challenges across the full ML framework stack.
๐ฏ What You'll Do
- Drive ML performance optimization on multiple technologies for ADAS/AD stacks
- Develop compute usage strategies to optimize efficiency and latency
- Work on model pruning and quantization for memory constrained platforms
- Profile model performance on target embedded compute platforms
๐ Requirements
- Bachelors in Electrical Engineering or Computer Science
- 3+ years of experience with ML accelerators, GPU, CPU, SoC architecture
- Strong embedded programming skills
- Experience profiling and optimizing model performance on embedded compute platforms
โจ Nice to Have
- M.Sc or PhD in a ML related area
- Built an ML optimization framework from scratch
- Deployed ML solutions to embedded chips for real time robotics
๐ Benefits & Perks
- ๐๏ธ Paid time off
- ๐ฅ Health, dental, vision, life and disability insurance
- ๐ฐ Equity in the form of options and/or restricted stock units
- ๐ Learning and wellness stipends
- ๐ฆ 401k retirement benefits with employer match
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Onsite Interviewยท 4 hours
0 0 0