10h ago

ML Runtime Optimization Engineer

Sunnyvale, California, United States

$159.1k-$199.3k / year

full-time · senior

💼 About This Role

You'll drive ML performance optimization for on-road and off-road ADAS/AD stacks, targeting deployment on embedded compute platforms. Your work will directly impact the efficiency and latency of model inference for real-world autonomous systems. This role offers deep technical challenges across the full ML framework stack.

🎯 What You'll Do

  • Drive ML performance optimization on multiple technologies for ADAS/AD stacks
  • Develop compute usage strategies to optimize efficiency and latency
  • Work on model pruning and quantization for memory-constrained platforms
  • Profile model performance on target embedded compute platforms

📋 Requirements

  • Bachelor's degree in Electrical Engineering or Computer Science
  • 3+ years of experience with ML accelerators and GPU, CPU, or SoC architecture
  • Strong embedded programming skills
  • Experience profiling and optimizing model performance on embedded compute platforms

✨ Nice to Have

  • M.Sc. or PhD in an ML-related area
  • Built an ML optimization framework from scratch
  • Deployed ML solutions to embedded chips for real-time robotics

๐ŸŽ Benefits & Perks

  • ๐Ÿ–๏ธ Paid time off
  • ๐Ÿฅ Health, dental, vision, life and disability insurance
  • ๐Ÿ’ฐ Equity in the form of options and/or restricted stock units
  • ๐Ÿ“š Learning and wellness stipends
  • ๐Ÿฆ 401k retirement benefits with employer match

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. Onsite Interview · 4 hours