11h ago

Lead ML Inference Engineer, Advertising

San Jose, CA

$246.5k-$486.1k / year

full-time · lead · Hybrid · software

💼 About This Role

You'll architect and lead the development of a state-of-the-art ML inference platform for Roku's advertising ecosystem, handling low-latency, high-throughput serving. Your work will directly optimize performance across hardware, software, and models while mentoring engineers and driving innovation.

🎯 What You'll Do

  • Lead design of a SOTA inference platform for ad systems
  • Oversee monitoring and observability tooling for ML services
  • Resolve performance bottlenecks and system inefficiencies
  • Incorporate advances in inference frameworks and hardware acceleration

📋 Requirements

  • MS/PhD in CS, ECE, or related field
  • 10+ years in large-scale distributed systems, including 5+ in leadership
  • Strong programming in high-performance languages (C++/Rust)
  • Deep expertise in inference frameworks and ML system deployment

✨ Nice to Have

  • Experience with GPU acceleration and HPC techniques
  • Contributions to open-source ML or systems projects
  • Experience leading global, cross-functional teams

🎁 Benefits & Perks

  • 🏖️ Paid time off
  • 🏥 Healthcare (medical, dental, vision)
  • 📈 Equity awards
  • 👶 Parental leave
  • 💰 401(k)/pension

📨 Hiring Process

Estimated timeline: 3-5 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Phone Screen · 60 min
  3. On-site (4-5 rounds) · 4-5 hours