11h ago
Lead ML Inference Engineer, Advertising
San Jose, CA
$246.5k-$486.1k / year
full-timelead Hybridsoftware
๐ Tech Stack
๐ผ About This Role
You'll architect and lead the development of a state-of-the-art ML inference platform for Roku's advertising ecosystem, handling low-latency, high-throughput serving. Your work will directly optimize performance across hardware, software, and models while mentoring engineers and driving innovation.
๐ฏ What You'll Do
- Lead design of a SOTA inference platform for ad systems
- Oversee monitoring and observability tooling for ML services
- Resolve performance bottlenecks and system inefficiencies
- Incorporate advances in inference frameworks and hardware acceleration
๐ Requirements
- MS/PhD in CS, ECE, or related field
- 10+ years in large-scale distributed systems, 5+ in leadership
- Strong programming in high-performance languages (C++/Rust)
- Deep expertise in inference frameworks and ML system deployment
โจ Nice to Have
- Experience with GPU acceleration and HPC techniques
- Contributions to open-source ML or systems projects
- Experience leading global, cross-functional teams
๐ Benefits & Perks
- ๐๏ธ Paid time off
- ๐ฅ Healthcare (medical, dental, vision)
- ๐ Equity awards
- ๐ถ Parental leave
- ๐ฐ 401(k)/pension
๐จ Hiring Process
Estimated timeline: 3-5 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Phone Screenยท 60 min
- 3On-site (4-5 rounds)ยท 4-5 hours
0 0 0