1d ago
Principal Model Optimization Engineer
San Mateo, CA, United States
$295.3k-$345k / year
full-timeleadgaming
๐ Tech Stack
๐ผ About This Role
You'll optimize machine learning models for performance on GPU architectures at Roblox, supporting hundreds of ML use cases and billions of inferences daily. You'll conduct low-level profiling, develop best practices, and collaborate across teams to integrate models into production. This role offers the chance to impact a platform connecting tens of millions of users worldwide.
๐ฏ What You'll Do
- Optimize ML models for performance on GPU architectures.
- Conduct low-level performance profiling and analysis.
- Develop best practices and tooling for model optimization.
- Collaborate with cross-functional teams for integration and deployment.
๐ Requirements
- 6+ years of professional experience and system design expertise.
- Experience debugging GPUs (profiling, Xid errors).
- Proficient in CUDA, Triton, TensorRT for model execution.
- Experience with LLM optimization techniques (speculative decoding, quantization).
โจ Nice to Have
- Passion for building generalized tools for model performance.
- Ability to support internal partners like data scientists.
๐ Benefits & Perks
- ๐ฐ Equity compensation
- ๐๏ธ Flexible remote days (Monday/Friday optional in office)
- ๐ฅ Comprehensive benefits (as per company page)
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Callยท 30 min
- 2Technical Screenยท 60 min
- 3Onsite Interviewsยท 4-5 hours
0 0 0