1d ago

Principal Model Optimization Engineer

San Mateo, CA, United States

$295.3k-$345k / year

full-timeleadgaming

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll optimize machine learning models for performance on GPU architectures at Roblox, supporting hundreds of ML use cases and billions of inferences daily. You'll conduct low-level profiling, develop best practices, and collaborate across teams to integrate models into production. This role offers the chance to impact a platform connecting tens of millions of users worldwide.

๐ŸŽฏ What You'll Do

  • Optimize ML models for performance on GPU architectures.
  • Conduct low-level performance profiling and analysis.
  • Develop best practices and tooling for model optimization.
  • Collaborate with cross-functional teams for integration and deployment.

๐Ÿ“‹ Requirements

  • 6+ years of professional experience and system design expertise.
  • Experience debugging GPUs (profiling, Xid errors).
  • Proficient in CUDA, Triton, TensorRT for model execution.
  • Experience with LLM optimization techniques (speculative decoding, quantization).

โœจ Nice to Have

  • Passion for building generalized tools for model performance.
  • Ability to support internal partners like data scientists.

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Equity compensation
  • ๐Ÿ–๏ธ Flexible remote days (Monday/Friday optional in office)
  • ๐Ÿฅ Comprehensive benefits (as per company page)

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Callยท 30 min
  2. 2Technical Screenยท 60 min
  3. 3Onsite Interviewsยท 4-5 hours
0 0 0