3h ago
Senior / Principal Inference Engineer - ML Platform
San Mateo, CA, United States
$327,000-$397,460 / year
full-timeseniorGaming / Technology
Tech Stack
Description
You will build the next generation of ML Ecosystem Tooling for model inference at Roblox, supporting billions of requests per day. You'll set technical strategy, optimize inference stacks, and partner across organizations to make ML tooling delightful to use.
Requirements
- 4+ years professional experience with system design
- Experience building distributed systems for real-time ML inference serving at millions of QPS
- Experience debugging infrastructure-level performance issues for low latency, high throughput
- Bachelor's degree in Computer Science or related field
- Familiarity with ML inference frameworks like Triton Inference Server, TensorRT, KServe
Responsibilities
- Set technical strategy and oversee development of high-scale inference infrastructure
- Optimize performance from model to infrastructure level
- Stay abreast of industry trends in ML and infrastructure
- Bootstrap and maintain ML Platform components: Serving Layer, Metadata Store, Model Registry, Pipeline Orchestrator
- Partner across organizations to build tooling, interfaces, and visualizations
0 views 0 saves 0 applications