3h ago
Staff Backend Engineer, ML Inference Systems
New York, NY, USA
$192.6k-$271.9k / year
full-timeseniorgaming
๐ Tech Stack
๐ผ About This Role
You'll design and operate distributed systems powering billions of daily ML inference requests, driving low-latency serving infrastructure. Join us to shape how billions of gaming experiences are discovered and monetized at scale.
๐ฏ What You'll Do
- Design and deploy backend services for large-scale model inference.
- Drive technical direction of low-latency, high-throughput serving infrastructure.
- Partner with ML engineers to scale serving infrastructure.
- Ensure reliability and efficiency using Prometheus and Grafana.
๐ Requirements
- 5+ years designing and maintaining distributed systems at scale.
- Expertise in Golang for high-performance backend infrastructure.
- Hands-on experience with GCP and Kubernetes.
- Strong grounding in Prometheus and Grafana.
โจ Nice to Have
- Experience with NVIDIA Triton Inference Server.
- Familiarity with auction mechanics or bidding systems in ad tech.
- Experience embracing AI as a strategic advantage in engineering.
๐ Benefits & Perks
- ๐ฅ Comprehensive health insurance
- ๐๏ธ Generous vacation and personal days
- ๐ Employee stock ownership
- ๐ช Parental leave and family-care programs
- ๐ง Mental Health and Wellbeing programs
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Phone Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3System Design Interviewยท 60 min
- 4Team Fit Interviewยท 45 min
- 5Offerยท N/A
0 0 0