Software Engineer, ML Infra & Distributed Systems
San Francisco, CA; Los Angeles, CA; New York, NY (Hybrid); USA - Remote
$227.2k-$417k / year
Full-time · Lead · Remote · Streaming entertainment
💼 About This Role
You'll build scalable, low-latency ML inference platforms for personalization, search, and content understanding at Tubi. You'll improve deployment and operations, lead cross-functional projects, and transform ML capabilities. This role offers architectural freedom to explore new frameworks.
🎯 What You'll Do
- Design and build scalable, high-throughput, low-latency distributed systems using Scala.
- Build reusable components for ML applications like Personalization, Search, and Ads.
- Partner with ML engineers to develop scalable solutions for their challenges.
- Optimize latency, cost, and efficiency of ML infrastructure through data-driven analysis.
📋 Requirements
- Experience designing and building scalable distributed systems with a modern backend language (e.g., Scala, Java, Python, Go, C++).
- Strong experience with AWS or equivalent cloud platform.
- Experience building online microservices at scale with low-latency serving.
- Experience with SQL/NoSQL databases, message brokers, and caches (e.g., Postgres, Cassandra, Kafka, Redis).
✨ Nice to Have
- Familiarity with ML inference engines (e.g., TorchServe, Triton, vLLM) and vector stores (e.g., LanceDB, FAISS).
- Experience with Recommender Systems, Search, Autocomplete, or Ads ML.
- Proficiency in data-driven analysis of complex A/B testing results.
🎁 Benefits & Perks
- 💵 Annual bonus and long-term incentive plan.
- 🏥 Medical/dental/vision insurance and 401(k) plan.
- 🏖️ Flexible Time Off policy for salaried employees.
- 👶 12 weeks paid parental leave for bonding after birth/adoption.
- 💪 Monthly wellness reimbursement.