8h ago

Member of Technical Staff - Distributed Systems

San Francisco

$150k-$350k / year

full-time, senior, ai-ml

💼 About This Role

You'll build the core platform that schedules, routes, and operates AI workloads reliably at production scale. You'll work on distributed systems that coordinate execution across thousands of nodes, and ensure workloads run predictably under real-world load and failure conditions.

🎯 What You'll Do

  • Design and build distributed systems for AI workloads at scale
  • Develop scheduling, routing, and resource management components
  • Build production-grade APIs and control planes
  • Implement mechanisms for reliability, availability, and fault tolerance

📋 Requirements

  • Solid software engineering fundamentals
  • Production experience with distributed systems
  • Ability to reason about concurrency and failure modes in large-scale systems

✨ Nice to Have

  • Experience with Kubernetes or adjacent systems
  • Experience with RPC or asynchronous messaging architectures
  • Familiarity with scheduling or resource management systems

🎁 Benefits & Perks

  • 💰 Competitive compensation including equity