7h ago

Software Engineer, Fleet Infrastructure

San Francisco

$230k-$490k / year

full-time Hybridai-ml

🛠 Tech Stack

💼 About This Role

You'll design, build, and operate infrastructure systems for model deployment and training on one of the world's largest GPU fleets. Your work directly enables AI research at massive scale. This role offers the opportunity to shape critical systems at OpenAI.

🎯 What You'll Do

  • Design and operate compute fleet components like job scheduling and cluster management
  • Interface with researchers and product teams to understand workload requirements
  • Collaborate on providing high utilization and reliability services

📋 Requirements

  • Experience with hyperscale compute systems
  • Strong programming skills
  • Experience with Kubernetes
  • Experience with public clouds (especially Azure)

✨ Nice to Have

  • Understanding of AI/ML workloads
  • Execution-focused mentality with rigorous focus on user requirements

🎁 Benefits & Perks

  • 💰 Competitive salary range $230K–$490K
  • 📊 Equity offers
  • 🔄 Relocation assistance
  • 🏢 Hybrid work model (3 days in office)
0 0 0