16h ago

Engineering Manager, Production Engineering

San Francisco, CA - US

$209k-$253k / year

full-timeseniorai-ml

🛠 Tech Stack

💼 About This Role

You'll lead a team of SREs embedded across Crusoe's AI infrastructure offerings, driving reliability improvements and owning production health of services like Kubernetes and Inference. You'll split your time between hands-on technical work and team leadership, setting technical direction and fostering a culture of ownership.

🎯 What You'll Do

  • Lead and grow a team of SREs embedded in AI product areas
  • Contribute as an IC by building tooling and automation
  • Own SLA/SLO performance, incident response, and on-call health
  • Partner with product and platform teams on infrastructure design

📋 Requirements

  • 5+ years of software or infrastructure engineering experience
  • 1-2 years in an engineering management or tech lead role
  • Strong SRE or production engineering background
  • Solid coding ability in Go or Python for tooling and automation

✨ Nice to Have

  • Experience with GPU infrastructure or AI/ML workloads
  • Background in HPC orchestration tools like Slurm or Ray
  • Prior experience at a cloud provider or AI infrastructure company

🎁 Benefits & Perks

  • 💵 Restricted Stock Units in a fast-growing company
  • 🏥 Health insurance (HDHP/PPO, vision, dental) with HSA contributions
  • 👶 Paid Parental Leave
  • 🏖️ Generous PTO and holiday schedule
  • 📱 Cell phone reimbursement and tuition reimbursement
0 0 0