16h ago
Engineering Manager, Production Engineering
San Francisco, CA - US
$209k-$253k / year
full-timeseniorai-ml
🛠 Tech Stack
💼 About This Role
You'll lead a team of SREs embedded across Crusoe's AI infrastructure offerings, driving reliability improvements and owning production health of services like Kubernetes and Inference. You'll split your time between hands-on technical work and team leadership, setting technical direction and fostering a culture of ownership.
🎯 What You'll Do
- Lead and grow a team of SREs embedded in AI product areas
- Contribute as an IC by building tooling and automation
- Own SLA/SLO performance, incident response, and on-call health
- Partner with product and platform teams on infrastructure design
📋 Requirements
- 5+ years of software or infrastructure engineering experience
- 1-2 years in an engineering management or tech lead role
- Strong SRE or production engineering background
- Solid coding ability in Go or Python for tooling and automation
✨ Nice to Have
- Experience with GPU infrastructure or AI/ML workloads
- Background in HPC orchestration tools like Slurm or Ray
- Prior experience at a cloud provider or AI infrastructure company
🎁 Benefits & Perks
- 💵 Restricted Stock Units in a fast-growing company
- 🏥 Health insurance (HDHP/PPO, vision, dental) with HSA contributions
- 👶 Paid Parental Leave
- 🏖️ Generous PTO and holiday schedule
- 📱 Cell phone reimbursement and tuition reimbursement
0 0 0