2d ago
Customer Reliability Manager
San Francisco
โจ $180k-$250k / yearest.
full-timeseniorai-ml
๐ Tech Stack
๐ผ About This Role
You'll lead a team of Customer Reliability Engineers providing high-touch support for Braintrust's AI observability platform, focusing on hybrid, BYOC, and SaaS deployments. Your core impact is reducing friction for developers building LLM-powered applications. You'll also own incident response and mentor senior engineers.
๐ฏ What You'll Do
- Lead and grow a team of Customer Reliability Engineers.
- Own the primary after-hours on-call rotation for customer-reported SEV1s.
- Run incident response and escalation, including hands-on for high-severity issues.
- Lead new BYOC deployments and upgrades.
๐ Requirements
- 5โ10+ years of experience leading support for developer-facing products.
- Deep familiarity with deploying Terraform, Helm, and Kubernetes infrastructure.
- Comfortable reviewing, debugging, and reasoning about backend services and infrastructure.
- Strong ownership of customer-impacting issues end-to-end.
โจ Nice to Have
- Familiarity with OpenAI, Anthropic, or similar LLM providers at a systems level.
- Experience guiding teams working with datasets, evaluation metrics, or prompt engineering.
- Experience supporting self-hosted offerings (e.g., Terraform, Kubernetes).
๐ Benefits & Perks
- ๐ฅ Medical, dental, and vision insurance
- ๐ฝ๏ธ Daily lunch, snacks, and beverages
- ๐ด Flexible time off
- ๐ฐ Competitive salary and equity
- ๐ค AI Stipend
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Hiring Manager Interviewยท 45 min
- 3Technical Interviewยท 60 min
๐ฉ Heads Up
- On-call rotation for SEV1s may lead to burnout
- Requires 5-10+ years of leadership but also hands-on infrastructure work
0 0 0