1d ago
Staff Infrastructure Engineer, Cluster Infrastructure
London, UK
$325k-$485k / year
full-timeleadai-ml Visa Sponsor
๐ Tech Stack
๐ผ About This Role
You'll set the technical direction for agent-driven cluster lifecycle management at Anthropic, owning the strategy for provisioning, updates, and decommissioning of compute clusters. You'll partner across teams to ensure new compute capacity is ingested on time and collaborate with cloud providers to shape long-term infrastructure strategy. This role offers the chance to scale compute faster than almost any company in the world.
๐ฏ What You'll Do
- Own technical strategy for cluster lifecycle management
- Partner across teams to ingest new compute capacity on time
- Collaborate on high-bandwidth inter-cluster connectivity
- Define strategy on cluster scalability, homogeneity, and fault tolerance
๐ Requirements
- Deep expertise in distributed systems, reliability, and cloud platforms
- Strong proficiency in at least one systems language (Rust, Go, or Python)
- Track record of leading complex, multi-quarter technical initiatives
- Ability to build alignment across senior stakeholders
โจ Nice to Have
- Experience operating large-scale compute infrastructure at hyperscale
- Depth in Kubernetes internals or cluster orchestration systems
- Experience with cloud networking and cluster security
๐ Benefits & Perks
- ๐๏ธ Generous PTO
- ๐ฅ Comprehensive health insurance
- ๐ Equity grants
- ๐ฐ Competitive salary
- ๐ Visa sponsorship
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Onsite Interviewsยท 4 hours
0 0 0