Staff / Senior Software Engineer, Cloud Inference
San Francisco, CA | Seattle, WA
Full-time | Senior | Artificial Intelligence
Description
You will design and build the infrastructure that serves Claude across multiple cloud service providers (CSPs), optimizing inference cost and performance at massive scale. You will also collaborate with CSP partner teams, develop CI/CD systems, and drive capacity planning to ensure reliable, cost-effective inference for millions of users.
Requirements
- Significant software engineering experience in large-scale distributed systems
- Experience with at least one major cloud platform (AWS, GCP, or Azure)
- Experience with Kubernetes, Infrastructure as Code, or container orchestration
- Strong interest in inference
- Autonomous and self-driven with end-to-end ownership
Responsibilities
- Design and build infrastructure for Claude across multiple CSPs
- Collaborate with CSP partner teams to resolve issues and influence roadmaps
- Design CI/CD automation systems for reliable model deployments
- Design interfaces and tooling abstractions for cost-effective inference management
- Optimize inference cost and performance via workload placement and routing