3h ago

Staff / Senior Software Engineer, Cloud Inference

San Francisco, CA | Seattle, WA
full-timeseniorArtificial Intelligence

Tech Stack

Description

You will design and build infrastructure to serve Claude across multiple cloud service providers, optimizing inference cost and performance at massive scale. Collaborate with CSP partner teams, develop CI/CD systems, and drive capacity planning to ensure reliable, cost-effective inference for millions of users.

Requirements

  • Significant software engineering experience in large-scale distributed systems
  • Experience with at least one major cloud platform (AWS, GCP, or Azure)
  • Experience with Kubernetes, Infrastructure as Code, or container orchestration
  • Strong interest in inference
  • Autonomous and self-driven with end-to-end ownership

Responsibilities

  • Design and build infrastructure for Claude across multiple CSPs
  • Collaborate with CSP partner teams to resolve issues and influence roadmaps
  • Design CI/CD automation systems for reliable model deployments
  • Design interfaces and tooling abstractions for cost-effective inference management
  • Optimize inference cost and performance via workload placement and routing
0 views 0 saves 0 applications