23h ago

Senior Site Reliability Engineer

New York, NY

$200k-$260k / yearest.

full-timesenior Remotesoftware

🛠 Tech Stack

💼 About This Role

You'll own the reliability and scalability of our SaaS infrastructure and architect for agentic AI workloads. You'll shape the developer platform used by engineering teams worldwide, operating at scale with deep observability.

🎯 What You'll Do

  • Define SLOs and drive capacity planning.
  • Design infrastructure on GCP and AWS using Terraform.
  • Evolve incident management and on-call practices.
  • Mentor engineers on operational thinking.

📋 Requirements

  • 5+ years operating cloud infrastructure (GCP/AWS).
  • Experience with Terraform and Kubernetes.
  • Proficiency in TypeScript, Java, Go, or Python.

✨ Nice to Have

  • Interest in LLM-based systems or agentic workloads.
  • Experience with distributed systems first principles.

🎁 Benefits & Perks

  • 🏖️ Flexible work from home
  • 🏢 In-person meetings in NYC office
  • 💡 Work with AI-native workflows
0 0 0