9h ago

Senior Site Reliability Engineer

Pune, India

$4800k-$8000k / year

full-time · senior · ai-ml

💼 About This Role

You'll design and operate scalable, fault-tolerant infrastructure for an AI-powered software development platform at Blitzy, a fast-growing U.S. GenAI startup with a strong Pune office. You'll ensure high availability and performance as you reduce MTTR and increase system uptime through hands-on contributions. This is a high-impact role where you'll shape the culture and technical standards of a new SRE team.

🎯 What You'll Do

Design and build fault-tolerant cloud infrastructure on AWS/GCP/Azure
Define SLOs and SLAs, and lead blameless postmortems
Maintain CI/CD pipelines and deployment automation
Own the observability stack, including Prometheus, Grafana, and Datadog
Partner with engineering teams to embed reliability practices

📋 Requirements

5+ years of SRE, DevOps, or Infrastructure Engineering experience
Strong proficiency in AWS and Kubernetes at scale
Hands-on experience with Terraform or Pulumi
Deep expertise in observability tooling and incident management

✨ Nice to Have

Experience with AI/ML workloads or GPU-accelerated infrastructure
Prior experience in a high-growth startup wearing multiple hats
Familiarity with eBPF or service mesh technologies like Istio

🎁 Benefits & Perks

💰 Competitive equity eligibility based on performance
🏋️ Everyday athlete culture promoting sleep, movement, and mental performance
🚀 Greenfield AI platform with direct influence on architectural decisions
🌍 Founding member of the Pune SRE team with growth opportunities

📨 Hiring Process

Estimated timeline: 3-5 weeks · AI estimate

1. Recruiter Screen · 30 min
2. Technical Interview · 60 min
3. System Design Interview · 60 min
4. Hiring Manager Chat · 45 min
5. Offer · 1 week

This description was AI-summarized.