Staff Site Reliability Engineer
New York City, NY
$200k-$250k / year
full-time, lead, ai-ml
💼 About This Role
You'll lead the evolution of Tabs' AI-native revenue platform as we scale. Your work on AWS infrastructure, CI/CD, and observability will keep our systems reliable and make them a pleasure for engineering teams to build on. You'll set the standard for operational excellence across the company, partnering closely with product and engineering teams.
🎯 What You'll Do
- Define and evolve reliability standards, SLIs, SLOs, and error budgets
- Improve observability, alerting, and incident processes across services
- Lead high-severity incidents and drive clear, actionable follow-ups
- Partner with engineering teams to design resilient, scalable systems
📋 Requirements
- 10+ years in SRE, infrastructure, or backend engineering roles
- Strong software engineering experience in one or more modern languages
- Expertise operating distributed systems in production at scale
- Deep experience with AWS, observability tooling, and CI/CD systems
✨ Nice to Have
- Experience migrating from ECS/Fargate to a modern runtime
- Comfortable navigating ambiguity in a fast-moving environment
🎁 Benefits & Perks
- 🏖️ Unlimited PTO
- 🏥 Up to 100% employer-covered healthcare premiums
- 🍽️ Lunch provided via Sharebite
- 👶 Parental leave up to 12 weeks
- 🚍 Tax-free commuter and parking benefits