6h ago
Site Reliability Engineer
Remote (United States)
$160k-$250k / year
full-timesenior Remotesoftware
🛠 Tech Stack
+1
💼 About This Role
You'll join Replit's SRE team to ensure the reliability, scalability, and performance of infrastructure serving millions of developers. You will design observability solutions, drive automation and infrastructure as code, and establish SLOs and SLIs to maintain high availability.
🎯 What You'll Do
- Design and implement observability solutions with monitoring and alerting
- Drive automation and infrastructure as code using Terraform, Ansible, or Pulumi
- Establish SLOs and SLIs in collaboration with product and engineering teams
- Lead incident response efforts and conduct post-mortems
📋 Requirements
- 4-8 years of experience in Site Reliability Engineering or similar roles
- Strong programming skills in Python or Go
- Deep understanding of distributed systems
- Experience with Kubernetes and cloud-native technologies
✨ Nice to Have
- Experience with Google Cloud Platform (GCP)
- Knowledge of Prometheus, Grafana, or Datadog
🎁 Benefits & Perks
- 💰 Competitive Salary & Equity
- 📈 401(k) Program with 4% match
- ⚕️ Health, Dental, Vision and Life Insurance
- 🚼 Paid Parental, Medical, Caregiver Leave
- 🏝️ Flexible Time Off (FTO) + Holidays
0 0 0