10h ago
Site Reliability Engineer
Berlin, Berlin, Germany; Remote (Europe); Stuttgart, Baden-Württemberg, Germany
✨ $80k-$120k / yearest.
full-timemid Remotesoftware
🛠 Tech Stack
+2
💼 About This Role
You'll be a key player in Flip's Platform Squad, keeping our infrastructure fast, resilient, and ready for scale. You'll shape the reliability culture, tools, and practices that enable our engineering teams to release with confidence at scale. This role is perfect for an engineer passionate about building high-throughput, highly available systems.
🎯 What You'll Do
- Scale and optimize cloud infrastructure on Azure and Kubernetes clusters.
- Design zero-downtime deployments, rollback mechanisms, and disaster recovery.
- Evolve the LGTM observability stack (Loki, Grafana, Tempo, Mimir).
- Design and optimize Infrastructure as Code with Pulumi in Go.
📋 Requirements
- 1-3 years hands-on experience as an SRE, Platform, DevOps, or Infrastructure Engineer.
- Experience operating and scaling cloud infrastructure (Azure, GCP, AWS).
- Deep knowledge of Kubernetes and container orchestration in production.
- Hands-on experience with modern observability stacks (e.g., Prometheus, Mimir, Loki).
✨ Nice to Have
- Experience with Azure Kubernetes Service (AKS).
- Experience with Kubernetes Gateway API and Envoy Gateway.
- Familiarity with GitOps workflows and CI/CD pipeline design.
🎁 Benefits & Perks
- 🏠 Remote-first work model.
- 💪 E-Gym Wellpass membership covered.
- 🚴 Job-Rad Leasing.
- 🎉 Regular team events and Culture Days.
- ✈️ Work from abroad in Europe.
📨 Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1Recruiter Call· 30 min
- 2Technical Interview· 60 min
- 3Team Fit Interview· 45 min
0 0 0