17h ago
Site Reliability Manager
Munich
โจ $95k-$135k / yearest.
full-timemid Hybrid
๐ Tech Stack
๐ผ About This Role
You'll lead fleet observability and automated recovery paths for a global IoT platform. You'll ensure the system scales reliably and reduce operational toil through automation. This role offers the chance to shape platform capabilities and SLOs.
๐ฏ What You'll Do
- Monitor system health across hardware/IoT fleet
- Design automated recovery paths and improvements
- Build observability tools and dashboards
- Define SLOs and platform features for fleet health
๐ Requirements
- Degree in Computer Science, Mechatronics, or related
- 2-4 years SRE or Technical Operations experience
- Advanced knowledge of Python and SQL
- Experience with ELK, Prometheus, or Grafana
โจ Nice to Have
- Experience with complex IoT or distributed systems
- Data analysis skills with Metabase
- German language skills
๐ Benefits & Perks
- ๐๏ธ Hybrid work model with Anchor Days
- โ๏ธ Workation from inspiring locations
- ๐ฒ Mobility subsidy (bike leasing or travel allowance)
- ๐ OKR-driven measurable goals
- ๐ Catering with coffee, fruit, and online cafeteria
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Callยท 30 min
- 2Technical Interviewยท 60 min
- 3Team Interviewยท 45 min
0 0 0