17h ago

Site Reliability Manager

Munich

✨ $95k-$135k / year (est.)

Full-time · Mid-level · Hybrid

💼 About This Role

You'll lead fleet observability and design automated recovery paths for a global IoT platform. You'll make sure the system scales reliably and reduce operational toil through automation. The role also offers the chance to shape platform capabilities and SLOs.

🎯 What You'll Do

  • Monitor system health across hardware/IoT fleet
  • Design automated recovery paths and improvements
  • Build observability tools and dashboards
  • Define SLOs and platform features for fleet health

📋 Requirements

  • Degree in Computer Science, Mechatronics, or a related field
  • 2-4 years SRE or Technical Operations experience
  • Advanced knowledge of Python and SQL
  • Experience with ELK, Prometheus, or Grafana

✨ Nice to Have

  • Experience with complex IoT or distributed systems
  • Data analysis skills with Metabase
  • German language skills

🎁 Benefits & Perks

  • 🏖️ Hybrid work model with Anchor Days
  • ✈️ Workation from inspiring locations
  • 🚲 Mobility subsidy (bike leasing or travel allowance)
  • 📊 OKR-driven measurable goals
  • 🍎 Catering with coffee, fruit, and online cafeteria

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Call · 30 min
  2. Technical Interview · 60 min
  3. Team Interview · 45 min
