2d ago

Staff Site Reliability Engineer

Berlin

✨ $180k-$220k / year (est.)

Full-time · Lead · Hybrid · Retail

🛠 Tech Stack

💼 About This Role

You'll join the Operational Excellence team at GetYourGuide, a travel experiences platform, to prevent incidents and improve reliability. You'll drive observability, cost efficiency, and a culture of operational excellence across all product teams. This role offers the chance to work with AI-powered experiences and shape reliability practices for a global company.

🎯 What You'll Do

  • Reduce incident frequency, MTTD, and MTTR
  • Lead post-incident reviews and drive systemic improvements
  • Build tooling and runbooks for production issue diagnosis
  • Advance Datadog-based observability practice with SLOs and alerts

📋 Requirements

  • Deep understanding of observability tooling, especially Datadog
  • Proven experience reducing MTTD, MTTR, and change failure rate
  • Strong coding skills in Java; comfortable with Go and frontend context
  • Experience with Kubernetes, AWS, and service mesh technologies (Istio/Envoy)

✨ Nice to Have

  • Led company-wide DORA metric improvements
  • Driven automated testing improvements reducing production incidents
  • Embedded operational excellence into product engineering teams

🎁 Benefits & Perks

  • 📚 Annual personal growth budget and mentorship programs
  • 🌍 Work from anywhere for 40 days per year
  • 💪 Health and wellness benefits, including transportation and fitness budgets
  • 🏷️ Discounts on GetYourGuide activities for you and your family
  • 🗣️ Language reimbursement program

📨 Hiring Process

Estimated timeline: 3-5 weeks · AI estimate

  1. Recruiter Call · 30 min
  2. Technical Screen · 60 min
  3. On-site Interview · 4 hours
