Site Reliability Engineer

London

$55k-$63k / year

Full-time · Mid-level · Hybrid · Software

💼 About This Role

You'll join Trainline's Reliability & Operations Engineering team to keep our platform observable, reliable, and scalable. You'll participate in incident response, build observability using metrics and traces, and collaborate with product engineering teams. This role offers exposure to a high-traffic AWS cloud-native platform and a supportive senior team.

🎯 What You'll Do

  • Participate in production incident response and service restoration
  • Design and maintain observability using metrics, logs, and traces
  • Support AWS-hosted infrastructure with IaC and CI/CD tooling
  • Collaborate with product engineering teams on operational readiness

📋 Requirements

  • Experience with SRE concepts such as SLIs, SLOs, and error budgets
  • Hands-on experience with observability tooling like New Relic, ELK, or Grafana
  • Experience working with cloud providers (preferably AWS)
  • Experience scripting in at least one language (preferably Python)

✨ Nice to Have

  • Experience with build, deployment & configuration management tooling such as GitHub Actions and Terraform
  • Understanding of load balancing and reverse proxy concepts
  • Familiarity with application architecture concepts (threading, queuing, circuit breakers)

🎁 Benefits & Perks

  • 🏥 Private healthcare & dental insurance
  • ✈️ 28-day Work from Abroad policy
  • 📈 2-for-1 share purchase plans
  • 🚗 EV Scheme to reduce carbon emissions
  • 📚 Personal learning budgets and regular learning days