1d ago

Lead Product Reliability Engineer

London

$120k-$170k / year

full-timelead Hybridai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll own the reliability standard for the whole company, working across the full stack from cloud architecture to database migrations. You'll plan and execute infrastructure migrations and be the last line of defence when things break. This is a chance to join a fast-growing AI company where you'll ship code and work with a high-ownership team.

๐ŸŽฏ What You'll Do

  • Plan and execute infrastructure migrations with zero downtime.
  • Diagnose and fix production incidents, then prevent recurrence.
  • Use logs, metrics, and user reports to identify reliability improvements.
  • Build and maintain scripts, workflows, and cloud architecture.
  • Review new features pre-release to ensure reliability bar is met.

๐Ÿ“‹ Requirements

  • Solid, expert-level TypeScript skills.
  • Serious cloud experience (GCP or equivalent) at depth.
  • Deep database knowledge beyond querying (design choices).
  • Proven production ownership (on-call, incident response).
  • Fullstack capability to design and build features.

โœจ Nice to Have

  • Experience with usage-based billing migration.
  • Familiarity with AI/ML infrastructure.

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary up to ยฃ170k + equity
  • ๐Ÿ–๏ธ Flexible hybrid working (Mon-Thu in office, Fri WFH)
  • ๐Ÿ“ˆ High-growth startup (from $0 to $30M ARR in months)
  • ๐Ÿข Central London office (Chancery Lane)

๐Ÿ“จ Hiring Process

Estimated timeline: 2-3 weeks

  1. 1Personal interviewยท 30 min
  2. 2Live problem solving (remote)ยท 60 min
  3. 3Coding challenge (take-home)ยท 60 min
  4. 4Lunch with product reliability teamยท 60 min
0 0 0