Senior Platform Engineer

Toronto, Ontario, Canada (Hybrid)
Full-time, Senior, Hybrid, Incident management software

Description

You'll build and scale the infrastructure behind Rootly's incident management platform, working on observability, CI/CD, and reliability for high-profile customers. You'll own systems that underpin incident response and on-call, collaborating with product teams to improve performance and developer experience.

Requirements

  • 5+ years in an SRE, Platform, DevOps, or Infrastructure Engineering role
  • 5+ years writing production software
  • Strong knowledge of cloud infrastructure, distributed systems, and reliability
  • Proficiency with observability, performance tuning, and scaling
  • Experience supporting web or RPC services at scale

Responsibilities

  • Embed with product teams to enhance observability, reliability, and performance
  • Own CI/CD pipelines, observability tooling, monitoring, and incident response
  • Build tools and automation to eliminate manual toil and improve developer experience
  • Architect and scale infrastructure for performance and operational excellence
  • Define and manage SLOs and error budgets with engineering teams