Senior Platform Engineer
Toronto, Ontario, Canada (Hybrid)
Full-time, Senior, Hybrid, Incident management software
Description
You'll build and scale the infrastructure behind Rootly's incident management platform, working on observability, CI/CD, and reliability for high-profile customers. You'll own systems that underpin incident response and on-call, collaborating with product teams to improve performance and developer experience.
Requirements
- 5+ years in an SRE, Platform, DevOps, or Infrastructure Engineering role
- 5+ years writing production software
- Strong knowledge of cloud infrastructure, distributed systems, and reliability
- Proficiency with observability, performance tuning, and scaling
- Experience supporting web or RPC services at scale
Responsibilities
- Embed with product teams to enhance observability, reliability, and performance
- Own CI/CD pipelines, observability tooling, monitoring, and incident response
- Build tools and automation to eliminate manual toil and improve developer experience
- Architect and scale infrastructure for performance and operational excellence
- Define and manage SLOs and error budgets with engineering teams