Senior Site Reliability Engineer
Greater Toronto Area, Ontario
$165k-$185k / year
full-time · senior · education
About This Role
You'll join a high-leverage Infrastructure team that owns the Kubernetes clusters and GitOps pipelines for a platform serving millions of students. You'll build internal tooling in Go/Python that empowers the entire engineering organization, and you'll manage Terraform-defined AWS environments.
What You'll Do
- Own and modernize systems across EKS, ArgoCD, and AWS
- Write and maintain high-quality Terraform and Helm code
- Build Go/Python CLIs and automation for developer experience
- Participate in on-call rotations and lead incident response
Requirements
- 5+ years in SRE, Platform, or Infrastructure roles
- Deep understanding of Kubernetes internals and experience debugging complex failures
- Advanced proficiency in AWS (IAM, Networking, EKS) and Terraform
- Ability to write clean, maintainable code in Go or Python
Nice to Have
- Experience with GitOps workflows using ArgoCD
- Hands-on experience profiling or optimizing Node.js/TypeScript services
- Knowledge of Service Mesh architectures or Kubernetes Gateway API
Benefits & Perks
- Flexible remote-friendly environment
- Culture of learning with emphasis on postmortems
- Small but mighty senior team with outsized impact
- Modern stack including ArgoCD, Kubernetes Gateway API, Drata
Hiring Process
Estimated timeline: 2-3 weeks · AI estimate
1. Recruiter screen · 30 min
2. Hiring manager interview · 45 min
3. Technical interview · 60 min