2d ago
Senior Manager, Engineering
Dublin, Ireland
โจ $200k-$260k / yearest.
full-timeseniorai-ml
๐ Tech Stack
๐ผ About This Role
You'll lead a production engineering team ensuring the reliability of Crusoe's GPU infrastructure. You'll own incident response, monitoring, and automation across a fast-scaling cloud environment. This role offers significant strategic impact at a pivotal growth stage.
๐ฏ What You'll Do
- Manage and coach a team of production engineers across shifts.
- Lead postmortems and drive down MTTR for high-severity incidents.
- Define and monitor SLIs, SLOs, and SLAs for production systems.
- Oversee observability systems and automate to reduce toil.
๐ Requirements
- 6+ years managing 24/7 technical operations or SRE teams.
- Strong Linux and infrastructure fundamentals including Kubernetes.
- Experience with Prometheus, VictoriaMetrics, or similar monitoring tools.
- Working proficiency in Golang or Python.
โจ Nice to Have
- Experience with GPU infrastructure, HPC, or AI/ML cloud environments.
- Familiarity with infrastructure-as-code tools like Terraform or Ansible.
- Experience scaling an operations team during rapid growth.
๐ Benefits & Perks
- ๐ฅ Health Insurance
- ๐ฐ Equity
- ๐๏ธ Unlimited PTO
- ๐ 401k Matching
- ๐ฅ๏ธ Remote Work Stipend
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Hiring Manager Interviewยท 45 min
- 3Technical Interviewยท 60 min
0 0 0