16h ago
Site Reliability Engineer Lead
Brazil
$80k-$140k / year (est.)
full-time · lead · Remote · software
About This Role
You'll build and lead a high-impact SRE function responsible for platform reliability, observability, and incident management at scale. You'll combine technical depth with strong people leadership, guiding a team that defines observability standards, SLOs, and reliability practices. This role shapes both strategy and execution in a fast-paced, cloud-native environment.
What You'll Do
- Mentor and develop a high-performing SRE team
- Define SRE strategy, roadmap, and priorities
- Establish observability standards across systems
- Drive adoption of SLIs, SLOs, and error budgets
Requirements
- Proven experience leading SRE or Cloud Engineering teams
- Strong hands-on experience with SLIs, SLOs, and error budgets
- Experience with observability tools like Prometheus, Grafana, or Datadog
- Solid knowledge of telemetry systems (metrics, logs, traces)
Nice to Have
- Experience with FinOps or chaos engineering
- Experience with AIOps or large-scale distributed systems
Benefits & Perks
- Flexible working hours
- Home office support and equipment provision
- Learning and development support
- Birthday day off
- Stock options and performance-based bonuses
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Hiring Manager Interview · 45 min