16h ago

Site Reliability Engineer Lead

Brazil

โœจ $80k-$140k / yearest.

full-timelead Remotesoftware

๐Ÿ›  Tech Stack

+1

๐Ÿ’ผ About This Role

You'll build and lead a high-impact SRE function responsible for platform reliability, observability, and incident management at scale. You'll combine technical depth with strong people leadership, guiding a team that defines observability standards, SLOs, and reliability practices. This role shapes both strategy and execution in a fast-paced, cloud-native environment.

๐ŸŽฏ What You'll Do

  • Mentor and develop a high-performing SRE team
  • Define SRE strategy, roadmap, and priorities
  • Establish observability standards across systems
  • Drive adoption of SLIs, SLOs, and error budgets

๐Ÿ“‹ Requirements

  • Proven experience leading SRE or Cloud Engineering teams
  • Strong hands-on experience with SLIs, SLOs, and error budgets
  • Experience with observability tools like Prometheus, Grafana, or Datadog
  • Solid knowledge of telemetry systems (metrics, logs, traces)

โœจ Nice to Have

  • Experience with FinOps or chaos engineering
  • Experience with AIOps or large-scale distributed systems

๐ŸŽ Benefits & Perks

  • ๐Ÿ–๏ธ Flexible working hours
  • ๐Ÿ  Home office support and equipment provision
  • ๐Ÿ“š Learning and development support
  • ๐ŸŽ‚ Birthday day off
  • ๐Ÿ“ˆ Stock options and performance-based bonuses

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Screenยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Hiring Manager Interviewยท 45 min
0 0 0