20h ago

SRE Partner

Brazil

โœจ $150k-$200k / yearest.

full-timesenior Remotesoftware

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

At the intersection of Site Reliability Engineering and product teams, you'll act as a strategic embedded partner within high-impact areas, bringing reliability practices directly into product engineering workflows. You will ensure systems are scalable, observable, and resilient at massive data scale, working closely with product, platform, and infrastructure teams to translate reliability goals into actionable outcomes.

๐ŸŽฏ What You'll Do

  • Embed SRE practices within product areas, driving reliability maturity
  • Define and implement SLOs, SLIs, error budgets, and incident management
  • Develop reliability maturity models and identify operational gaps
  • Promote platform capabilities like observability, canary deployments, and feature flags

๐Ÿ“‹ Requirements

  • Proven experience in Site Reliability Engineering or similar infrastructure roles
  • Strong knowledge of Google Cloud Platform (GCP)
  • Hands-on experience with Kubernetes and distributed systems
  • Expertise in Terraform and Infrastructure as Code

โœจ Nice to Have

  • Experience with observability tools like Prometheus, Grafana, or Elasticsearch
  • Experience with incident management and post-mortem processes

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive compensation aligned with market standards
  • ๐Ÿ  Remote-first and flexible work environment across Brazil
  • ๐Ÿš€ Work on large-scale, high-traffic systems with advanced engineering challenges
  • ๐ŸŒ Exposure to cutting-edge AI-driven engineering platforms
  • ๐Ÿ“ˆ Career growth opportunities in platform engineering and reliability leadership

๐Ÿ“จ Hiring Process

Estimated timeline: 2-3 weeks ยท AI estimate

  1. 1Recruiter phone screenยท 30 min
  2. 2Technical interviewยท 60 min
  3. 3Hiring manager interviewยท 45 min
0 0 0