6h ago

Site Reliability Engineer

Bengaluru, India

✨ $35k-$50k / yearest.

full-timemid Hybridfinance

πŸ›  Tech Stack

πŸ’Ό About This Role

You'll design resilient systems and define SLOs that reflect customer experience. Use Datadog and CloudWatch to build signal-heavy observability and improve the incident lifecycle. You'll combine software fundamentals with reliability thinking to keep systems highly available and easy to debug.

🎯 What You'll Do

  • Design systems with resilience, graceful degradation, and capacity planning.
  • Define and measure SLOs and SLIs that reflect customer experience.
  • Build observability using Datadog and CloudWatch for alerting and monitoring.
  • Continuously improve the incident lifecycle from detection to blameless follow-ups.

πŸ“‹ Requirements

  • 3+ years of experience in an SRE or Software Engineering role.
  • Hands-on coding experience in two programming languages.
  • Experience managing production environments with observability tools.
  • Experience using SLOs and SLIs to guide decisions and prioritize work.

✨ Nice to Have

  • Experience with AI-assisted development tools like GitHub Copilot or Cursor.
  • Built or contributed to agentic AI workflows for runbook automation or alert triage.
  • Familiarity with incident.io or similar incident management platforms.

🎁 Benefits & Perks

  • πŸ₯ Healthcare coverage
  • πŸ“± Internet/cell phone reimbursement
  • πŸ“š Learning and development stipend
  • ✈️ Opportunities to travel to Palo Alto HQ and Bangkok Site

πŸ“¨ Hiring Process

Estimated timeline: 2-4 weeks Β· AI estimate

  1. 1Recruiter CallΒ· 30 min
  2. 2Technical InterviewΒ· 60 min
  3. 3On-site InterviewΒ· half day
0 0 0