16h ago

Associate Manager - Reliability Operations

Hyderabad
full-timeleadfinance

🛠 Tech Stack

💼 About This Role

You'll lead a team to uphold service level objectives for a cloud-native banking platform at a $2B-valued fintech. Your core impact is ensuring production stability and driving SLO adherence in a high-stakes 24x7 SaaS environment.

🎯 What You'll Do

  • Drive SLO adherence through advanced metric monitoring and error budgets.
  • Ensure immediate alert acknowledgment and escalate to SRE teams.
  • Coordinate standard deployments across sites following SRE sign-off.
  • Mentor staff on SLO-driven decision-making and audit alert workflows.

📋 Requirements

  • 6-8 years in operations support or reliability operations
  • 2+ years in supervisory roles managing 24x7 teams
  • Proficiency in monitoring tools (Prometheus, Grafana, Splunk, PagerDuty)
  • Bachelor's degree in IT or related field

✨ Nice to Have

  • Familiarity with ITIL frameworks and SRE principles
  • Experience with cloud platforms (AWS, Azure, GCP)
  • Relevant IT certifications (e.g., ITIL Foundation)
0 0 0