Associate Manager - Reliability Operations
Hyderabad
full-time, lead, finance
💼 About This Role
You'll lead a team that upholds service level objectives (SLOs) for a cloud-native banking platform at a $2B-valued fintech. Your core impact is ensuring production stability and driving SLO adherence in a high-stakes 24x7 SaaS environment.
🎯 What You'll Do
- Drive SLO adherence through advanced metric monitoring and error budgets.
- Ensure alerts are acknowledged immediately and escalated to SRE teams.
- Coordinate standard deployments across sites following SRE sign-off.
- Mentor staff on SLO-driven decision-making and audit alert workflows.
📋 Requirements
- 6-8 years of experience in operations support or reliability operations
- 2+ years in supervisory roles managing 24x7 teams
- Proficiency in monitoring tools (Prometheus, Grafana, Splunk, PagerDuty)
- Bachelor's degree in IT or related field
✨ Nice to Have
- Familiarity with ITIL frameworks and SRE principles
- Experience with cloud platforms (AWS, Azure, GCP)
- Relevant IT certifications (e.g., ITIL Foundation)