21h ago
Staff Site Reliability Engineer
Bengaluru
✨ $120k-$160k / yearest.
full-timesenior Hybridcybersecurity
🛠 Tech Stack
💼 About This Role
You'll own reliability across a cloud-native and AI-driven platform, working at the intersection of distributed systems, Kubernetes operations, and LLM-powered automation. You'll build systems that not only scale but also think and fix themselves, impacting Saviynt's AI-powered identity platform trusted by Fortune 500 companies.
🎯 What You'll Do
- Own uptime, reliability, and performance of services on AWS + Kubernetes (EKS)
- Design and implement self-healing infrastructure using automation and AI agents
- Build LLM-powered operational tooling using APIs like OpenAI API
- Manage and scale Kubernetes workloads, deployments, and autoscaling
- Build and evolve observability systems with Prometheus, Grafana, ELK, OpenTelemetry
📋 Requirements
- 8+ years in SRE / DevOps / Platform Engineering
- Strong hands-on experience with AWS infrastructure at scale
- Strong hands-on experience with Kubernetes (production-grade clusters)
- Proven ability to debug complex distributed systems under pressure
- Strong coding skills in Python or Go
✨ Nice to Have
- Experience working with LLM APIs such as the OpenAI API
- Familiarity with agent frameworks like LangChain or AutoGen
- Exposure to AIOps or intelligent automation systems
0 0 0