Staff Site Reliability Engineer

Bengaluru

$120k-$160k / year (est.)

Full-time, Senior, Hybrid, Cybersecurity

💼 About This Role

You'll own reliability across a cloud-native and AI-driven platform, working at the intersection of distributed systems, Kubernetes operations, and LLM-powered automation. You'll build systems that not only scale but also think and fix themselves, impacting Saviynt's AI-powered identity platform trusted by Fortune 500 companies.

🎯 What You'll Do

  • Own uptime, reliability, and performance of services on AWS + Kubernetes (EKS)
  • Design and implement self-healing infrastructure using automation and AI agents
  • Build LLM-powered operational tooling using APIs such as the OpenAI API
  • Manage and scale Kubernetes workloads, deployments, and autoscaling
  • Build and evolve observability systems with Prometheus, Grafana, ELK, and OpenTelemetry

📋 Requirements

  • 8+ years in SRE / DevOps / Platform Engineering
  • Strong hands-on experience with AWS infrastructure at scale
  • Strong hands-on experience with Kubernetes (production-grade clusters)
  • Proven ability to debug complex distributed systems under pressure
  • Strong coding skills in Python or Go

✨ Nice to Have

  • Experience working with LLM APIs such as the OpenAI API
  • Familiarity with agent frameworks like LangChain or AutoGen
  • Exposure to AIOps or intelligent automation systems