3h ago
Lead DevOps Engineer
Bengaluru
full-timeseniorAI / Customer Experience
Tech Stack
+4
Description
You will own and scale systems powering real-world customer interactions, driving high-impact initiatives like GPU orchestration, self-hosting, and low-latency AI deployments while working closely with ML teams to productionize models. You have end-to-end ownership, a modern tech stack, and the opportunity to shape MLOps best practices.
Requirements
- 6+ years in DevOps, SRE, or Cloud Infrastructure roles, preferably in AI or data-intensive environments.
- Strong expertise in Kubernetes (EKS, AKS) for deploying AI workloads and managing GPU clusters.
- Experience with self-hosting Elasticsearch, Prometheus, Grafana, Kafka.
- Hands-on with Terraform, CloudFormation, and cloud platforms (AWS, Azure, GCP).
- Strong automation skills in Python, Bash, or Go.
Responsibilities
- Lead transition from managed services to self-hosted Elasticsearch, Prometheus, etc.
- Optimize AI infrastructure for deploying and scaling ML models with low latency.
- Design scalable, fault-tolerant systems for large-scale AI workloads and data pipelines.
- Enhance and automate ML model deployment pipelines using Kubeflow, MLflow, Argo Workflows.
- Implement monitoring, logging, and alerting strategies with Prometheus, Grafana, ELK, OpenTelemetry.
0 views 0 saves 0 applications