11h ago
Lead DevOps Engineer
Bengaluru, Karnataka, India
โจ $225k-$325k / yearest.
full-timeseniorai-ml
๐ Tech Stack
+4
๐ผ About This Role
You'll lead self-hosting and AI infrastructure initiatives at Observe.AI, optimizing GPU orchestration and low-latency AI deployments. You'll own end-to-end infrastructure scalability and drive MLOps best practices. This role offers strong technical leadership in a fast-scaling AI company.
๐ฏ What You'll Do
- Lead transition from managed to self-hosted Elasticsearch, Prometheus, Kafka
- Optimize AI/ML model deployment and scaling for high availability
- Design scalable, fault-tolerant systems for AI workloads
- Enhance CI/CD pipelines using MLOps tools like Kubeflow, MLflow
๐ Requirements
- 6+ years in DevOps, SRE, or Cloud Infrastructure
- Strong expertise in Kubernetes (EKS, AKS) for GPU clusters
- Experience self-hosting Elasticsearch, Prometheus, Kafka
- Hands-on with Infrastructure as Code (Terraform, CloudFormation)
โจ Nice to Have
- FinOps expertise for GPU cloud compute cost optimization
- Familiarity with service meshes (Istio, Linkerd)
- Knowledge of compliance frameworks (SOC2, ISO 27001)
๐ Benefits & Perks
- ๐ฅ Excellent medical insurance with free online doctor consultations
- ๐ Learning and Development fund for continuous learning
- ๐ Generous holidays and flexible benefit plans for tax exemptions
- ๐ถ Parental leave policies
- ๐ Fun events to build culture across the organization
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Callยท 30 min
- 2Technical Interviewยท 60 min
- 3System Design / Architectureยท 60 min
0 0 0