14h ago
Senior DevOps / Platform Reliability Engineer
East Coast - United States
โจ $160k-$190k / yearest.
full-timesenior Remotesoftware
๐ Tech Stack
๐ผ About This Role
You'll own the CI/CD, infrastructure, and observability backbone for our agentic CX platform, enabling safe multi-agent deployments to enterprise customers. You'll build AI-native DevOps capabilities and operate a production AI platform. If you want to use AI to operate AI, this role is for you.
๐ฏ What You'll Do
- Own and evolve CI/CD pipelines using GitHub Actions and OIDC
- Automate infrastructure provisioning with Terraform and CloudFormation
- Operate and scale Kubernetes platform (EKS + Argo CD)
- Build observability as a product using Prometheus, Grafana, and OpenTelemetry
- Design and operate auto-remediation agents for production toil
๐ Requirements
- 5+ years of experience in DevOps, SRE, or Platform Engineering on AWS
- Strong experience with CI/CD pipelines (GitHub Actions, GitLab CI, etc.)
- Hands-on experience operating production EKS environments
- Deep experience with Terraform and GitHub Actions using OIDC
โจ Nice to Have
- Experience operating LLM or ML workloads in production
- Experience building MCP servers or deploying agent frameworks like LangGraph or CrewAI
๐ Benefits & Perks
- ๐ฐ Competitive compensation
- ๐ฅ Comprehensive health benefits (100% employee premiums)
- ๐๏ธ Unlimited PTO
- ๐ Flexible remote work from anywhere
- ๐ป Home office stipend ($500 setup + $100/month)
๐จ Hiring Process
Estimated timeline: 2-3 weeks ยท AI estimate
- 1Recruiter screenยท 30 min
- 2Technical interviewยท 60 min
- 3Hiring manager interviewยท 45 min
0 0 0