14h ago

Senior DevOps / Platform Reliability Engineer

East Coast - United States

โœจ $160k-$190k / yearest.

full-timesenior Remotesoftware

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll own the CI/CD, infrastructure, and observability backbone for our agentic CX platform, enabling safe multi-agent deployments to enterprise customers. You'll build AI-native DevOps capabilities and operate a production AI platform. If you want to use AI to operate AI, this role is for you.

๐ŸŽฏ What You'll Do

  • Own and evolve CI/CD pipelines using GitHub Actions and OIDC
  • Automate infrastructure provisioning with Terraform and CloudFormation
  • Operate and scale Kubernetes platform (EKS + Argo CD)
  • Build observability as a product using Prometheus, Grafana, and OpenTelemetry
  • Design and operate auto-remediation agents for production toil

๐Ÿ“‹ Requirements

  • 5+ years of experience in DevOps, SRE, or Platform Engineering on AWS
  • Strong experience with CI/CD pipelines (GitHub Actions, GitLab CI, etc.)
  • Hands-on experience operating production EKS environments
  • Deep experience with Terraform and GitHub Actions using OIDC

โœจ Nice to Have

  • Experience operating LLM or ML workloads in production
  • Experience building MCP servers or deploying agent frameworks like LangGraph or CrewAI

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive compensation
  • ๐Ÿฅ Comprehensive health benefits (100% employee premiums)
  • ๐Ÿ–๏ธ Unlimited PTO
  • ๐Ÿ  Flexible remote work from anywhere
  • ๐Ÿ’ป Home office stipend ($500 setup + $100/month)

๐Ÿ“จ Hiring Process

Estimated timeline: 2-3 weeks ยท AI estimate

  1. 1Recruiter screenยท 30 min
  2. 2Technical interviewยท 60 min
  3. 3Hiring manager interviewยท 45 min
0 0 0