20h ago
Principal MLOps Platform Engineer
United States
$170k-$190k / year
full-timelead Remotesoftware
๐ Tech Stack
๐ผ About This Role
You'll design and operate the core MLOps platform infrastructure for production-grade AI and ML systems. You'll define how models and LLM-powered services are deployed, monitored, and managed at scale across cloud environments. This high-impact role combines deep cloud engineering, platform architecture, and MLOps expertise to shape developer experience and operational efficiency.
๐ฏ What You'll Do
- Build infrastructure as code using Terraform or AWS CDK
- Design CI/CD pipelines with GitHub Actions, GitLab CI, or CodePipeline
- Implement observability frameworks for ML/LLM systems
- Define environment isolation and model lifecycle management
๐ Requirements
- 7+ years in platform engineering, DevOps, MLOps, or cloud infrastructure
- Deep expertise in AWS production architecture and operations
- Strong experience with Terraform or AWS CDK
- Hands-on experience with Docker and orchestration (ECS, EKS, or Kubernetes)
โจ Nice to Have
- Experience managing ML/LLM workloads in production
- AWS certifications (Solutions Architect Associate or Professional)
- Kubernetes or CNCF certifications
๐ Benefits & Perks
- ๐ฐ Competitive salary $170,000โ$190,000 OTE
- ๐ฅ Comprehensive medical, dental, and vision insurance
- ๐๏ธ Paid time off, holidays, parental and caregiver leave
- ๐ Remote-friendly work environment
- ๐ Continuous learning support including certifications
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Initial phone screenยท 30 min
- 2Technical interview (cloud/MLOps)ยท 60 min
- 3System design or platform architecture interviewยท 60 min
0 0 0