17h ago
Principal MLOps Engineer
Remote, US; DMV; McLean, VA; Boston, MA; San Antonio, TX; Colorado Springs, CO; Tampa, FL; Honolulu, HI
β¨ $175k-$225k / yearest.
full-timelead Remote
π Tech Stack
πΌ About This Role
You'll design and mature Raft's ML platform and MLOps infrastructure for the Department of Defense. You'll build secure, scalable deployment pipelines and manage GPU-enabled Kubernetes clusters to support mission-critical AI systems. This role offers the chance to work on cutting-edge defense technology with high-impact problem-solving.
π― What You'll Do
- Design and maintain secure, scalable MLOps infrastructure and deployment pipelines
- Manage machine learning workloads on Kubernetes, including GPU-enabled clusters
- Build CI/CD workflows for ML services and model artifacts
- Partner with ML engineers to move models from experimentation to production
π Requirements
- 7+ years of experience in software engineering, platform engineering, or MLOps
- 5+ years of experience with Docker and Kubernetes in production
- Experience building and maintaining ML platforms or infrastructure
- Ability to obtain and maintain Top Secret clearance
β¨ Nice to Have
- Experience with Triton Inference Server, KServe, or similar model serving platforms
- Experience with Kubeflow or other Kubernetes-based ML platforms
- Familiarity with Istio or service mesh technologies
π Benefits & Perks
- ποΈ Flexible work environment
- π° Competitive salary
- π Professional development opportunities
- π» Remote work options
- π― Impactful mission
π¨ Hiring Process
Estimated timeline: 2-4 weeks Β· AI estimate
- 1Recruiter ScreenΒ· 30 min
- 2Technical InterviewΒ· 60 min
- 3Hiring Manager InterviewΒ· 45 min
π© Heads Up
- Requires Security+ certification within 90 days, which may be an additional burden
- Demands ability to obtain Top Secret clearance, which can be a lengthy process
- Mentions both on-prem and edge deployment, indicating complex environments
0 0 0