15h ago
ML Ops Engineer
Ukraine
โจ $140k-$200k / yearest.
full-timesenior Remote
๐ Tech Stack
๐ผ About This Role
You'll design and operate production-grade ML inference infrastructure powering real-time AI applications in a fast-scaling cloud startup. Core impact: building scalable, reliable, cost-efficient systems for GPU-based model serving at low latency.
๐ฏ What You'll Do
- Build and operate production-grade model serving infrastructure with vLLM, TGI, or Triton
- Design and implement blue/green and canary deployment pipelines for ML models
- Develop auto-scaling systems and multi-model serving architectures
- Optimize GPU utilization, memory efficiency, and network throughput
๐ Requirements
- 4+ years of experience in ML Ops or similar infrastructure roles
- Hands-on experience with model serving frameworks like vLLM, TGI, or Triton
- Strong background in container orchestration and GPU workloads in production
- Proficiency in Python and infrastructure-as-code tools like Terraform or Helm
โจ Nice to Have
- Experience with ML platforms like Kubeflow or MLflow
- Knowledge of CUDA/ROCm optimization or multi-tenant inference systems
- Background in early-stage startups or greenfield infrastructure projects
๐ Benefits & Perks
- ๐๏ธ Remote work from anywhere in EMEA
- ๐ฐ Competitive equity in a well-funded startup
- ๐ Own critical infrastructure from the ground up
- ๐ง Deep expertise in next-gen AI infrastructure
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3System Design Interviewยท 60 min
0 0 0