15h ago

ML Ops Engineer

Ukraine

✨ $140k-$200k / year (est.)

Full-time · Senior · Remote

💼 About This Role

You'll design and operate production-grade ML inference infrastructure powering real-time AI applications at a fast-scaling cloud startup. The core impact is building scalable, reliable, cost-efficient systems for low-latency, GPU-based model serving.

🎯 What You'll Do

  • Build and operate production-grade model serving infrastructure with vLLM, TGI, or Triton
  • Design and implement blue/green and canary deployment pipelines for ML models
  • Develop auto-scaling systems and multi-model serving architectures
  • Optimize GPU utilization, memory efficiency, and network throughput

📋 Requirements

  • 4+ years of experience in ML Ops or similar infrastructure roles
  • Hands-on experience with model serving frameworks like vLLM, TGI, or Triton
  • Strong background in container orchestration and GPU workloads in production
  • Proficiency in Python and infrastructure-as-code tools like Terraform or Helm

✨ Nice to Have

  • Experience with ML platforms like Kubeflow or MLflow
  • Knowledge of CUDA/ROCm optimization or multi-tenant inference systems
  • Background in early-stage startups or greenfield infrastructure projects

🎁 Benefits & Perks

  • ๐Ÿ–๏ธ Remote work from anywhere in EMEA
  • ๐Ÿ’ฐ Competitive equity in a well-funded startup
  • ๐Ÿš€ Own critical infrastructure from the ground up
  • 🧠 Build deep expertise in next-gen AI infrastructure

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. System Design Interview · 60 min