1d ago

Machine Learning Operations Engineer

Somerville, MA

$150k-$200k / year

full-timesenior Hybridai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll own and scale the production inference systems behind Modulate's ML models, ensuring high availability and reliability. Your impact will be felt as you build systems to handle rapid customer and model growth. You'll reduce operational burden through better tooling and automation, defining how ML systems run at scale.

๐ŸŽฏ What You'll Do

  • Deploy, monitor, and maintain production ML inference systems
  • Design monitoring, alerting, and incident response for ML workloads
  • Build systems for scaling inference infrastructure under variable load
  • Improve reliability and observability of production ML services

๐Ÿ“‹ Requirements

  • Experience deploying and maintaining production software systems
  • Experience building monitoring and alerting systems for production
  • Strong experience with AWS, Python, and Linux
  • Exposure to PyTorch or similar ML frameworks

โœจ Nice to Have

  • Experience with ML model serving systems or dedicated model servers
  • Experience monitoring GPU performance for inference workloads
  • Experience optimizing machine learning model inference

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary + equity
  • ๐Ÿฅ Full health, dental, and vision coverage
  • ๐Ÿ–๏ธ Flexible PTO with strong culture of taking it
  • ๐Ÿ• Weekly team lunches with dietary accommodations
  • ๐ŸŒ Up to 8 weeks work-from-anywhere policy

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Application reviewยท 1 week
  2. 2Technical phone screenยท 1 hour
  3. 3On-site interviewsยท 4 hours
0 0 0