1d ago
Machine Learning Operations Engineer
Somerville, MA
$150k-$200k / year
full-time · senior · Hybrid · ai-ml
About This Role
You'll own and scale the production inference systems behind Modulate's ML models, keeping them highly available and reliable. You'll build systems that handle rapid customer and model growth, reduce operational burden through better tooling and automation, and define how ML systems run at scale.
What You'll Do
- Deploy, monitor, and maintain production ML inference systems
- Design monitoring, alerting, and incident response for ML workloads
- Build systems for scaling inference infrastructure under variable load
- Improve reliability and observability of production ML services
Requirements
- Experience deploying and maintaining production software systems
- Experience building monitoring and alerting systems for production
- Strong experience with AWS, Python, and Linux
- Exposure to PyTorch or similar ML frameworks
Nice to Have
- Experience with ML model serving systems or dedicated model servers
- Experience monitoring GPU performance for inference workloads
- Experience optimizing machine learning model inference
Benefits & Perks
- Competitive salary + equity
- Full health, dental, and vision coverage
- Flexible PTO with a strong culture of taking it
- Weekly team lunches with dietary accommodations
- Up to 8 weeks work-from-anywhere policy
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Application review · 1 week
- 2. Technical phone screen · 1 hour
- 3. On-site interviews · 4 hours