14h ago
Software Engineer, Machine Learning Platform
San Francisco, CA
$187k-$259k / year
full-timeseniorfinance
๐ Tech Stack
+2
๐ผ About This Role
You'll design and build scalable ML infrastructure on AWS, enabling data scientists to develop, train, deploy, and monitor models reliably and efficiently. You'll work at the intersection of distributed systems and applied machine learning, creating robust foundations for ML teams.
๐ฏ What You'll Do
- Design, build, and operate scalable ML infrastructure on AWS
- Develop distributed training and batch processing systems using Ray
- Build and maintain infrastructure-as-code using Terraform
- Enhance observability, reliability, and cost visibility across ML workloads
๐ Requirements
- 5+ years in ML infrastructure, platform engineering, or production ML systems
- Knowledge of the machine learning model development lifecycle
- Experience with distributed systems, cloud computing, or large-scale data processing
- Hands-on experience with CI/CD pipelines, DevOps practices, and infrastructure as code
โจ Nice to Have
- Experience with Ray or distributed compute frameworks
- Experience building or operating a feature store
- Experience with streaming technologies like Kafka, Kinesis, Flink
๐ Benefits & Perks
- ๐ฐ Competitive salary plus bonus and equity package
- ๐ Generous vacation policy and company-wide Chime Days
- ๐ข In-office perks including backup child, elder, and/or pet care
- ๐ Subsidized commuter benefit
- ๐ฅ Medical, dental, vision, life, and disability benefits plus 401k match
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Phone Interviewยท 45 min
- 3On-site Interviewsยท 4 hours
0 0 0