14h ago

Software Engineer, Machine Learning Platform

San Francisco, CA

$187k-$259k / year

full-timeseniorfinance

๐Ÿ›  Tech Stack

+2

๐Ÿ’ผ About This Role

You'll design and build scalable ML infrastructure on AWS, enabling data scientists to develop, train, deploy, and monitor models reliably and efficiently. You'll work at the intersection of distributed systems and applied machine learning, creating robust foundations for ML teams.

๐ŸŽฏ What You'll Do

  • Design, build, and operate scalable ML infrastructure on AWS
  • Develop distributed training and batch processing systems using Ray
  • Build and maintain infrastructure-as-code using Terraform
  • Enhance observability, reliability, and cost visibility across ML workloads

๐Ÿ“‹ Requirements

  • 5+ years in ML infrastructure, platform engineering, or production ML systems
  • Knowledge of the machine learning model development lifecycle
  • Experience with distributed systems, cloud computing, or large-scale data processing
  • Hands-on experience with CI/CD pipelines, DevOps practices, and infrastructure as code

โœจ Nice to Have

  • Experience with Ray or distributed compute frameworks
  • Experience building or operating a feature store
  • Experience with streaming technologies like Kafka, Kinesis, Flink

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary plus bonus and equity package
  • ๐Ÿ– Generous vacation policy and company-wide Chime Days
  • ๐Ÿข In-office perks including backup child, elder, and/or pet care
  • ๐ŸšŒ Subsidized commuter benefit
  • ๐Ÿฅ Medical, dental, vision, life, and disability benefits plus 401k match

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Screenยท 30 min
  2. 2Technical Phone Interviewยท 45 min
  3. 3On-site Interviewsยท 4 hours
0 0 0