1d ago

Machine Learning Engineer, Core Evaluations

Remote

โœจ $150k-$220k / yearest.

full-timesenior Remoteai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll design and build evaluation pipelines for cutting-edge speech generation and recognition models at a social AI company. As the founding evaluation team member, you'll lead the development of automated dashboards and user studies to measure model performance. This role offers the chance to shape the evaluation culture of a fast-growing startup.

๐ŸŽฏ What You'll Do

  • Design evaluation pipelines for models in development and production
  • Design user studies for subjective model evaluations
  • Convert requirements into measurable metrics
  • Design and develop automated evaluation dashboards

๐Ÿ“‹ Requirements

  • Metric design experience for model performance evaluation
  • User study design on platforms like Mechanical Turk
  • Model training and fine-tuning for evaluation purposes
  • Statistical analysis to compare evaluation results

โœจ Nice to Have

  • Experience with ASR and TTS model training
  • Experience with large-scale ML (3B+ models, >1M hours data)

๐ŸŽ Benefits & Perks

  • ๐Ÿ–๏ธ Remote-first work environment
  • ๐Ÿ’ฐ Competitive compensation
  • ๐Ÿ“ˆ Growth opportunities as founding team member
  • ๐Ÿถ Pet-friendly culture

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter phone screenยท 30 min
  2. 2Technical interviewยท 60 min
  3. 3Onsite/virtual final roundยท 3 hours
0 0 0