11h ago

ML Engineer, Post-Training and Evaluation

San Francisco

โœจ $200k-$350k / yearest.

full-timemidai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll fine-tune Reflection's open-weight models for enterprise customers' specific domains, building evaluation infrastructure and deploying models to production. You will work hands-on with customer data, run fine-tuning workflows, and debug training issues while contributing to best practices. This role offers direct impact on customer success and collaboration with top AI researchers.

๐ŸŽฏ What You'll Do

  • Fine-tune models using SFT, DPO, and reinforcement fine-tuning.
  • Build and maintain evaluation suites and test sets.
  • Prepare and clean training data from raw customer inputs.
  • Debug training and inference issues in production.

๐Ÿ“‹ Requirements

  • 3+ years of applied ML or ML engineering experience.
  • Hands-on experience fine-tuning language models with SFT, DPO, or RLHF.
  • Strong Python skills and clean, reproducible code.
  • Comfort with training infrastructure (GPUs, debugging training failures).

โœจ Nice to Have

  • Experience with customer-facing roles and translating domain requirements.
  • Familiarity with data pipelines and version control for datasets.

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Top-tier salary and equity
  • ๐Ÿฅ Comprehensive medical, dental, vision insurance
  • ๐Ÿ‘ถ Fully paid parental leave
  • ๐Ÿฑ Daily lunch and dinner provided
  • ๐Ÿ–๏ธ Paid time off and relocation support

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter callยท 30 min
  2. 2Technical interviewยท 60 min
  3. 3On-site interviewsยท 4 hours
0 0 0