11h ago
ML Engineer, Post-Training and Evaluation
San Francisco
โจ $200k-$350k / yearest.
full-timemidai-ml
๐ Tech Stack
๐ผ About This Role
You'll fine-tune Reflection's open-weight models for enterprise customers' specific domains, building evaluation infrastructure and deploying models to production. You will work hands-on with customer data, run fine-tuning workflows, and debug training issues while contributing to best practices. This role offers direct impact on customer success and collaboration with top AI researchers.
๐ฏ What You'll Do
- Fine-tune models using SFT, DPO, and reinforcement fine-tuning.
- Build and maintain evaluation suites and test sets.
- Prepare and clean training data from raw customer inputs.
- Debug training and inference issues in production.
๐ Requirements
- 3+ years of applied ML or ML engineering experience.
- Hands-on experience fine-tuning language models with SFT, DPO, or RLHF.
- Strong Python skills and clean, reproducible code.
- Comfort with training infrastructure (GPUs, debugging training failures).
โจ Nice to Have
- Experience with customer-facing roles and translating domain requirements.
- Familiarity with data pipelines and version control for datasets.
๐ Benefits & Perks
- ๐ฐ Top-tier salary and equity
- ๐ฅ Comprehensive medical, dental, vision insurance
- ๐ถ Fully paid parental leave
- ๐ฑ Daily lunch and dinner provided
- ๐๏ธ Paid time off and relocation support
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter callยท 30 min
- 2Technical interviewยท 60 min
- 3On-site interviewsยท 4 hours
0 0 0