1d ago
Research, Post-Training Data
San Francisco Bay Area
✨ $200k-$350k / yearest.
full-timeseniorai-ml
🛠 Tech Stack
💼 About This Role
You'll research data strategies for post-training LLMs, blending human feedback and synthetic data. Your core impact is designing scalable pipelines for high-quality data and improving model alignment. You'll work at a fast-moving AI lab with access to frontier-scale compute and data.
🎯 What You'll Do
- Design and execute data collection and synthesis strategies for post-training.
- Develop scalable pipelines for human and synthetic data generation.
- Research and model human preferences to improve reasoning and truthfulness.
- Design evaluation metrics and benchmarks for data quality and alignment.
📋 Requirements
- Strong engineering skills with ability to contribute code in complex codebases.
- Experience with data curation or synthetic data generation for LLMs.
- Proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow).
- Strong grasp of probability, statistics, and ML fundamentals.
✨ Nice to Have
- Prior experience with RLHF, RLAIF, or reward learning for large models.
- Experience managing human data collection campaigns or annotation workflows.
- Familiarity with synthetic data pipelines or active learning.
🎁 Benefits & Perks
- 🚀 Fast-moving environment with minimal process overhead.
- 💻 Access to frontier-scale compute and data from day one.
- 🏆 Small, highly selective team where research reaches deployment quickly.
0 0 0