7h ago
Research Engineer - Posttraining
Menlo Park, Remote
✨ $150k-$250k / yearest.
full-time Remoteai-ml
💼 About This Role
You'll post-train frontier models to autonomously run parts of the scientific discovery pipeline, including generating hypotheses and designing experiments. Your work will directly enable novel scientific discoveries. You'll collaborate with world-class experts in physical sciences.
🎯 What You'll Do
- Post-train frontier models for scientific discovery pipeline
- Create high-quality evaluation and training tasks
- Scale up RL environments for LLMs
- Design creative reward functions and run RL runs
📋 Requirements
- Experience with RL environments for LLMs
- Experience creating high-quality evals for frontier models
- Skill in training datasets and reward functions with LLMs
- Ability to collaborate with domain experts on evaluation criteria
0 0 0