6h ago
Research Engineer/Scientist - Human Alignment, Consumer Devices
San Francisco, CA
$380k-$445k / year
full-timesenior Hybridai-ml
🛠 Tech Stack
💼 About This Role
You'll develop RLHF and post-training methods for multimodal AI systems at OpenAI. Your work will directly shape how models learn from feedback over time, enabling personalized and context-aware behavior. This is a product-grounded research role with access to frontier infrastructure and safety teams.
🎯 What You'll Do
- Develop RLHF and post-training methods for multimodal models.
- Build reward models and preference-learning pipelines for adaptive behavior.
- Design datasets, rubrics, and evaluation frameworks for user preferences.
- Collaborate with safety researchers on aligned personalization.
📋 Requirements
- 3+ years experience in machine learning research with RLHF or post-training.
- Experience with reward modeling, preference optimization, or reinforcement learning.
- Proven track record of designing experiments and reliable evaluations.
- Experience building datasets or eval pipelines grounded in human preferences.
✨ Nice to Have
- Experience with multimodal AI systems.
- Background in personalization, memory, or recommender systems.
- Familiarity with long-horizon evaluation or user modeling.
🎁 Benefits & Perks
- 💰 Competitive compensation ($380K – $445K + equity)
- 🏖️ Flexible PTO and hybrid work model (4 days in office)
- ✈️ Relocation assistance available
- 🧠 Access to frontier AI research and compute resources
- 🏥 Comprehensive health benefits
0 0 0