4h ago
Member of Technical Staff - Evaluations
San Francisco
✨ $200k-$350k / yearest.
full-timeai-ml
💼 About This Role
You'll conduct critical comparative analysis to advance understanding of model capabilities and build tight feedback loops between data, evals, and model behavior. You'll collaborate with pre-training, post-training, and applied teams to translate insights into model improvements.
🎯 What You'll Do
- Conduct comparative analysis to evaluate model capabilities
- Build and refine evaluation systems and processes
- Develop generalizable evaluation frameworks for reasoning and alignment
- Collaborate with teams to translate insights into model improvements
📋 Requirements
- Strong statistical analysis and experimental design skills
- Familiarity with LLM evaluation methodologies
- High agency in a fast-paced startup environment
✨ Nice to Have
- Experience with agentic evaluation tasks
- Background in synthetic evaluation generation
🎁 Benefits & Perks
- 💰 Top-tier compensation with salary and equity
- 🏥 Comprehensive health insurance (medical, dental, vision)
- 👶 Fully paid parental leave for all new parents
- 🍽️ Daily lunch and dinner provided
- ✈️ Relocation support and paid time off
0 0 0