4h ago

Member of Technical Staff - Evaluations

San Francisco

$200k-$350k / yearest.

full-timeai-ml

💼 About This Role

You'll conduct critical comparative analysis to advance understanding of model capabilities and build tight feedback loops between data, evals, and model behavior. You'll collaborate with pre-training, post-training, and applied teams to translate insights into model improvements.

🎯 What You'll Do

  • Conduct comparative analysis to evaluate model capabilities
  • Build and refine evaluation systems and processes
  • Develop generalizable evaluation frameworks for reasoning and alignment
  • Collaborate with teams to translate insights into model improvements

📋 Requirements

  • Strong statistical analysis and experimental design skills
  • Familiarity with LLM evaluation methodologies
  • High agency in a fast-paced startup environment

✨ Nice to Have

  • Experience with agentic evaluation tasks
  • Background in synthetic evaluation generation

🎁 Benefits & Perks

  • 💰 Top-tier compensation with salary and equity
  • 🏥 Comprehensive health insurance (medical, dental, vision)
  • 👶 Fully paid parental leave for all new parents
  • 🍽️ Daily lunch and dinner provided
  • ✈️ Relocation support and paid time off
0 0 0