17h ago
Senior Research Engineer, Model Evaluation
Toronto
✨ $200k-$300k / yearest.
full-timesenior Hybridai-ml
🛠 Tech Stack
💼 About This Role
You'll develop next-generation evaluation methods and scalable infrastructure to measure LLM progress. You'll create evaluation benchmarks, datasets, and environments for frontier model capabilities. You'll also conduct research to push the state-of-the-art in LLM evaluation methods.
🎯 What You'll Do
- Develop evaluation benchmarks, datasets, and environments for frontier model capabilities
- Conduct research to advance LLM evaluation methods
- Build scalable tools for investigating and understanding evaluation results
📋 Requirements
- Experience pushing limits of LLMs and building high-quality evaluation resources
- Track record of developing new methods or data to evaluate LLMs
- Deep experience building with and around LLMs and analyzing their performance
- Strong software engineering skills
🎁 Benefits & Perks
- 🤝 Open and inclusive culture
- 🧑💻 Work on cutting-edge AI research
- 🍽 Weekly lunch stipend
- 🦷 Full health and dental benefits
- ✈️ 6 weeks of vacation
0 0 0