Senior Research Engineer, Model Evaluation

Toronto

$200k-$300k / year (est.)

Full-time · Senior · Hybrid · AI/ML

💼 About This Role

You'll develop next-generation evaluation methods and scalable infrastructure to measure LLM progress. You'll create evaluation benchmarks, datasets, and environments for frontier model capabilities. You'll also conduct research to push the state of the art in LLM evaluation methods.

🎯 What You'll Do

  • Develop evaluation benchmarks, datasets, and environments for frontier model capabilities
  • Conduct research to advance LLM evaluation methods
  • Build scalable tools for investigating and understanding evaluation results

📋 Requirements

  • Experience pushing the limits of LLMs and building high-quality evaluation resources
  • Track record of developing new methods or data to evaluate LLMs
  • Deep experience building with and around LLMs and analyzing their performance
  • Strong software engineering skills

🎁 Benefits & Perks

  • 🤝 Open and inclusive culture
  • 🧑‍💻 Work on cutting-edge AI research
  • 🍽 Weekly lunch stipend
  • 🦷 Full health and dental benefits
  • ✈️ 6 weeks of vacation