Senior Research Scientist, Model Evaluation at Cohere

16h ago

Senior Research Scientist, Model Evaluation

Toronto

✨ $200k-$300k / yearest.

full-timesenior Hybridai-ml

🛠 Tech Stack

💼 About This Role

You'll create next-generation evaluation methods and infrastructure to measure LLM progress at a company scaling intelligence. Your work will directly shape model capabilities and set the agenda for future AI. You'll collaborate with cross-functional teams to translate model feedback into trustworthy evaluations.

🎯 What You'll Do

Create ambitious new evaluation benchmarks for LLMs.
Translate model feedback into trustworthy, repeatable evaluations.
Conduct research on LLM evaluation methods and efficiency.
Build scalable tools for digging into model performance.

📋 Requirements

Rapidly build prototypes to demonstrate LLM capabilities.
Experience reviewing complex data and LLM outputs for quality.
Obsessive about rigorously measuring AI capabilities.
Strong software engineering skills.

✨ Nice to Have

Experience training LLM judges.
Refining LLM-based data synthesis pipelines.
Improving evaluation efficiency.

🎁 Benefits & Perks

🤝 Inclusive culture and work environment
🧑‍💻 Work on cutting-edge AI research
🍽 Weekly lunch stipend and in-office meals
🦷 Full health and dental benefits with mental health budget
✈️ 6 weeks of vacation

Cohere

Cohere Jobs

Other jobs at Cohere

No other jobs found.

0 0 0