3h ago
Member of Technical Staff - Model Evaluation
London, UK
✨ $150k-$200k / yearest.
full-timeseniorArtificial Intelligence
🛠 Tech Stack
💼 About This Role
You'll join the Enterprise Model Evaluation team to measure how Grok performs on high-value real-world tasks. Your work will directly shape Grok's capabilities in a flat, high-intensity culture.
🎯 What You'll Do
- Identify high-value enterprise use cases
- Provide complete assessment of models
- Deep dive into model training data to identify weakness points
- Communicate with modeling and data team to improve model quality
📋 Requirements
- Experience in model assessment and evaluation task development
- Ability to collect and synthesize data for new evaluations
- Build infrastructure and framework for model evaluation
- Familiarity with inference frameworks like SGlang and vLLM
0 0 0