Member of Technical Staff - Model Evaluation

London, UK

$150k-$200k / year (est.)

full-time · senior · Artificial Intelligence

💼 About This Role

You'll join the Enterprise Model Evaluation team to measure how Grok performs on high-value real-world tasks. Your work will directly shape Grok's capabilities, and you'll do it in a flat, high-intensity culture.

🎯 What You'll Do

  • Identify high-value enterprise use cases
  • Provide complete assessments of models
  • Dig deep into model training data to identify weak points
  • Communicate with the modeling and data teams to improve model quality

📋 Requirements

  • Experience in model assessment and evaluation task development
  • Ability to collect and synthesize data for new evaluations
  • Ability to build infrastructure and frameworks for model evaluation
  • Familiarity with inference frameworks like SGlang and vLLM