ML Evals Engineer at Exa

8h ago

ML Evals Engineer

San Francisco, California

$150k-$300k / year

full-timeai-ml Visa Sponsor

🛠 Tech Stack

💼 About This Role

You'll design and build the evaluation stack for Exa's AI-powered search engine, creating benchmarks and pipelines that measure search quality in an LLM world. Your work will directly shape the direction of search research and influence what the team optimizes for. You'll collaborate with ML researchers and engineers to build the most comprehensive eval suite for billions of documents.

🎯 What You'll Do

Design evaluation frameworks that probe search limits
Build scalable eval pipelines tracking regressions and drift
Create golden datasets, synthetic benchmarks, and agentic tasks
Partner with ML researchers and engineers on feedback loops

📋 Requirements

Hands-on ML experience training or evaluating models
Strong engineering fundamentals in Python or Rust
Experience with distributed pipelines and GPU/cluster jobs
Ability to dive into data and design creative measurement strategies

✨ Nice to Have

Experience with embeddings or LLMs
Background in search or information retrieval

🎁 Benefits & Perks

🏖️ Unlimited PTO
🩺 Premium medical, dental, vision
👶 Fertility benefits
💪 Monthly wellness stipend

Exa

Exa Jobs

Other jobs at Exa

No other jobs found.

0 0 0