23h ago
Computational Linguist, AI Evaluation
San Francisco
$150k-$160k / year
full-timesenior Hybridai-ml
๐ Tech Stack
๐ผ About This Role
You'll design and build robust model evaluation frameworks for video-language models, automating repetitive processes. Your work will directly enhance data quality and model assessment while collaborating with research and product teams. You'll drive our post-training evaluation strategy and improve tools for sustainable data operations.
๐ฏ What You'll Do
- Design and build robust model evaluation frameworks.
- Manage resource allocation and timelines across data streams.
- Enhance dataset quality through vendor collaboration.
- Establish labeling guidelines and monitor data quality.
- Partner with Engineering and AI Model teams on data needs.
๐ Requirements
- 5+ years experience in AI-focused data operations.
- Track record designing and executing large scale data or evaluation projects.
- Ability to analyze complex data and distill findings into guidelines.
- Proficiency with Python for automation.
โจ Nice to Have
- Experience in data collection for multimodal language models.
- Experience in red teaming or localization testing.
- Experience working with research scientists.
๐ Benefits & Perks
- ๐ค Open and inclusive culture and work environment.
- ๐งโ๐ป Work closely with a collaborative, mission-driven team on cutting-edge AI.
- ๐ฆท Full health, dental, and vision benefits.
- โ๏ธ Flexible PTO and parental leave policy.
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Onsite/Final Roundยท 120 min
0 0 0