23h ago

Computational Linguist, AI Evaluation

San Francisco

$150k-$160k / year

full-time · senior · Hybrid · ai-ml

💼 About This Role

You'll design and build robust evaluation frameworks for video-language models and automate repetitive evaluation processes. Working with research and product teams, you'll directly improve data quality and model assessment. You'll also drive our post-training evaluation strategy and improve the tools that keep data operations sustainable.

🎯 What You'll Do

  • Design and build robust model evaluation frameworks.
  • Manage resource allocation and timelines across data streams.
  • Enhance dataset quality through vendor collaboration.
  • Establish labeling guidelines and monitor data quality.
  • Partner with Engineering and AI Model teams on data needs.

📋 Requirements

  • 5+ years of experience in AI-focused data operations.
  • Track record of designing and executing large-scale data or evaluation projects.
  • Ability to analyze complex data and distill findings into guidelines.
  • Proficiency with Python for automation.

✨ Nice to Have

  • Experience in data collection for multimodal language models.
  • Experience in red teaming or localization testing.
  • Experience working with research scientists.

๐ŸŽ Benefits & Perks

  • ๐Ÿค Open and inclusive culture and work environment.
  • ๐Ÿง‘โ€๐Ÿ’ป Work closely with a collaborative, mission-driven team on cutting-edge AI.
  • ๐Ÿฆท Full health, dental, and vision benefits.
  • โœˆ๏ธ Flexible PTO and parental leave policy.

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. Onsite/Final Round · 120 min