8h ago

Senior Engineer - AI Evaluator

Miami

$208k-$416k / year

contractlead Remotesoftware

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll evaluate the quality of interactions with coding agents like OpenAI Codex and Claude Code, focusing on engineering taste. You'll judge whether responses reflect strong engineering judgment and provide opinionated feedback. This role offers a chance to define what great looks like for AI-assisted development.

๐ŸŽฏ What You'll Do

  • Evaluate AI-generated coding interactions end-to-end
  • Judge whether outputs are useful and aligned with strong engineering thinking
  • Distinguish quality levels (e.g., 2 vs 4) in model responses
  • Provide clear, opinionated feedback on what worked and what felt off

๐Ÿ“‹ Requirements

  • Staff/Principal-level engineer or equivalent experience
  • Strong background in TypeScript/JavaScript or Python
  • Hands-on experience with OpenAI Codex, Claude Code, or Cursor
  • Comfortable evaluating code without deep line-by-line review

โœจ Nice to Have

  • Experience with AI-first IDEs like Cursor
  • Prior exposure to prompt design or evaluation workflows
  • Experience mentoring senior engineers or defining engineering standards

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ $100โ€“$200/hour rate
  • โฐ 10โ€“20 hours/week flexible schedule
  • ๐Ÿš€ Remote work from anywhere
  • ๐Ÿ“… Through early May with possible extension

๐Ÿ“จ Hiring Process

Estimated timeline: 1-2 weeks ยท AI estimate

  1. 1Take-home evaluation exerciseยท 2-3 hours
  2. 2Behavioral interviewยท 45 min
0 0 0