8h ago
Senior Engineer - AI Evaluator
Miami
$208k-$416k / year
contractlead Remotesoftware
๐ Tech Stack
๐ผ About This Role
You'll evaluate the quality of interactions with coding agents like OpenAI Codex and Claude Code, focusing on engineering taste. You'll judge whether responses reflect strong engineering judgment and provide opinionated feedback. This role offers a chance to define what great looks like for AI-assisted development.
๐ฏ What You'll Do
- Evaluate AI-generated coding interactions end-to-end
- Judge whether outputs are useful and aligned with strong engineering thinking
- Distinguish quality levels (e.g., 2 vs 4) in model responses
- Provide clear, opinionated feedback on what worked and what felt off
๐ Requirements
- Staff/Principal-level engineer or equivalent experience
- Strong background in TypeScript/JavaScript or Python
- Hands-on experience with OpenAI Codex, Claude Code, or Cursor
- Comfortable evaluating code without deep line-by-line review
โจ Nice to Have
- Experience with AI-first IDEs like Cursor
- Prior exposure to prompt design or evaluation workflows
- Experience mentoring senior engineers or defining engineering standards
๐ Benefits & Perks
- ๐ฐ $100โ$200/hour rate
- โฐ 10โ20 hours/week flexible schedule
- ๐ Remote work from anywhere
- ๐ Through early May with possible extension
๐จ Hiring Process
Estimated timeline: 1-2 weeks ยท AI estimate
- 1Take-home evaluation exerciseยท 2-3 hours
- 2Behavioral interviewยท 45 min
0 0 0