11h ago
Research Engineer, Judgment Systems
San Francisco
$250k-$400k / year
full-time
ai-ml
💼 About This Role
You'll train and improve AI agents for high-stakes fraud and abuse detection at a small, talent-dense San Francisco startup. You'll own research threads, build evals, and turn ideas into production capabilities. This role pushes the frontier of reliable AI decision-making in adversarial, real-world environments.
🎯 What You'll Do
- Train, fine-tune, and improve models for fraud and abuse workflows
- Design and run experiments across post-training, retrieval, and tool use
- Build proprietary benchmarks, datasets, and evals for real-world failure modes
- Study where models break and prototype new training strategies
📋 Requirements
- Experience training, fine-tuning, or evaluating modern ML systems
- Strong programming skills in research-heavy codebases
- Familiarity with LLMs, agent systems, or post-training
- Ability to design clean experiments from noisy results
✨ Nice to Have
- Experience in fraud, risk, or trust and safety
- Background in reinforcement learning or retrieval
- Strong engineering judgment and a bias toward building
🎁 Benefits & Perks
- 🏖️ Unlimited PTO
- 🏥 Platinum-level medical, dental, and vision insurance
- 💪 $100/month health and wellness reimbursement
- 👶 Parental leave
- 💰 401(k) plan