1d ago
Model Policy Manager
San Francisco, CA
$207k-$295k / year
Full-time · Senior · Hybrid · AI/ML
About This Role
You'll define model behavior policies for OpenAI's frontier AI systems in high-risk contexts such as agentic and multimodal deployments. Your work will directly shape the safety guardrails that protect users while enabling beneficial AI use, translating ambiguity into measurable safety criteria.
What You'll Do
- Design and maintain model policies across safety-relevant domains.
- Translate risk models into behavioral specifications and evaluation criteria.
- Use red-teaming and deployment data to improve policy quality.
- Contribute to system cards, safety reports, and launch reviews.
Requirements
- Strong judgment about how advanced AI systems affect real-world risk
- Experience building policy frameworks for complex technical systems
- Ability to move across domains without deep expertise in every area
- Comfort using empirical evidence to inform policy decisions
Nice to Have
- Experience with red-teaming or threat modeling
- Background in AI safety research
- Knowledge of adversarial machine learning
Benefits & Perks
- Hybrid work model (3 days in office)
- Equity offered
- Three in-house prepared meals daily
- Well-stocked kitchens with snacks and drinks
- Private bike storage
Hiring Process
Estimated timeline: 3-5 weeks · AI estimate
1. Application Review · 1-2 weeks
2. Technical Interview · 1 hour
3. Onsite Interviews · 3-4 hours