Research Engineer / Scientist, Alignment Science
San Francisco, CA
Full-time | Mid-level | Hybrid | Artificial Intelligence
Description
You will design and run machine learning experiments to understand and steer AI behavior, contributing to AI safety research focused on risks from powerful future systems. You will also collaborate with teams working on scalable oversight, AI control, and alignment assessments.
Requirements
- Experience working as both a scientist and an engineer
- Strong background in machine learning and experimental design
- Interest in AI safety and alignment challenges
- Proficiency in Python, which is used in interviews
Responsibilities
- Build and run machine learning experiments to understand and steer AI behavior
- Contribute to exploratory experimental research on AI safety
- Collaborate with teams on scalable oversight, AI control, and alignment assessments
- Develop techniques to keep models helpful and honest as they surpass human intelligence
- Create methods to ensure advanced AI systems remain safe in adversarial scenarios