Research Engineer / Scientist, Alignment Science at Jobs at Anthropic

3h ago

San Francisco, CA

Full-time, mid-level, hybrid, Artificial Intelligence

Description

You will design and run machine learning experiments to understand and steer AI behavior, contributing to AI safety research focused on risks from powerful future systems. You will also collaborate with teams on scalable oversight, AI control, and alignment assessments.

Requirements

Experience as both a scientist and engineer
Strong background in machine learning and experimental design
Interest in AI safety and alignment challenges
Proficiency in Python, which is used in interviews

Responsibilities

Build and run machine learning experiments to understand and steer AI behavior
Contribute to exploratory experimental research on AI safety
Collaborate with teams on scalable oversight, AI control, and alignment assessments
Develop techniques to keep models helpful and honest as they surpass human intelligence
Create methods to ensure advanced AI systems remain safe in adversarial scenarios
