4h ago
Research Engineer, Frontier Red Team (Autonomy)
San Francisco, CA
Artificial Intelligence
Tech Stack
Description
You'll design and build autonomous AI systems to understand and defend against advanced adversarial AI, creating evals, defensive agents, and cyberphysical interfaces. Your work directly shapes Anthropic's and the world's preparedness for advanced AI systems.
Requirements
- Strong software engineering skills, particularly in Python
- Experience building and working with LLM-based agents or autonomous systems
- Ability to solve ambiguously scoped, high-stakes problems
- Design and run experiments quickly, iterating fast
- Care deeply about AI safety and want real-world impact
Responsibilities
- Design and build autonomous AI systems with tool use and diverse environment operation
- Create evals and training environments for agent behavior understanding and shaping
- Develop defensive agents that can detect, disrupt, or outcompete adversarial AI
- Interface Claude with hardware platforms for cyberphysical risk understanding
- Translate technical findings into demonstrations for policymakers and public
0 views 0 saves 0 applications