Research Engineer, Environment Scaling at Jobs at Anthropic

4h ago

Research Engineer, Environment Scaling

San Francisco, CA

$350,000-$850,000 / year

full-timesenior HybridArtificial Intelligence Visa Sponsor

Tech Stack

Description

You'll own the end-to-end process of creating RL environments for new capabilities: identifying high-value tasks, designing reward signals, managing vendor relationships, and measuring impact on model performance.

Requirements

Experience with fine-tuning large language models for specific domains or real-world use cases
Experience with reinforcement learning, reward design, or training data curation for LLMs
Comfortable managing technical vendor relationships and iterating quickly on feedback
Strong project management and interpersonal skills
Passionate about making AI more useful and accessible across different industries

Responsibilities

Improve and execute fine-tuning strategies for adapting Claude to new domains and tasks
Manage technical relationships with external data vendors, including evaluation of data quality and reward design
Collaborate with domain experts to design data pipelines and evaluations
Explore novel ways of creating RL environments for high value tasks
Develop and improve QA frameworks to catch reward hacking and ensure environment quality

Jobs at Anthropic

Other jobs at Jobs at Anthropic

No other jobs found.

0 views 0 saves 0 applications