3h ago

Research Engineer, Production Model Post-Training

San Francisco, CA; New York City, NY; Seattle, WA

$350,000-$500,000 / year

full-timeseniorArtificial Intelligence Visa Sponsor

Tech Stack

Description

In this role, you will train Anthropic's base models through the complete post-training stack to deliver production Claude models. You'll work at the intersection of research and engineering, implementing and scaling alignment techniques like Constitutional AI and RLHF, directly impacting model quality and safety.

Requirements

  • Strong software engineering skills with experience building complex ML systems
  • Experience with training, fine-tuning, or evaluating large language models
  • Comfortable working with large-scale distributed systems and high-performance computing
  • Proficiency in Python, deep learning frameworks, and distributed computing
  • Ability to navigate ambiguity and make progress in fast-moving research environments

Responsibilities

  • Implement and optimize post-training techniques at scale on frontier models
  • Conduct research to develop and optimize post-training recipes that improve production model quality
  • Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation
  • Develop tools to measure and improve model performance across various dimensions
  • Collaborate with research teams to translate emerging techniques into production-ready implementations
0 views 0 saves 0 applications