6h ago

Research Engineer, Production Model Post-Training

Zürich, CH
full-timeseniorArtificial Intelligence Visa Sponsor

Tech Stack

Description

You will train base models through the complete post-training stack to deliver production Claude models, implementing techniques like Constitutional AI and RLHF. Your work directly impacts model quality, safety, and capabilities.

Requirements

  • Strong software engineering skills with experience building complex ML systems
  • Comfortable working with large-scale distributed systems and high-performance computing
  • Experience with training, fine-tuning, or evaluating large language models
  • Proficiency in Python, deep learning frameworks, and distributed computing
  • Able to navigate ambiguity and make progress in fast-moving research environments

Responsibilities

  • Implement and optimize post-training techniques at scale on frontier models
  • Conduct research to develop and optimize post-training recipes that directly improve production model quality
  • Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation
  • Develop tools to measure and improve model performance across various dimensions
  • Collaborate with research teams to translate emerging techniques into production-ready implementations
0 views 0 saves 0 applications