4h ago

Research Engineer/Research Scientist, Pre-training

Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY

$350,000-$850,000 / year

full-time HybridArtificial Intelligence

Tech Stack

Description

You will work on the Pre-training team developing next-generation large language models. Your role involves conducting research and implementing solutions in model architecture, algorithms, data processing, and optimizer development, while collaborating on initiatives to build safe and trustworthy AI systems.

Requirements

  • Advanced degree (MS or PhD) in Computer Science, Machine Learning, or related field
  • Strong software engineering skills with track record of building complex systems
  • Expertise in Python and deep learning frameworks (PyTorch preferred)
  • Familiarity with large-scale machine learning, especially language models
  • Excellent communication and collaboration skills

Responsibilities

  • Conduct research and implement solutions in model architecture, algorithms, data processing, and optimizer development
  • Independently lead small research projects and collaborate on larger initiatives
  • Design, run, and analyze scientific experiments to advance understanding of large language models
  • Optimize and scale training infrastructure for efficiency and reliability
  • Contribute to the entire stack from low-level optimizations to high-level model design
0 views 0 saves 0 applications