4h ago
Research Engineer/Research Scientist, Pre-training
Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
$350,000-$850,000 / year
full-time HybridArtificial Intelligence
Tech Stack
Description
You will work on the Pre-training team developing next-generation large language models. Your role involves conducting research and implementing solutions in model architecture, algorithms, data processing, and optimizer development, while collaborating on initiatives to build safe and trustworthy AI systems.
Requirements
- Advanced degree (MS or PhD) in Computer Science, Machine Learning, or related field
- Strong software engineering skills with track record of building complex systems
- Expertise in Python and deep learning frameworks (PyTorch preferred)
- Familiarity with large-scale machine learning, especially language models
- Excellent communication and collaboration skills
Responsibilities
- Conduct research and implement solutions in model architecture, algorithms, data processing, and optimizer development
- Independently lead small research projects and collaborate on larger initiatives
- Design, run, and analyze scientific experiments to advance understanding of large language models
- Optimize and scale training infrastructure for efficiency and reliability
- Contribute to the entire stack from low-level optimizations to high-level model design
0 views 0 saves 0 applications