20h ago
Research Engineer - Language Model Pre-Training
San Francisco
โจ $150k-$250k / yearest.
full-timeseniorai-ml Visa Sponsor
๐ Tech Stack
๐ผ About This Role
You'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with the pretraining team to integrate your insights into next-generation models. This role involves large-scale training runs and performance optimization of our pretraining stack.
๐ฏ What You'll Do
- Develop and optimize large-scale pretraining pipelines
- Implement model and data parallelism strategies
- Conduct architecture and methodology research
- Curate and evaluate training datasets
๐ Requirements
- Strong engineering aptitude for reliable and robust systems
- Ability to rapidly learn and implement new ideas
- Excellent communication and collaboration skills
- Experience with large-scale GPU clusters and distributed training
โจ Nice to Have
- Published machine learning research in top venues
- Postgraduate degree in a scientific field
- Deep understanding of experimental methodology
๐ Benefits & Perks
- ๐ฅ Comprehensive medical, dental, vision
- ๐ฐ Competitive compensation and 401(k)
- ๐ฝ๏ธ In-office snacks and meals
- ๐๏ธ Unlimited PTO
- ๐ Relocation and immigration support
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter screenยท 30 min
- 2Technical assessmentยท 60 min
- 3On-site interviewsยท 4 hours
0 0 0