20h ago

Research Engineer - Language Model Pre-Training

San Francisco

โœจ $150k-$250k / yearest.

full-timeseniorai-ml Visa Sponsor

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with the pretraining team to integrate your insights into next-generation models. This role involves large-scale training runs and performance optimization of our pretraining stack.

๐ŸŽฏ What You'll Do

  • Develop and optimize large-scale pretraining pipelines
  • Implement model and data parallelism strategies
  • Conduct architecture and methodology research
  • Curate and evaluate training datasets

๐Ÿ“‹ Requirements

  • Strong engineering aptitude for reliable and robust systems
  • Ability to rapidly learn and implement new ideas
  • Excellent communication and collaboration skills
  • Experience with large-scale GPU clusters and distributed training

โœจ Nice to Have

  • Published machine learning research in top venues
  • Postgraduate degree in a scientific field
  • Deep understanding of experimental methodology

๐ŸŽ Benefits & Perks

  • ๐Ÿฅ Comprehensive medical, dental, vision
  • ๐Ÿ’ฐ Competitive compensation and 401(k)
  • ๐Ÿฝ๏ธ In-office snacks and meals
  • ๐Ÿ–๏ธ Unlimited PTO
  • ๐Ÿš€ Relocation and immigration support

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter screenยท 30 min
  2. 2Technical assessmentยท 60 min
  3. 3On-site interviewsยท 4 hours
0 0 0