1d ago
AI Researcher, Post Training
Stockholm
โจ $150k-$250k / yearest.
full-timeseniorai-ml
๐ Tech Stack
๐ผ About This Role
You'll own the full post-training pipeline for Lovable's AI models, translating research into production training recipes for code generation and agent workloads. You'll build evaluation infrastructure and operate distributed training systems to get improvements to users in days or weeks. This role offers a chance to shape the AI behind a fast-growing product used by over 2 million people.
๐ฏ What You'll Do
- Own post-training pipeline from data curation to deployment
- Apply RL, preference optimization, SFT for code generation
- Build evaluation infrastructure for helpfulness, safety, latency
- Operate GPU orchestration and data pipelines at scale
๐ Requirements
- Run post-training jobs on large language models (RFT/RLVR, preference optimization)
- Write solid production code
- Fluent in PyTorch or JAX with distributed training
- Understand math behind preference optimization, reward modeling
โจ Nice to Have
- Worked on code generation or agentic use cases
- Put post-trained models into production with real users
- Owned full loop: data, training, eval, deployment, monitoring
๐ Benefits & Perks
- ๐ Extreme ownership and high velocity culture
- ๐ข Stockholm office with small, talent-dense team
- ๐ Define the future of software creation
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Callยท 30 min
- 2Technical Interviewยท 60 min
- 3Team Interviewยท 45 min
0 0 0