1d ago

AI Researcher, Post Training

Stockholm

โœจ $150k-$250k / yearest.

full-timeseniorai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll own the full post-training pipeline for Lovable's AI models, translating research into production training recipes for code generation and agent workloads. You'll build evaluation infrastructure and operate distributed training systems to get improvements to users in days or weeks. This role offers a chance to shape the AI behind a fast-growing product used by over 2 million people.

๐ŸŽฏ What You'll Do

  • Own post-training pipeline from data curation to deployment
  • Apply RL, preference optimization, SFT for code generation
  • Build evaluation infrastructure for helpfulness, safety, latency
  • Operate GPU orchestration and data pipelines at scale

๐Ÿ“‹ Requirements

  • Run post-training jobs on large language models (RFT/RLVR, preference optimization)
  • Write solid production code
  • Fluent in PyTorch or JAX with distributed training
  • Understand math behind preference optimization, reward modeling

โœจ Nice to Have

  • Worked on code generation or agentic use cases
  • Put post-trained models into production with real users
  • Owned full loop: data, training, eval, deployment, monitoring

๐ŸŽ Benefits & Perks

  • ๐Ÿš€ Extreme ownership and high velocity culture
  • ๐Ÿข Stockholm office with small, talent-dense team
  • ๐ŸŒ Define the future of software creation

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Callยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Team Interviewยท 45 min
0 0 0