7h ago

Applied AI Researcher, Post-Training

San Francisco

$150k-$250k / year

full-time Hybridai-ml

🛠 Tech Stack

💼 About This Role

You'll join Distyl's post-training team to adapt foundation models for real-world enterprise performance and alignment. You'll investigate new methods for aligning models with human and system-level objectives, bridging raw capability with trustworthy, contextually aligned system behavior. Your work will impact critical operations across telecom, healthcare, and manufacturing.

🎯 What You'll Do

  • Adapt foundation models using supervised fine-tuning and preference optimization
  • Develop and evaluate techniques like DPO, RLHF, and continual adaptation
  • Align models with enterprise systems for trustworthy behavior
  • Explore trade-offs between generalization and specialization

📋 Requirements

  • Deep understanding of post-training techniques (SFT, RLHF/DPO, LoRA/PEFT)
  • Experience adapting frontier models (LLMs/SLMs) to specialized domains
  • Experience building with models, not just building models (compound AI systems, agentic collaboration)
  • Strong programming and data analysis skills for prototyping and experimentation

✨ Nice to Have

  • Published research in top journals or venues
  • Track record of sharing work on platforms like Twitter or Arxiv
  • Daily use of AI tools like ChatGPT, Cursor, Perplexity

🎁 Benefits & Perks

  • 🏥 100% covered medical, dental, vision for employees and dependents
  • 💰 Equity alongside base compensation
  • 🏋️ 401(k) with commuter benefits and in-office lunch
  • 🤖 Access to state-of-the-art models and generous AI tool usage
  • 📈 Ownership of high-impact projects across top enterprises
0 0 0