6h ago

AI Researcher (Voice)

San Francisco

$160k-$250k / year

full-timesenior Hybridai-ml

🛠 Tech Stack

💼 About This Role

You'll lead research on generative video and audio models for real-time human simulation, working at a Series A startup backed by Sequoia and YC. Your core impact will be advancing text-to-speech and speech-to-speech models that power expressive AI avatars. This role offers the chance to publish at top venues and see your research productionized.

🎯 What You'll Do

  • Lead research on generative audio and video models
  • Collaborate with Applied ML team on productionization
  • Stay current with latest advancements and drive innovation

📋 Requirements

  • Proven experience with flow matching, diffusion models, or auto regressive networks in audio domain
  • Experience training deep learning models from medium to large scale
  • Experience building streaming text-to-speech or speech-to-speech models
  • Publications in top-tier venues like CVPR, NeurIPS, or equivalent

✨ Nice to Have

  • PhD or equivalent experience preferred
  • Skills in 3D graphics or Gaussian splatting
  • Experience leading research teams

🎁 Benefits & Perks

  • 🏖️ Unlimited PTO
  • 🏥 Competitive healthcare
  • 💻 Gear stipends
  • 📈 Equity
  • 🔄 Flexible work schedule
0 0 0