2h ago

Senior Research Engineer - Audio Post-Training

Europe

$175k-$250k / yearest.

full-timesenior Remoteai-ml

🛠 Tech Stack

💼 About This Role

You'll join a team of 40+ researchers working on generative speech and voice synthesis to bring in-house voice models to production quality. Your work will directly impact solutions used by over 60,000 businesses worldwide. You'll tackle challenging post-training optimization to achieve real-time, expressive synthetic voices.

🎯 What You'll Do

  • Adapt models for new conditioning inputs like emotion, speed, and prosody.
  • Fine-tune speech models using DPO, LoRA, and parameter-efficient methods.
  • Implement post-training optimization techniques (quantization, pruning, distillation).
  • Design and implement new evaluation metrics for TTS systems.

📋 Requirements

  • Strong understanding of generative modeling applied to sequential or multimodal data.
  • Hands-on experience with LLMs or similar transformer-based architectures.
  • High proficiency in PyTorch with distributed training and model optimization.
  • Proven experience training deep learning models end-to-end from data to evaluation.

✨ Nice to Have

  • Familiarity with state-of-the-art audio generation architectures (diffusion, neural codecs).
  • Experience with speech-to-speech or text-to-speech systems.
  • Original research contributions at ICASSP, Interspeech, NeurIPS, or ICML.

🎁 Benefits & Perks

  • 💰 Competitive compensation with salary, stock options, and bonus.
  • 🏠 Fully remote from Europe or hybrid in London, Amsterdam, Zurich, Munich.
  • 🗓️ 25 days annual leave plus public holidays.
  • 🤝 Great company culture with regular planning and socials at hubs.
0 0 0