7h ago

Research Scientist for Speech Synthesis

Valka.ai

$130k-$180k / yearest.

contractsenior HybridArtificial Intelligence

🛠 Tech Stack

💼 About This Role

You'll join a visionary team to develop text-to-speech and voice cloning models for interactive avatars. You'll push the limits of synthetic voice generation, building efficient training pipelines and custom validation metrics. This role offers a chance to redefine generative content in gaming, entertainment, and education.

🎯 What You'll Do

  • Design and optimize text-to-speech models for voice actor authenticity.
  • Experiment with neural TTS and voice cloning architectures.
  • Define validation strategies and implement custom evaluation metrics.
  • Contribute to MLOps practices and production infrastructure.

📋 Requirements

  • Experience with deep learning frameworks (PyTorch, TensorFlow, or JAX).
  • Understanding of audio processing (sampling, spectrograms, vocoders).
  • Experience training text-to-speech and voice cloning models.
  • Familiarity with speech synthesis models (WaveNet, Tacotron, VITS).

✨ Nice to Have

  • Experience with voice cloning models like XTTS, YourTTS.
  • Experience with transformers and diffusion models.
  • Ability to implement ideas from research papers.

🎁 Benefits & Perks

  • 🚀 Cutting-edge AI work on novel interactive platform.
  • 💻 Hybrid work flexibility.
  • 🌍 Global team with diverse innovators.
  • 📈 Opportunity to shape foundational technology.
0 0 0