17h ago

Audio Inference Engineer, Model Efficiency

New York

$150k-$230k / yearest.

full-timesenior Remoteai-ml

🛠 Tech Stack

💼 About This Role

You'll join a team optimizing audio inference serving efficiency using innovative techniques. You'll advance core metrics like latency, throughput, and quality for real-time audio processing. You'll collaborate with training and serving infrastructure teams for seamless deployment.

🎯 What You'll Do

  • Optimize audio inference serving systems for latency and throughput
  • Identify bottlenecks in audio processing and streaming workloads
  • Develop creative solutions for real-time audio inference
  • Collaborate with training and serving infrastructure teams

📋 Requirements

  • Significant experience developing high-performance audio or ML inference systems
  • Proficiency in C++ and Python
  • Hands-on experience with deep learning models for audio, speech, or language

✨ Nice to Have

  • GPU programming and low-level system optimization
  • Experience with duplex real-time streaming architectures
  • Internals of ML frameworks for audio (PyTorch, TensorFlow)

🎁 Benefits & Perks

  • 🤝 Open and inclusive culture
  • 🧑‍💻 Work on cutting-edge AI research
  • 🍽 Weekly lunch stipend and in-office lunches
  • 🦷 Full health and dental benefits with mental health budget
  • ✈️ 6 weeks of vacation
0 0 0