16h ago

Research Scientist, Video Generation

Union Square, New York City

$175k-$275k / year

full-timeai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll advance multimodal video generation at Mirage, working on novel modeling approaches for a platform used by millions of creators. Your core impact will be pushing generation quality, controllability, and realism in facial expression, audio-to-video sync, and human motion. You'll validate ideas through real-world product impact.

๐ŸŽฏ What You'll Do

  • Develop novel approaches to video and multimodal generative modeling
  • Design new training objectives, loss functions, and evaluation methods
  • Explore temporal modeling, controllability, and multimodal alignment
  • Conduct empirical studies to understand scaling behavior and model performance

๐Ÿ“‹ Requirements

  • MS/PhD in ML, CS, or related field
  • Strong publication record (NeurIPS, ICML, ICLR, etc.)
  • Deep expertise in generative modeling (diffusion, autoregressive architectures)
  • Deep understanding of transformers and modern multimodal systems

โœจ Nice to Have

  • Experience with large-scale training and empirical research
  • Experience optimizing models for real-time inference efficiency
  • Strong experience with audio representations and audio-visual datasets

๐ŸŽ Benefits & Perks

  • ๐Ÿฅ Comprehensive medical, dental, and vision plans
  • ๐Ÿ’ฐ 401K with employer match
  • ๐Ÿฑ Catered lunch multiple days per week
  • ๐ŸŒด Generous PTO policy
  • ๐Ÿ–๏ธ Team offsites and monthly events

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Screenยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Onsite Interviewยท 3 hours
0 0 0