Research Scientist, Video Generation
Union Square, New York City
$175k-$275k / year
full-time · ai-ml
About This Role
You'll advance multimodal video generation at Mirage, working on novel modeling approaches for a platform used by millions of creators. Your core impact will be pushing generation quality, controllability, and realism in facial expression, audio-to-video sync, and human motion. You'll validate ideas through real-world product impact.
What You'll Do
- Develop novel approaches to video and multimodal generative modeling
- Design new training objectives, loss functions, and evaluation methods
- Explore temporal modeling, controllability, and multimodal alignment
- Conduct empirical studies to understand scaling behavior and model performance
Requirements
- MS/PhD in ML, CS, or related field
- Strong publication record (NeurIPS, ICML, ICLR, etc.)
- Deep expertise in generative modeling (diffusion, autoregressive architectures)
- Deep understanding of transformers and modern multimodal systems
Nice to Have
- Experience with large-scale training and empirical research
- Experience optimizing models for real-time inference efficiency
- Strong experience with audio representations and audio-visual datasets
Benefits & Perks
- Comprehensive medical, dental, and vision plans
- 401(k) with employer match
- Catered lunch multiple days per week
- Generous PTO policy
- Team offsites and monthly events
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Onsite Interview · 3 hours