16h ago
Machine Learning Engineer
Singapore
$80k-$180k / year (est.)
full-time · software
About This Role
You'll build and scale data pipelines for large-scale video and multimodal model training at Cantina, a social AI company. Your work will directly power how characters come to life in real time. You'll own the full pipeline from raw content to training-ready datasets, with a focus on speed, reliability, and cost-efficiency.
What You'll Do
- Design and scale distributed data pipelines for preprocessing and dataset generation
- Own workflow orchestration, job scheduling, monitoring, and failure recovery
- Implement and maintain containerized pipeline infrastructure using Kubernetes
- Optimize cloud-based data storage and movement across providers for cost and efficiency
- Define and implement best practices for dataset storage layout, versioning, and caching
Requirements
- Hands-on experience with large-scale data systems and pipelines for machine learning
- Experience with distributed data processing frameworks like PySpark or Ray
- Familiarity with containerization and orchestration using Docker and Kubernetes
- Experience with cloud-based storage and compute (AWS, GCS, or Azure)
Nice to Have
- Experience with VLM-based captioning or quality/aesthetic scoring models for video/image
- Familiarity with CLIP-based or embedding-based filtering techniques
- Proficiency in video/media processing tools like FFmpeg, PyAV, or OpenCV
Benefits & Perks
- Competitive salary and generous company equity
- Personal time off and paid holidays
- Health insurance
- Global travel insurance
- Monthly spending stipend: $500 (~S$635)
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Call · 30 min
- 2. Technical Interview · 60 min
- 3. Hiring Manager Interview · 45 min