20h ago
Agentic AI/ML Engineer - Multimodal
Irvine, CA
$150k-$220k / year (est.)
Full-time · Mid-level
About This Role
You'll drive research and model development for Field AI's Field-insight Foundation Model (FiFM), focusing on computer vision, vision-language models, and agentic AI. Your work powers a global fleet of autonomous robots, transforming multimodal data into actionable insights. You'll curate datasets, fine-tune models, and deploy into production, blending applied research and engineering.
What You'll Do
- Train and fine-tune multimodal models for computer vision and video understanding.
- Track state-of-the-art research and adapt novel algorithms.
- Curate datasets and develop model interpretability tools.
- Build scalable evaluation pipelines for vision and multimodal models.
Requirements
- Master's/Ph.D. in Computer Science, AI/ML, or equivalent experience.
- 2+ years industry experience in CV/ML/AI.
- Proficiency in Python and PyTorch with production-level coding.
- Experience fine-tuning open-source multimodal models (HuggingFace, DeepSpeed, etc.).
Nice to Have
- Experience with Agentic/RAG pipelines and knowledge graphs.
- Background in optimization (token cost reduction, quantization).
- Exposure to open-vocabulary detection or multimodal RAG.
Benefits & Perks
- Work on embodied AI that ships on real robots.
- Rapid experimentation with state-of-the-art models.
- Global deployment of your work across a fleet of robots.
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Hiring Manager Interview · 45 min