20h ago

Agentic AI/ML Engineer - Multimodal

Irvine, CA

✨ $150k-$220k / year (est.)

Full-time · Mid-level

🛠 Tech Stack

💼 About This Role

You'll drive research and model development for Field AI's Field-insight Foundation Model (FiFM), focusing on computer vision, vision-language models, and agentic AI. Your work powers a global fleet of autonomous robots, transforming multimodal data into actionable insights. You'll curate datasets, fine-tune models, and deploy into production, blending applied research and engineering.

🎯 What You'll Do

  • Train and fine-tune multimodal models for computer vision and video understanding.
  • Track state-of-the-art research and adapt novel algorithms.
  • Curate datasets and develop model interpretability tools.
  • Build scalable evaluation pipelines for vision and multimodal models.

📋 Requirements

  • Master's/Ph.D. in Computer Science, AI/ML, or equivalent experience.
  • 2+ years industry experience in CV/ML/AI.
  • Proficiency in Python and PyTorch, with production-level coding experience.
  • Experience fine-tuning open-source multimodal models (HuggingFace, DeepSpeed, etc.).

✨ Nice to Have

  • Experience with Agentic/RAG pipelines and knowledge graphs.
  • Background in optimization (token cost reduction, quantization).
  • Exposure to open-vocabulary detection or multimodal RAG.

🎁 Benefits & Perks

  • 💻 Work on embodied AI that ships on real robots.
  • 🚀 Rapid experimentation with state-of-the-art models.
  • 🌍 Global deployment of your work across a fleet of robots.

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. Hiring Manager Interview · 45 min