1d ago

AI Data Engineer

Columbia

✨ $130k-$170k / year (est.)

full-time · senior · Remote · ai-ml

💼 About This Role

You'll design and maintain large-scale data pipelines for generative AI applications, integrating ML models with traditional data handling. Your work directly impacts ML model efficiency by orchestrating complex data processes that seamlessly combine ML inference with data transformations. This role offers the opportunity to innovate within a fast-paced environment pushing AI boundaries.

🎯 What You'll Do

  • Design and maintain AI-augmented large-scale data pipelines.
  • Own remote ML model inference orchestration within pipelines.
  • Build scalable pipelines for generating and serving vector embeddings.
  • Source, filter, and curate training datasets in line with data governance requirements.
  • Design and operate pipelines using LLMs and vision models.

📋 Requirements

  • Bachelor's degree in Computer Science or related field.
  • 5+ years in data engineering, ML engineering, or a hybrid role.
  • Experience building production data pipelines invoking ML models at scale.
  • Strong understanding of data governance and quality processes.

✨ Nice to Have

  • Familiarity with asynchronous data tasks and batch processing.
  • Experience with generative AI or vision models.
  • Knowledge of vector embeddings and storage.

🎁 Benefits & Perks

  • 💻 Flexible work schedule with remote options
  • 📚 Professional development and continuous learning
  • 🔬 Access to cutting-edge technologies and tools
  • 🏥 Medical, dental, vision coverage for eligible employees

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Call · 30 min
  2. Technical Interview · 60 min
  3. Hiring Manager Interview · 45 min