1d ago
AI Data Engineer
Columbia
โจ $130k-$170k / yearest.
full-timesenior Remoteai-ml
๐ Tech Stack
๐ผ About This Role
You'll design and maintain large-scale data pipelines for generative AI applications, integrating ML models with traditional data handling. Your work directly impacts ML model efficiency by orchestrating complex data processes that seamlessly combine ML inference with data transformations. This role offers the opportunity to innovate within a fast-paced environment pushing AI boundaries.
๐ฏ What You'll Do
- Design and maintain AI-augmented large-scale data pipelines.
- Own remote ML model inference orchestration within pipelines.
- Build scalable pipelines for generating and serving vector embeddings.
- Source, filter, and curate training datasets with governance.
- Design and operate pipelines using LLMs and vision models.
๐ Requirements
- Bachelor's degree in Computer Science or related field.
- 5+ years in data engineering, ML engineering, or hybrid role.
- Experience building production data pipelines invoking ML models at scale.
- Strong understanding of data governance and quality processes.
โจ Nice to Have
- Familiarity with asynchronous data tasks and batch processing.
- Experience with generative AI or vision models.
- Knowledge of vector embeddings and storage.
๐ Benefits & Perks
- ๐ป Flexible work schedule with remote options
- ๐ Professional development and continuous learning
- ๐ฌ Access to cutting-edge technologies and tools
- ๐ฅ Medical, dental, vision coverage for eligible employees
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Callยท 30 min
- 2Technical Interviewยท 60 min
- 3Hiring Manager Interviewยท 45 min
0 0 0