Solutions Engineer (Media)

Remote

$110k-$140k / year (est.)

full-time, mid, Remote, ai-ml

💼 About This Role

You'll own data quality and curation for Protege's media catalog, translating customer AI data needs into structured datasets. You will work with imperfect, real-world partner data using SQL, embeddings, and AI tools. This role is central to delivering high-quality training data that powers ambitious AI teams.

🎯 What You'll Do

  • Translate customer requirements into curation strategies using SQL and internal APIs
  • Normalize and standardize datasets with mismatched metadata and schema differences
  • Run iterative sample reviews with customers and refine selections based on feedback
  • Build validation checks and workflows to ensure dataset integrity before delivery

📋 Requirements

  • 4-7 years of experience in data science, media analytics, or technical curation
  • Strong SQL proficiency for querying large, messy datasets
  • Experience working with media metadata, embeddings, or unstructured content
  • Ability to translate nuanced customer requirements into concrete dataset specs

✨ Nice to Have

  • Familiarity with video/audio processing or multimodal AI workflows
  • Prior experience curating or packaging datasets for machine learning
  • Background in content analysis, recommendation systems, or information retrieval

🎁 Benefits & Perks

  • 🚀 Real ownership and autonomy in a lean, high-trust team
  • 📈 Rapid growth surrounded by impact-driven builders
  • 🤝 Kind and direct culture with early and frequent feedback