7h ago

Machine Learning Data Engineer

Karlsruhe

$75k-$110k / yearest.

full-timemid HybridAutonomous systems / Simulation

🛠 Tech Stack

💼 About This Role

You'll build and scale data pipelines for Parallel Domain's Replica product, enabling ML model development for autonomous systems. Your work will directly impact how raw customer inputs become structured datasets for training and evaluation. You'll own the data flow from ingestion to curation, ensuring efficiency and quality at scale.

🎯 What You'll Do

  • Build reliable pipelines to normalize and validate customer data
  • Create schemas, validation checks, and quality metrics for datasets
  • Implement tools for dataset filtering, versioning, and annotation support
  • Generate high-quality data feeds for ML training and evaluation

📋 Requirements

  • Proven experience building scalable data pipelines and tooling
  • Understanding of how data is used in model training and evaluation
  • Practical experience with 3D concepts, geometry, and linear algebra
  • Strong Python proficiency and comfort with large datasets

✨ Nice to Have

  • MS or PhD in ML, computer vision, robotics, or related field
  • Familiarity with cloud storage and distributed processing frameworks
  • Experience handling camera, lidar, or radar data

🎁 Benefits & Perks

  • 💶 Competitive compensation based on skills and experience
  • 🚀 Impactful work advancing autonomous systems and AI
  • 🤝 Collaborative culture with a supportive team
  • 📈 Professional growth opportunities in a cutting-edge field

🚩 Heads Up

  • Vague compensation description ('dependent on skills, qualifications, experience, and location')
  • No explicit salary range listed in the posting
0 0 0