4h ago

Staff Software Engineer, ML Platform

Pittsburgh, PA or Remote
full-timesenior Remoteautonomous vehicles

Tech Stack

Description

You will build state-of-the-art multimodal data mining and semantic search solutions to power autonomous vehicle product development. You will develop data understanding platform infrastructure for real-time querying and batch/stream processing using technologies like Ray, Spark, and Lance.

Requirements

  • Experience with ML platforms and building ML-based applications.
  • Proven track record of building scalable, reliable infrastructure in a fast-paced environment.
  • Ability to collaborate effectively across teams.
  • Deep understanding of design trade-offs with ability to articulate and align with others.
  • 6+ years experience with multimodal data indexing, inference pipelines, semantic search, embedding generation, vector DB, large scale ML pipelines (Airflow/Flyte), and model optimization.

Responsibilities

  • Build multimodal data mining and semantic search solutions for AV product development.
  • Develop data understanding platform infrastructure with real-time querying/vector databases and batch/stream processing using Ray, Spark, Lance.
  • Deliver end-to-end data mining solutions spanning onboard (C++) and offboard (ML Data Infra) infrastructure.
  • Develop e2e solution for real-time semantic search services (text/images/videos) and vector DBs.
  • Architect and tune ETL pipelines to maximize GPU/CPU/Ram utilization.
0 views 0 saves 0 applications