Data Infrastructure Engineer

Bengaluru

$3000k-$6000k / year (est.)

full-time · ai-ml

💼 About This Role

You'll build the distributed data layer powering our semantic platform for autonomous AI agents. You'll design high-throughput ingestion and real-time processing systems that make enterprise data understandable. Join a seed-stage team deploying with major enterprises.

🎯 What You'll Do

  • Build and operate large-scale data pipelines on Spark, Kafka, and Ray.
  • Design fault-tolerant streaming and batch systems that move terabytes reliably.
  • Optimize data workflows for performance, cost, and latency.
  • Collaborate with ML and product engineers to ensure data is discoverable and queryable.

📋 Requirements

  • Deep experience with distributed data systems (Spark, Kafka, Flink, Ray).
  • Strong programming skills in Python, Scala, or Java.
  • Comfort with Kubernetes and cloud environments (AWS/GCP/Azure).
  • Solid understanding of streaming vs. batch tradeoffs and scaling patterns.

✨ Nice to Have

  • Experience with Ray.
  • Familiarity with ML pipelines.
  • Background in knowledge graphs.

🎁 Benefits & Perks

  • 💰 Competitive salary and meaningful equity
  • 🏖️ Full benefits including health insurance
  • 🍽️ Meals provided
  • 🖥️ Equipment budget
  • 🏢 Onsite collaboration with a small senior team