14h ago
Data Infrastructure Engineer
Bengaluru
✨ $3000k-$6000k / year (est.)
full-time, ai-ml
💼 About This Role
You'll build the distributed data layer powering our semantic platform for autonomous AI agents. You'll design high-throughput ingestion and real-time processing systems that make enterprise data understandable. Join a seed-stage team deploying with major enterprises.
🎯 What You'll Do
- Build and operate large-scale data pipelines on Spark, Kafka, and Ray.
- Design fault-tolerant streaming and batch systems that move terabytes reliably.
- Optimize data workflows for performance, cost, and latency.
- Collaborate with ML and product engineers to ensure data is discoverable and queryable.
📋 Requirements
- Deep experience with distributed data systems (Spark, Kafka, Flink, Ray).
- Strong programming skills in Python, Scala, or Java.
- Comfort with Kubernetes and cloud environments (AWS/GCP/Azure).
- Solid understanding of streaming vs. batch tradeoffs and scaling patterns.
✨ Nice to Have
- Experience with Ray.
- Familiarity with ML pipelines.
- Background in knowledge graphs.
🎁 Benefits & Perks
- 💰 Competitive salary and meaningful equity
- 🏖️ Full benefits including health insurance
- 🍽️ Meals provided
- 🖥️ Equipment budget
- 🏢 Onsite collaboration with a small senior team