about 3 hours ago
Staff, Backend Engineer - Catalog
Palo Alto, California, United States
$225,000-$300,000 / year
full-timesenior HybridData Infrastructure / AI
Tech Stack
+3
Description
You will lead development of DataHub's Platform framework, building scalable ingestion systems, clean APIs, and event-driven architectures for real-time metadata processing. Your work will directly power the next generation of AI systems at global scale.
Requirements
- 8+ years building production-grade distributed systems
- Advanced Python and API design expertise
- Experience with high-scale data processing or integration frameworks
- Strong systems knowledge and distributed architecture experience
- Proven track record solving complex technical challenges
- Built and maintained online applications serving live traffic at scale (100+ QPS)
- Set up monitoring and alerting for services
- Designed indexing, storage, and data architectures to make large-scale data accessible to online services
- Designed and scaled distributed systems
- Hands-on experience developing in a tight loop with LLMs and applying best practices for scalable LLM development
- Proficiency in one of Java/Scala/Kotlin/C#/Go
Responsibilities
- Build scalable, fault-tolerant ingestion systems for enterprise-scale metadata
- Create clean, intuitive APIs for our connector ecosystem
- Develop event-driven architectures for real-time metadata processing
- Implement schema mapping between diverse systems and DataHub's unified model
- Build versioning systems for AI assets (training data, model weights, embeddings)
0 views 0 saves 0 applications