7h ago
Software Engineer, Infrastructure
San Francisco, California
$180k-$350k / year
Full-time · Software · Visa Sponsor
💼 About This Role
You'll build large-scale infrastructure for Exa's custom search engine, which powers AI applications. You'll optimize GPU clusters, Kubernetes orchestration, and batch job systems. This role offers the chance to work on state-of-the-art infrastructure and systems that span tens of thousands of machines.
🎯 What You'll Do
- Build GPU cluster orchestration on Kubernetes
- Scale batch job systems for map-reduce over tens of thousands of machines
- Design GPU scheduling software to maximize cluster utilization
- Build observability into production systems
📋 Requirements
- Experience designing and operating large-scale infrastructure
- Expertise with GPU clusters or large Kubernetes clusters
- Expertise with cloud batch job systems
- Obsessive focus on reliability, observability, and optimization