7h ago

Software Engineer, Infrastructure

San Francisco, California

$180k-$350k / year

full-timesoftware Visa Sponsor

🛠 Tech Stack

💼 About This Role

You'll build massive-scale infrastructure for Exa's custom search engine, powering AI applications. You'll optimize GPU clusters, Kubernetes orchestration, and batchjob systems. This role offers the chance to work on state-of-the-art infrastructure and big-scale systems.

🎯 What You'll Do

  • Build GPU cluster orchestration on Kubernetes
  • Scale batchjob systems for map-reduce over tens of thousands of machines
  • Design GPU scheduling software to maximize cluster utilization
  • Build observability into production systems

📋 Requirements

  • Experience designing and operating large-scale infrastructure
  • Expertise with GPU clusters or large Kubernetes clusters
  • Expertise with cloud batchjob systems
  • Obsessive mindset on reliability, observability, and optimization
0 0 0