5h ago

Software Engineer, Frontier Systems

San Francisco

$250k-$445k / year

full-timeseniorai-ml

🛠 Tech Stack

💼 About This Role

You'll build critical infrastructure that keeps hyperscale supercomputers reliable for cutting-edge AI research. You'll own system health checks, lead deep dives into hardware failures, and build automation to fix issues at scale across thousands of machines.

🎯 What You'll Do

  • Own and improve system health checks for hyperscale supercomputers
  • Lead deep dives into hardware failures and system-level bugs
  • Build automation to monitor and fix issues across thousands of machines

📋 Requirements

  • 7+ years of industry experience in software engineering
  • Proficiency with Python and shell scripting
  • Experience with SQL, PromQL, and Pandas for data analysis
  • Experience developing reproducible analyses

✨ Nice to Have

  • Experience with low-level hardware components and Linux tooling
  • Expertise with network operations and tooling
  • Expertise with power management and stabilization
0 0 0