5h ago
Software Engineer, Frontier Systems
San Francisco
$250k-$445k / year
full-timeseniorai-ml
🛠 Tech Stack
💼 About This Role
You'll build critical infrastructure that keeps hyperscale supercomputers reliable for cutting-edge AI research. You'll own system health checks, lead deep dives into hardware failures, and build automation to fix issues at scale across thousands of machines.
🎯 What You'll Do
- Own and improve system health checks for hyperscale supercomputers
- Lead deep dives into hardware failures and system-level bugs
- Build automation to monitor and fix issues across thousands of machines
📋 Requirements
- 7+ years of industry experience in software engineering
- Proficiency with Python and shell scripting
- Experience with SQL, PromQL, and Pandas for data analysis
- Experience developing reproducible analyses
✨ Nice to Have
- Experience with low-level hardware components and Linux tooling
- Expertise with network operations and tooling
- Expertise with power management and stabilization
0 0 0