2d ago
Software Engineer, Compute Infrastructure
San Francisco
$230k-$405k / year
full-time Hybridai-ml
๐ Tech Stack
๐ผ About This Role
You'll build the compute platform powering frontier AI research, working across the entire stack from hardware to developer tools. Your work will directly accelerate research velocity by optimizing large-scale distributed systems. This role offers the chance to tackle unique challenges at unprecedented scale and impact.
๐ฏ What You'll Do
- Design and optimize system software for large-scale compute clusters
- Profile and benchmark training workloads to identify bottlenecks
- Build automation for provisioning, firmware upgrades, and operations
- Develop tools and abstractions to improve researcher productivity
๐ Requirements
- Experience building or operating distributed systems or infrastructure platforms
- Proficiency in systems programming (C++, Rust, or similar)
- Deep understanding of computer architecture and performance optimization
- Strong cross-layer debugging skills from hardware to application
โจ Nice to Have
- Experience with Kubernetes or other container orchestration platforms
- Knowledge of high-performance networking (RDMA, InfiniBand, NCCL)
- Background in GPU hardware and CUDA programming
๐ Benefits & Perks
- ๐ฐ Competitive equity package
- ๐ฅ Comprehensive health insurance
- ๐๏ธ Flexible PTO policy
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Phone Interviewยท 60 min
- 3Onsite Interview (3-4 rounds)ยท Half day
0 0 0