2d ago

Software Engineer, Compute Infrastructure

San Francisco

$230k-$405k / year

full-time · Hybrid · ai-ml

About This Role

You'll build the compute platform powering frontier AI research, working across the entire stack from hardware to developer tools. Your work will directly accelerate research velocity by optimizing large-scale distributed systems. This role offers the chance to tackle unique challenges at unprecedented scale and impact.

What You'll Do

  • Design and optimize system software for large-scale compute clusters
  • Profile and benchmark training workloads to identify bottlenecks
  • Build automation for provisioning, firmware upgrades, and operations
  • Develop tools and abstractions to improve researcher productivity

Requirements

  • Experience building or operating distributed systems or infrastructure platforms
  • Proficiency in systems programming (C++, Rust, or similar)
  • Deep understanding of computer architecture and performance optimization
  • Strong cross-layer debugging skills from hardware to application

Nice to Have

  • Experience with Kubernetes or other container orchestration platforms
  • Knowledge of high-performance networking (RDMA, InfiniBand, NCCL)
  • Background in GPU hardware and CUDA programming

Benefits & Perks

  • Competitive equity package
  • Comprehensive health insurance
  • Flexible PTO policy

Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Phone Interview · 60 min
  3. Onsite Interview (3-4 rounds) · Half day