16h ago
Principal Systems Software Engineer
San Francisco, CA
$260k-$340k / year
full-timeleadai-ml
🛠 Tech Stack
💼 About This Role
You'll serve as the visionary lead for Crusoe's next-generation AI infrastructure, bridging silicon and software to unify Bare-Metal-as-a-Service, Intelligent IaaS, and Elastic CaaS into a high-performance pool of intelligence. Your core impact is to redefine the I/O path for generative AI by designing fluid fabrics that push massive-scale training workloads to hardware limits. This role stands out for its hyperscale-level systems architecture and hands-on R&D leadership.
🎯 What You'll Do
- Architect Bare-Metal-as-a-Service with zero-latency InfiniBand/RDMA fabrics
- Design thin virtualization layers using KVM or custom micro-VMs
- Build high-performance container substrate with Kubernetes or Slurm
- Lead R&D on novel memory, networking, and compute methods
📋 Requirements
- 12+ years designing core infrastructure at a major hyperscaler or HPC cloud
- Authoritative knowledge of Linux kernel, KVM, QEMU, and high-performance networking
- Proven ability to design software maximizing NVIDIA/AMD GPU and high-speed NIC performance
- Bachelor's or Master's in CS or related field (or equivalent experience)
✨ Nice to Have
- Patents related to network virtualization, GPU scheduling, or distributed file systems
- Maintainer status or significant contributions to Linux Kernel, Kubernetes, or HPC projects
- Direct experience optimizing infrastructure for LLM training and inference at scale
🎁 Benefits & Perks
- 💰 Competitive compensation + Significant Equity & Bonus
- 🏖️ Paid time off & paid holidays
- 🏥 Comprehensive health, dental & vision insurance with HSA contributions
- 👶 Paid parental leave
- 📈 401(k) Retirement plan with company match up to 4% of salary
0 0 0