16h ago

Principal Systems Software Engineer

San Francisco, CA

$260k-$340k / year

full-timeleadai-ml

🛠 Tech Stack

💼 About This Role

You'll serve as the visionary lead for Crusoe's next-generation AI infrastructure, bridging silicon and software to unify Bare-Metal-as-a-Service, Intelligent IaaS, and Elastic CaaS into a high-performance pool of intelligence. Your core impact is to redefine the I/O path for generative AI by designing fluid fabrics that push massive-scale training workloads to hardware limits. This role stands out for its hyperscale-level systems architecture and hands-on R&D leadership.

🎯 What You'll Do

  • Architect Bare-Metal-as-a-Service with zero-latency InfiniBand/RDMA fabrics
  • Design thin virtualization layers using KVM or custom micro-VMs
  • Build high-performance container substrate with Kubernetes or Slurm
  • Lead R&D on novel memory, networking, and compute methods

📋 Requirements

  • 12+ years designing core infrastructure at a major hyperscaler or HPC cloud
  • Authoritative knowledge of Linux kernel, KVM, QEMU, and high-performance networking
  • Proven ability to design software maximizing NVIDIA/AMD GPU and high-speed NIC performance
  • Bachelor's or Master's in CS or related field (or equivalent experience)

✨ Nice to Have

  • Patents related to network virtualization, GPU scheduling, or distributed file systems
  • Maintainer status or significant contributions to Linux Kernel, Kubernetes, or HPC projects
  • Direct experience optimizing infrastructure for LLM training and inference at scale

🎁 Benefits & Perks

  • 💰 Competitive compensation + Significant Equity & Bonus
  • 🏖️ Paid time off & paid holidays
  • 🏥 Comprehensive health, dental & vision insurance with HSA contributions
  • 👶 Paid parental leave
  • 📈 401(k) Retirement plan with company match up to 4% of salary
0 0 0