3h ago

Network Engineer

Palo Alto, CA

$180k-$440k / year

full-timeleadArtificial Intelligence

🛠 Tech Stack

💼 About This Role

You'll join a small, highly motivated team building AI/HPC networks at xAI. You'll work on optimizing performance and availability for training and inference workloads. This role offers the opportunity to shape the next generation of backend and front-end networks.

🎯 What You'll Do

  • Developing and optimizing RoCEv2 at hyperscale.
  • Building metric dashboards for network performance.
  • Designing next-gen networks for GPU infrastructure.
  • Participating in on-call rotation and travel.

📋 Requirements

  • 10+ years designing and operating large scale networks.
  • 5 years in ethernet AI/HPC space.
  • Deep understanding of congestion control on ethernet.
  • Proficiency with NCCL for debugging and contributions.
  • Python experience for automation and data analysis.

✨ Nice to Have

  • Infiniband experience.
  • Experience with AI training and inference workloads.

🎁 Benefits & Perks

  • 💰 Competitive base salary plus equity.
  • 🏥 Comprehensive medical, vision, and dental coverage.
  • 🏦 401(k) retirement plan.
  • 🛡️ Short and long-term disability insurance.
  • 🛡️ Life insurance.
0 0 0