3h ago
Network Engineer
Palo Alto, CA
$180k-$440k / year
full-timeleadArtificial Intelligence
🛠 Tech Stack
💼 About This Role
You'll join a small, highly motivated team building AI/HPC networks at xAI. You'll work on optimizing performance and availability for training and inference workloads. This role offers the opportunity to shape the next generation of backend and front-end networks.
🎯 What You'll Do
- Developing and optimizing RoCEv2 at hyperscale.
- Building metric dashboards for network performance.
- Designing next-gen networks for GPU infrastructure.
- Participating in on-call rotation and travel.
📋 Requirements
- 10+ years designing and operating large scale networks.
- 5 years in ethernet AI/HPC space.
- Deep understanding of congestion control on ethernet.
- Proficiency with NCCL for debugging and contributions.
- Python experience for automation and data analysis.
✨ Nice to Have
- Infiniband experience.
- Experience with AI training and inference workloads.
🎁 Benefits & Perks
- 💰 Competitive base salary plus equity.
- 🏥 Comprehensive medical, vision, and dental coverage.
- 🏦 401(k) retirement plan.
- 🛡️ Short and long-term disability insurance.
- 🛡️ Life insurance.
0 0 0