1d ago

Infrastructure Engineer (GPU & Compute)

United States

$180k-$200k / year

full-timesenior Remoteai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll own image management, deployment, and validation across GPU-enabled infrastructure for AI workloads. You'll lead system validation and diagnostics to ensure production-ready environments from day one. Collaborate with cross-functional teams to scale compute for the future of AI.

๐ŸŽฏ What You'll Do

  • Evolve image management and deployment systems for GPU environments
  • Lead GPU diagnostics and validation workflows
  • Build automation tools in Python for provisioning
  • Support hardware qualification for new platforms

๐Ÿ“‹ Requirements

  • 5+ years of infrastructure engineering experience
  • Strong Linux systems administration in production
  • Hands-on experience with GPU-enabled systems and NVIDIA DCGM
  • Proficiency in Python for automation

โœจ Nice to Have

  • Experience with high-performance interconnects (InfiniBand, NVLink)
  • AI/ML or HPC workload experience
  • Large-scale hardware validation frameworks

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary $180k-$200k
  • ๐Ÿ“ˆ Performance bonus and equity
  • ๐Ÿฉบ Medical, dental, vision coverage
  • ๐Ÿ–๏ธ Generous PTO and paid parental leave
  • ๐Ÿก Flexible remote work options

๐Ÿ“จ Hiring Process

Estimated timeline: 2-3 weeks ยท AI estimate

  1. 1Recruiter screenยท 30 min
  2. 2Technical interviewยท 60 min
  3. 3Hiring manager roundยท 45 min
0 0 0