12h ago

Research Engineer, Data Infrastructure

Palo Alto

✨ $200k-$300k / yearest.

full-timesenior Hybridai-ml Visa Sponsor

πŸ›  Tech Stack

πŸ’Ό About This Role

You'll build and optimize large-scale learning systems for open-weight models, working with Research Scientists. You'll accelerate researchers by developing robust tools and integrating cutting-edge research into production. This role spans platform infrastructure and embedded research teams.

🎯 What You'll Do

  • Build and enhance shared training framework and data pipelines.
  • Integrate checkpoints and streamline evaluation workflows.
  • Conduct experiments on distributed training with thousands of GPUs.
  • Deliver prototypes that become production-grade components.

πŸ“‹ Requirements

  • Master’s or PhD in Computer Science or equivalent.
  • 4+ years working on large-scale ML codebases.
  • Hands-on with PyTorch, JAX, or TensorFlow.
  • Experience with distributed training (DeepSpeed, FSDP, SLURM, K8s).

✨ Nice to Have

  • Experience with CUDA or data pipelines.
  • Background in deep learning, NLP, or LLMs.

🎁 Benefits & Perks

  • πŸ’° Competitive salary and equity
  • πŸš‘ Healthcare: Medical/Dental/Vision covered for you and family
  • 🏝️ PTO: 18 days
  • 🌎 Visa sponsorship
  • πŸ₯• Meal stipend: $400 monthly allowance

πŸ“¨ Hiring Process

Estimated timeline: 2-4 weeks Β· AI estimate

  1. 1Recruiter ScreenΒ· 30 min
  2. 2Technical InterviewΒ· 60 min
  3. 3Onsite InterviewΒ· 120 min
0 0 0