12h ago
Research Engineer, Data Infrastructure
Palo Alto
β¨ $200k-$300k / yearest.
full-timesenior Hybridai-ml Visa Sponsor
π Tech Stack
πΌ About This Role
You'll build and optimize large-scale learning systems for open-weight models, working with Research Scientists. You'll accelerate researchers by developing robust tools and integrating cutting-edge research into production. This role spans platform infrastructure and embedded research teams.
π― What You'll Do
- Build and enhance shared training framework and data pipelines.
- Integrate checkpoints and streamline evaluation workflows.
- Conduct experiments on distributed training with thousands of GPUs.
- Deliver prototypes that become production-grade components.
π Requirements
- Masterβs or PhD in Computer Science or equivalent.
- 4+ years working on large-scale ML codebases.
- Hands-on with PyTorch, JAX, or TensorFlow.
- Experience with distributed training (DeepSpeed, FSDP, SLURM, K8s).
β¨ Nice to Have
- Experience with CUDA or data pipelines.
- Background in deep learning, NLP, or LLMs.
π Benefits & Perks
- π° Competitive salary and equity
- π Healthcare: Medical/Dental/Vision covered for you and family
- ποΈ PTO: 18 days
- π Visa sponsorship
- π₯ Meal stipend: $400 monthly allowance
π¨ Hiring Process
Estimated timeline: 2-4 weeks Β· AI estimate
- 1Recruiter ScreenΒ· 30 min
- 2Technical InterviewΒ· 60 min
- 3Onsite InterviewΒ· 120 min
0 0 0