2h ago
Member of Technical Staff - Mid-Training Infra
San Francisco
✨ $200k-$400k / yearest.
full-timeseniorai-ml
🛠 Tech Stack
💼 About This Role
You'll design and build large-scale GPU infrastructure for high-throughput model inference and mid-training workloads. Your work will power synthetic data generation and reinforcement learning pipelines at scale. Join a small talent-dense team building open foundation models from the ground up.
🎯 What You'll Do
- Design and build large-scale GPU infrastructure for inference and training
- Develop systems for synthetic data generation and RL pipelines
- Optimize throughput, latency, and GPU utilization for LLM inference
- Build infrastructure for distributed RL workloads and model evaluation
📋 Requirements
- Experience deploying and operating large-scale GPU systems for inference
- Several years of hands-on production infrastructure experience
- Strong understanding of GPU performance characteristics and optimization
- Experience with modern inference frameworks like SGLang or Megatron
✨ Nice to Have
- Familiarity with distributed RL infrastructure or rollout generation systems
- Experience with GPU kernels or low-level performance optimization
- Experience debugging performance issues across GPU, networking, and distributed layers
🎁 Benefits & Perks
- 💰 Top-tier compensation including salary and equity
- 🏥 Comprehensive health insurance (medical, dental, vision)
- 👶 Fully paid parental leave for all new parents
- 🍽️ Daily lunch and dinner provided
- 🌍 Relocation support and paid time off
0 0 0