1d ago
AI System Research and Development Engineer - Optimization
US-WA-Bellevue
$200k-$287.5k / year
full-time · senior · hybrid · software
About This Role
You'll join Snowflake's AI Research team to build efficient, scalable LLM inference and training systems. Your work will power agentic enterprise applications and reduce costs through innovations like SwiftKV. You'll collaborate with founding members of DeepSpeed, vLLM, and TensorFlow.
What You'll Do
- Analyze and optimize GPU kernel performance for LLM training and inference.
- Profile and benchmark deep learning systems to identify bottlenecks.
- Design optimizations to reduce latency and improve resource utilization.
- Contribute to agentic frameworks and applications for LLM workflows.
Requirements
- Bachelor's degree in CS, EE, or related field (Master's or PhD preferred).
- 5 years of experience in GPU kernel, deep learning, or HPC optimization.
- Proficiency in deep learning frameworks such as PyTorch, TensorFlow, or JAX.
- Experience with CUDA and GPU architectures.
Nice to Have
- Experience with CUTLASS, Triton, cuDNN.
- Experience with profiling tools (nvprof, Nsight).
Benefits & Perks
- Competitive salary and equity.
- Hybrid work flexibility.
- Innovation-driven culture with top-tier collaborators.
- Opportunities to publish at top conferences.
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Onsite Interviews · 4 hours