1d ago

AI System Research and Development Engineer - Optimization

US-WA-Bellevue

$200k-$287.5k / year

full-timesenior Hybridsoftware

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll join Snowflake's AI Research team to build efficient, scalable LLM inference and training systems. Your work will power agentic enterprise applications and reduce costs through innovations like SwiftKV. You'll collaborate with founding members of DeepSpeed, vLLM, and TensorFlow.

๐ŸŽฏ What You'll Do

  • Analyze and optimize GPU kernel performance for LLM training and inference.
  • Profile and benchmark deep learning systems to identify bottlenecks.
  • Design optimizations to reduce latency and improve resource utilization.
  • Contribute to agentic frameworks and applications for LLM workflows.

๐Ÿ“‹ Requirements

  • Bachelor's degree in CS, EE, or related field (Master's or PhD preferred).
  • 5 years of experience in GPU kernel, deep learning, or HPC optimization.
  • Proficiency in deep learning frameworks such as PyTorch, TensorFlow, JAX.
  • Experience with CUDA and GPU architectures.

โœจ Nice to Have

  • Experience with CUTLASS, Triton, cuDNN.
  • Experience with profiling tools (nvprof, Nsight).

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive salary and equity.
  • ๐Ÿ  Hybrid work flexibility.
  • ๐Ÿ“š Innovation-driven culture with top-tier collaborators.
  • ๐ŸŒŸ Opportunities to publish at top conferences.

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Screenยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Onsite Interviewsยท 4 hours
0 0 0