4h ago

TPU Kernel Engineer

San Francisco, CA | New York City, NY | Seattle, WA

$280,000-$850,000 / year

full-timemidArtificial Intelligence Visa Sponsor

Tech Stack

Description

You will identify and address performance issues across ML systems, design and optimize kernels for TPUs, and provide feedback on how model changes impact performance. This role involves low-level optimization and collaboration with researchers.

Requirements

  • Significant experience optimizing ML systems for TPUs, GPUs, or other accelerators
  • Results-oriented with bias towards flexibility and impact
  • Willingness to pair program
  • Interest in machine learning research and societal impacts
  • Bachelor's degree in relevant field

Responsibilities

  • Identify and address performance issues across ML systems
  • Design and optimize kernels for TPU
  • Provide feedback to researchers on model performance impact
  • Implement low-latency, high-throughput sampling for large language models
  • Adapt existing models for low-precision inference
0 views 0 saves 0 applications