Software Engineer, ML Inference Performance

Palo Alto, California, United States
Full-time, Senior, Artificial Intelligence / Semiconductors

Description

In this role, you will work on optimizing ML inference performance by collaborating across the compiler stack and hardware teams. You'll dig into PyTorch and ML models to map operations efficiently onto SambaNova's platform, driving innovation in compiler infrastructure and optimization algorithms.

Requirements

  • Bachelor's or Master's in CS/CE or equivalent, with 5-10 years of industry experience
  • Deep theoretical understanding of compiler fundamentals
  • Experience building and deploying software products
  • Experience with deep learning frameworks (TensorFlow, PyTorch) is a plus
  • Excitement about high-performance systems engineering and performance debugging

Responsibilities

  • Lead compiler engineering, maintaining standard methodologies and evolving team processes
  • Collaborate with peers, domain experts, developers, and customers to find optimal solutions
  • Develop, integrate, and implement products
  • Support proposals in key areas aligned with core team competencies