3h ago
Software Engineer, ML Inference Performance
Palo Alto, California, United States
full-timeseniorArtificial Intelligence / Semiconductors
Tech Stack
Description
In this role, you will work on optimizing ML inference performance by collaborating across the compiler stack and hardware teams. You'll dig into PyTorch and ML models to map operations efficiently onto SambaNova's platform, driving innovation in compiler infrastructure and optimization algorithms.
Requirements
- Bachelor's or Master's in CS/CE or equivalent with 5-10 years industry experience
- Deep theoretical understanding of compiler fundamentals
- Experience building and deploying software products
- Experience with deep learning frameworks (TensorFlow, PyTorch) is a plus
- Excitement about high-performance systems engineering and performance debugging
Responsibilities
- Lead compiler engineering, ensuring standard methodologies and process evolution
- Collaborate with peers, domain experts, developers, and customers to find optimal solutions
- Develop, integrate, and implement products
- Support proposals in key areas aligned with core team competencies
0 views 0 saves 0 applications