12h ago
C++ Systems Engineer
New York City
$175k-$275k / year
full-timemidai-ml
๐ Tech Stack
๐ผ About This Role
You'll design and optimize the core native runtime for LM Studio, working across LLM engines and GPU backends. You'll implement system-level code for concurrency, memory, and IPC, and integrate platform acceleration paths like Metal and CUDA. This role offers the chance to shape the future of on-device AI software.
๐ฏ What You'll Do
- Contribute to the C++ runtime powering LM Studio
- Extend LLM engine integrations for desktop OS
- Implement resilient IPC and scheduling logic
- Improve build, packaging, and release infrastructure
๐ Requirements
- 4+ years building production C++ systems software
- Deep knowledge of concurrency and memory management
- C++11 (or newer) expertise with RAII mindset
- Experience optimizing performance with profilers
โจ Nice to Have
- Experience with GPU backends (Metal, CUDA, Vulkan)
- Familiarity with llama.cpp or MLX
- Cross-platform development (macOS, Windows, Linux)
๐ Benefits & Perks
- ๐ฐ Competitive salary with equity
- ๐๏ธ New York City based, in-person collaboration
- ๐งโ๐ป Small team with direct impact
- ๐ Cutting-edge AI products
๐จ Hiring Process
Estimated timeline: 2-3 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Onsite Interviewยท half day
0 0 0