6h ago
Software Engineer - GPU Inference
San Francisco
$165k-$330k / year
full-timesenior Hybridai-ml
๐ Tech Stack
๐ผ About This Role
You'll own the inference stack for Voice AI models, joining a small founding team focused on bringing state-of-the-art open source models to production. Your work will power mission-critical Voice AI deployments across productivity, customer service, and more, making a meaningful impact on people's daily lives. You'll drive high-ownership initiatives from roadmap through engineering implementation.
๐ฏ What You'll Do
- Own voice AI product areas end-to-end from architecture to production operations.
- Design and operate real-time, large-scale model serving systems for STT/TTS/voice agents.
- Drive cross-team collaboration on full-stack technical problems and delivery coordination.
- Mentor teammates through code reviews, design docs, and technical leadership.
๐ Requirements
- Bachelor's degree or higher in Computer Science or related field.
- Proven track record owning production-grade real-time large-scale systems where tail latency (p99) matters.
- Proficient coding in Python or similar languages.
- Comfortable using AI coding assistants (e.g., Claude Code, Cursor) as daily productivity multiplier.
โจ Nice to Have
- Experience implementing pipeline-level model runtime optimizations (dynamic batching, async scheduling).
- Experience building developer platforms (SDKs, CLIs, APIs) for ML or infrastructure products.
- Familiarity with speech/audio ML models (STT, TTS) and model-serving runtimes (vLLM, TensorRT).
๐ Benefits & Perks
- ๐๏ธ Flexible PTO including company-wide Winter Break
- ๐ฉบ 100% medical/dental/vision coverage for employee and dependents
- ๐ถ Paid parental leave and fertility/family-building stipend
- ๐ฆ 401(k) company-facilitated
- ๐ Competitive compensation including meaningful equity
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3System Design Interviewยท 60 min
- 4Hiring Manager Interviewยท 45 min
0 0 0