Software Engineer - GPU Inference

San Francisco

$165k-$330k / year

Full-time · Senior · Hybrid · AI/ML

💼 About This Role

You'll own the inference stack for Voice AI models, joining a small founding team focused on bringing state-of-the-art open source models to production. Your work will power mission-critical Voice AI deployments across productivity, customer service, and more, making a meaningful impact on people's daily lives. You'll drive high-ownership initiatives from roadmap through engineering implementation.

🎯 What You'll Do

  • Own Voice AI product areas end-to-end, from architecture to production operations.
  • Design and operate real-time, large-scale model serving systems for STT/TTS/voice agents.
  • Drive cross-team collaboration on full-stack technical problems and delivery coordination.
  • Mentor teammates through code reviews, design docs, and technical leadership.

📋 Requirements

  • Bachelor's degree or higher in Computer Science or related field.
  • Proven track record of owning production-grade, real-time, large-scale systems where tail latency (p99) matters.
  • Proficient in Python or a similar language.
  • Comfortable using AI coding assistants (e.g., Claude Code, Cursor) as a daily productivity multiplier.

✨ Nice to Have

  • Experience implementing pipeline-level model runtime optimizations (dynamic batching, async scheduling).
  • Experience building developer platforms (SDKs, CLIs, APIs) for ML or infrastructure products.
  • Familiarity with speech/audio ML models (STT, TTS) and model-serving runtimes (vLLM, TensorRT).

🎁 Benefits & Perks

  • 🏖️ Flexible PTO including company-wide Winter Break
  • 🩺 100% medical/dental/vision coverage for employee and dependents
  • 👶 Paid parental leave and fertility/family-building stipend
  • 🏦 401(k) company-facilitated
  • 📈 Competitive compensation including meaningful equity

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. System Design Interview · 60 min
  4. Hiring Manager Interview · 45 min