22h ago

Staff Software Engineer - AI Traffic & Inference Infrastructure

Bengaluru

✨ $200k-$300k / year (est.)

full-time · lead · e-commerce

💼 About This Role

You'll design and scale the intelligent nervous system of Coupang's AI platform, building orchestration and routing layers for LLMs and foundation models. Your work will ensure high availability, low latency, and cost efficiency across thousands of accelerators. This role offers a unique opportunity to shape the infrastructure powering one of the fastest-growing e-commerce companies.

🎯 What You'll Do

  • Design load-balancing algorithms for AI workloads
  • Architect inference orchestration with auto-scaling
  • Minimize tail latency using batching and caching
  • Build infrastructure-as-code pipelines for dynamic fleets

📋 Requirements

  • 8–12 years of software engineering experience
  • Proficiency in Go or Python
  • Expert-level knowledge of Kubernetes internals
  • Experience with load balancing and request routing

✨ Nice to Have

  • Experience with LLM inference infrastructure
  • Familiarity with mixed precision and kernel tuning
  • Multi-cloud deployment experience (AWS, Azure, GCP)

🎁 Benefits & Perks

  • 🏖️ Unlimited PTO
  • 💰 Equity participation
  • 🏥 Comprehensive health insurance
  • 📚 Learning and development budget

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Screen · 30 min
  2. Technical Interview · 60 min
  3. System Design Interview · 60 min