22h ago
Staff Software Engineer - AI Traffic & Inference Infrastructure
Bengaluru
$200k-$300k / year (est.)
full-time · lead · e-commerce
About This Role
You'll design and scale the intelligent nervous system of Coupang's AI platform, building orchestration and routing layers for LLMs and foundation models. Your work will ensure high availability, low latency, and cost efficiency across thousands of accelerators. This role offers a unique opportunity to shape the infrastructure powering one of the fastest-growing e-commerce companies.
What You'll Do
- Design load-balancing algorithms for AI workloads
- Architect inference orchestration with auto-scaling
- Minimize tail latency using batching and caching
- Build infrastructure-as-code pipelines for dynamic fleets
Requirements
- 8–12 years of software engineering experience
- Proficiency in Go or Python
- Expert-level knowledge of Kubernetes internals
- Experience with load balancing and request routing
Nice to Have
- Experience with LLM inference infrastructure
- Familiarity with mixed precision and kernel tuning
- Multi-cloud deployment experience (AWS, Azure, GCP)
Benefits & Perks
- Unlimited PTO
- Equity participation
- Comprehensive health insurance
- Learning and development budget
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
1. Recruiter Screen · 30 min
2. Technical Interview · 60 min
3. System Design Interview · 60 min