18h ago
Senior AI Infrastructure Engineer
Porto
$66.5k-$104.5k / year
full-timesenior Remotehealthcare
🛠 Tech Stack
+1
💼 About This Role
You'll own the infrastructure that brings AI models to life in production, from optimizing LLM inference to deploying real-time voice AI agents. You will sit at the intersection of ML and infrastructure, designing systems that power real-time computer vision and conversational AI at scale.
🎯 What You'll Do
- Design and maintain inference infrastructure for AI products
- Own end-to-end deployment pipeline for AI models
- Architect and scale Kubernetes clusters for GPU workloads
- Build infrastructure for real-time AI agents and WebRTC
- Drive inference scaling strategies like speculative decoding
📋 Requirements
- 5+ years in infrastructure engineering, 2+ years focused on AI/ML workloads
- Strong experience with Kubernetes for GPU-accelerated workloads
- Hands-on experience with model serving and inference optimization
- Solid understanding of LLM inference optimization techniques
- Experience with Infrastructure as Code (Terraform) and GitOps
✨ Nice to Have
- Experience with LLM serving engines like vLLM or SGLang
- Experience with NVIDIA Triton and TensorRT
- Familiarity with WebRTC infrastructure and real-time media streaming
🎁 Benefits & Perks
- 🏖️ Unlimited vacation
- 💻 Remote or Hybrid work policy
- 🏥 Health and well-being program (digital therapist sessions)
- 📈 Career development and growth with competitive salary
- 🚀 Fast-paced environment with high-tech startup
0 0 0