7h ago
ML Infrastructure Engineer
Germany
$115k-$145k / year (est.)
full-time · mid · Remote · ai-ml
Tech Stack
About This Role
You'll benchmark and optimize GPU platforms for large-scale AI workloads, working at the intersection of deep learning optimization and cloud-scale infrastructure. You'll contribute directly to improving the performance and efficiency of cutting-edge AI systems. The role offers exposure to modern AI frameworks and international collaboration.
What You'll Do
- Benchmark GPU performance for ML and AI workloads across architectures
- Profile GPU performance at system and kernel levels for optimization
- Analyze and debug training/inference workloads for efficiency and scalability
- Develop internal tools and dashboards for performance metrics
Requirements
- Strong foundation in machine learning and deep learning architectures
- Deep understanding of performance optimization for neural network training/inference
- Extensive experience with PyTorch, JAX, or TensorRT-LLM
- Solid expertise with CUDA, NCCL, and GPU software stacks
Nice to Have
- Experience with LLM inference frameworks like vLLM or SGLang
- Familiarity with cloud ML ecosystems (AWS, GCP, Azure ML)
- Contributions to open-source ML tooling or benchmarking
Benefits & Perks
- Competitive compensation aligned with experience
- Flexible remote work
- Continuous learning and career development
- International work environment with global teams
- Impactful AI projects shaping ML infrastructure
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Call · 30 min
- 2. Technical Interview · 60 min
- 3. Team Fit Interview · 45 min