7h ago

ML Infrastructure Engineer

Germany

✨ $115k-$145k / year (est.)

full-time · mid · Remote · ai-ml

💼 About This Role

You'll benchmark and optimize GPU platforms for large-scale AI workloads, working at the intersection of deep learning optimization and cloud-scale infrastructure. Contribute directly to improving performance and efficiency of cutting-edge AI systems. This role offers exposure to modern AI frameworks and international collaboration.

🎯 What You'll Do

  • Benchmark GPU performance for ML and AI workloads across architectures
  • Profile GPU performance at system and kernel levels for optimization
  • Analyze and debug training/inference workloads for efficiency and scalability
  • Develop internal tools and dashboards for performance metrics

📋 Requirements

  • Strong foundation in machine learning and deep learning architectures
  • Deep understanding of performance optimization for neural network training/inference
  • Extensive experience with PyTorch, JAX, or TensorRT-LLM
  • Solid expertise with CUDA, NCCL, and GPU software stacks

✨ Nice to Have

  • Experience with LLM inference frameworks like vLLM or SGLang
  • Familiarity with cloud ML ecosystems (AWS, GCP, Azure ML)
  • Contributions to open-source ML tooling or benchmarking

🎁 Benefits & Perks

  • 💰 Competitive compensation aligned with experience
  • 🏡 Flexible remote work
  • 📚 Continuous learning and career development
  • 🌍 International work environment with global teams
  • 🔬 Impactful AI projects shaping ML infrastructure

📨 Hiring Process

Estimated timeline: 2-4 weeks · AI estimate

  1. Recruiter Call · 30 min
  2. Technical Interview · 60 min
  3. Team Fit Interview · 45 min