ML Infrastructure Engineer
Spain
$150k-$200k / year (est.)
full-time · senior · Remote · ai-ml
About This Role
You'll join a cutting-edge AI infrastructure team to benchmark and optimize GPU platforms for large-scale ML and AI workloads. You'll drive performance improvements across GPU architectures, deep learning frameworks, and distributed systems. This role provides exposure to modern AI frameworks and international collaboration.
What You'll Do
- Benchmark GPU platform performance for ML and AI workloads.
- Collaborate with hardware teams to profile and optimize GPU performance.
- Conduct acceptance testing for new GPU clusters.
- Develop tools and dashboards for performance metrics and trends.
Requirements
- Strong foundation in machine learning and deep learning architectures.
- Deep understanding of performance optimization for neural network training and inference.
- Extensive experience with PyTorch, JAX, or similar deep learning frameworks.
- Solid expertise with CUDA, NCCL, and GPU software stacks.
Nice to Have
- Experience with LLM inference frameworks like vLLM or TensorRT.
- Familiarity with cloud-based ML ecosystems (AWS, GCP, Azure).
- Contributions to open-source ML tooling or benchmarking frameworks.
Benefits & Perks
- Competitive compensation
- Flexible remote work
- Continuous learning and development
- International work environment
- High-performance AI systems exposure
Hiring Process
Estimated timeline: 2-4 weeks · AI estimate
- 1. Recruiter Screen · 30 min
- 2. Technical Interview · 60 min
- 3. Offer · 30 min