20h ago
ML Infrastructure Engineer
UK
โจ $100k-$160k / yearest.
full-timesenior Remoteai-ml
๐ Tech Stack
๐ผ About This Role
You'll join a cutting-edge AI infrastructure team, benchmarking and optimizing GPU platforms for large-scale machine learning workloads. You'll drive performance improvements across compute architectures, software stacks, and distributed AI environments. This role offers exposure to modern AI frameworks and high-performance GPU ecosystems with international collaboration.
๐ฏ What You'll Do
- Benchmark GPU platform performance for ML and AI workloads
- Profile GPU performance and identify optimization opportunities
- Analyze and debug training and inference workloads
- Conduct acceptance testing for new GPU clusters
๐ Requirements
- Deep learning architecture and optimization knowledge
- Experience with PyTorch, JAX, or Megatron-LM
- Expertise in CUDA, NCCL, and GPU software stacks
- Proficiency in Python and performance profiling tools
โจ Nice to Have
- Experience with LLM inference frameworks like vLLM
- Familiarity with cloud ML ecosystems (AWS, GCP, Azure)
- Contributions to open-source ML tooling
๐ Benefits & Perks
- ๐ฐ Competitive compensation aligned with experience
- ๐ Flexible remote work supporting work-life balance
- ๐ Continuous learning and career development
- ๐ International collaboration with global teams
๐จ Hiring Process
Estimated timeline: 2-4 weeks ยท AI estimate
- 1Recruiter Screenยท 30 min
- 2Technical Interviewยท 60 min
- 3Final Interviewยท 45 min
0 0 0