20h ago

ML Infrastructure Engineer

UK

โœจ $100k-$160k / yearest.

full-timesenior Remoteai-ml

๐Ÿ›  Tech Stack

๐Ÿ’ผ About This Role

You'll join a cutting-edge AI infrastructure team, benchmarking and optimizing GPU platforms for large-scale machine learning workloads. You'll drive performance improvements across compute architectures, software stacks, and distributed AI environments. This role offers exposure to modern AI frameworks and high-performance GPU ecosystems with international collaboration.

๐ŸŽฏ What You'll Do

  • Benchmark GPU platform performance for ML and AI workloads
  • Profile GPU performance and identify optimization opportunities
  • Analyze and debug training and inference workloads
  • Conduct acceptance testing for new GPU clusters

๐Ÿ“‹ Requirements

  • Deep learning architecture and optimization knowledge
  • Experience with PyTorch, JAX, or Megatron-LM
  • Expertise in CUDA, NCCL, and GPU software stacks
  • Proficiency in Python and performance profiling tools

โœจ Nice to Have

  • Experience with LLM inference frameworks like vLLM
  • Familiarity with cloud ML ecosystems (AWS, GCP, Azure)
  • Contributions to open-source ML tooling

๐ŸŽ Benefits & Perks

  • ๐Ÿ’ฐ Competitive compensation aligned with experience
  • ๐Ÿ  Flexible remote work supporting work-life balance
  • ๐Ÿ“š Continuous learning and career development
  • ๐ŸŒ International collaboration with global teams

๐Ÿ“จ Hiring Process

Estimated timeline: 2-4 weeks ยท AI estimate

  1. 1Recruiter Screenยท 30 min
  2. 2Technical Interviewยท 60 min
  3. 3Final Interviewยท 45 min
0 0 0