9h ago

Member of Technical Staff - Research - Model

London

$150k-$250k / yearest.

full-timesenior Hybridai-ml

🛠 Tech Stack

💼 About This Role

You'll develop and train advanced LLMs and VLMs for agentic AI systems. You'll research and implement large-scale reinforcement learning to enhance instruction following and tool use. This role operates at the intersection of research and product, shaping the next generation of superintelligent AI.

🎯 What You'll Do

  • Develop and train advanced LLMs and VLMs including multimodal architectures
  • Research and implement training methods for instruction following and tool use
  • Design and optimize data pipelines for large-scale distributed training
  • Collaborate with cross-functional teams to integrate models into agentic systems

📋 Requirements

  • Strong programming skills in Python and Git
  • Expertise in PyTorch, JAX, or TensorFlow
  • Experience with large-scale distributed training of LLMs and VLMs
  • Hands-on experience with LLM training, alignment, and reinforcement learning

✨ Nice to Have

  • Industry experience with LLM training using reinforcement learning
  • Experience with data processing techniques
  • Publications in top-tier AI conferences (e.g., NeurIPS, ICML, CVPR, ACL)

🎁 Benefits & Perks

  • 🚀 Join a hottest AI startup shaping the future of superintelligence
  • 🌍 Collaborate with world-class AI talent in a multicultural team
  • 📈 Professional growth and continuous learning opportunities
  • 💰 Competitive salary and early-stage equity potential
0 0 0