Research Engineer (Agentic Models)

Amsterdam, Netherlands; Belgrade, Serbia; Berlin, Germany; Limassol, Cyprus; London, United Kingdom; Madrid, Spain; Munich, Germany; Paphos, Cyprus; Prague, Czech Republic; Remote, Germany; Warsaw, Poland; Yerevan, Armenia
Full-time; Senior; Remote; Software development tools

Description

You will design and implement training pipelines for multi-step coding agents, train LLMs for agent workflows, and build evaluation frameworks to improve model performance. Your work will directly impact developer tools used by millions.

Requirements

  • Extensive hands-on experience training LLMs in research or production
  • Deep expertise in PyTorch and LLM training stacks (e.g., Megatron, NeMo, verl)
  • Strong understanding of LLM architectures, tokenization, data pipelines, and distributed training
  • Ability to own projects end to end, from problem definition to implementation
  • At least 3 years of Python experience in modern ML codebases

Responsibilities

  • Design and maintain SFT and RL post-training pipelines for multi-step coding agents
  • Train and adapt LLMs for agent workflows, including planning and tool use
  • Build evaluation and simulation environments for coding agents
  • Develop evaluation frameworks and metrics for agent behavior
  • Analyze training results and improve models, architectures, and datasets