5h ago

Applied Research Scientist

US - Remote

$150k-$200k / yearest.

full-timesenior Remotehealthcare

🛠 Tech Stack

💼 About This Role

You'll build and scale automated evaluation pipelines for clinical-grade AI agents at a healthcare startup. Your work will define benchmarks and deploy CI gates for accuracy and safety. This role offers first-of-their-kind problems at the intersection of AI and medicine.

🎯 What You'll Do

  • Design and implement LLM-as-judge evaluation pipelines
  • Define clinical-grade benchmarks for agentic tasks
  • Build automated CI gates for accuracy and safety
  • Partner with engineering to land evaluation frameworks

📋 Requirements

  • Python and ML background with PyTorch/TensorFlow
  • Experience with LLM evaluation and benchmarking
  • Track record of published research or deep applied work in LLMs

✨ Nice to Have

  • Experience with healthcare or clinical NLP
  • Familiarity with LangChain or LlamaIndex
  • Strong communication and technical writing skills

🎁 Benefits & Perks

  • 🧠 High-impact work on first-of-their-kind problems
  • ⚡️ Autonomy and ownership with urgent execution
  • ❤️ Direct impact on doctor efficiency and patient care

🚩 Heads Up

  • No salary mentioned in listing
0 0 0