absorb.md

Medagentbench

1 mentions across 1 person

Visit ↗
Andrew Ng
paper · 2025-01-24
Recommended

MedAgentBench establishes this and is publicly available at https://github.com/stanfordmlgroup/MedAgentBench , offering a valuable framework for model developers to track progress and drive continuous improvements in the agent capabilities of large language models within the medical domain.

MedAgentBench: A Virtual EHR Environment for LLM Agent Benchmarking