Medagentbench
1 mentions across 1 person
Visit ↗All mentions
“MedAgentBench establishes this and is publicly available at https://github.com/stanfordmlgroup/MedAgentBench , offering a valuable framework for model developers to track progress and drive continuous improvements in the agent capabilities of large language models within the medical domain.”
MedAgentBench: A Virtual EHR Environment for LLM Agent Benchmarking ↗