Memevobench
1 mentions across 0 people
All mentions
Unknown speaker
Recommendedpaper · 2026-04-20
“we introduce MemEvoBench, the first benchmark evaluating long-horizon memory safety in LLM agents against adversarial memory injection, noisy tool outputs, and biased feedback.”
MemEvoBench Reveals Severe Memory-Induced Safety Drift in LLM Agents ↗