Agentsafetybench
1 mentions across 0 people
All mentions
Unknown speaker
Recommendedpaper · 2026-04-20
“workflow-style tasks adapted from 20 Agent-SafetyBench environments with noisy tool returns.”
MemEvoBench Reveals Severe Memory-Induced Safety Drift in LLM Agents ↗