Reproducible Benchmark Pipeline
1 mentions across 0 people
All mentions
Unknown speaker
Recommendedpaper · 2026-04-09
“We release a reproducible benchmark pipeline, aggregated results, and paired statistical analyses to support deployment-oriented evaluation of reasoning LLMs under real resource constraints.”
Efficiency-Accuracy Trade-offs in Reasoning LLMs: Beyond MoE ↗