Qwen 25 3b Instruct
1 mentions across 0 people
All mentions
Unknown speaker
Recommendedpaper · 2026-04-10
“We evaluate TrACE against greedy decoding and fixed-budget self-consistency (SC-4, SC-8) on two benchmarks spanning single-step reasoning (GSM8K, n=50) and multi-step household navigation (MiniHouse, n=30), using a Qwen 2.5 3B Instruct model running on CPU.”
TrACE: Adaptive Compute for LLM Agents via Inter-Rollout Agreement ↗