🛠 tool

Qwen 25 3b Instruct

1 mentions across 0 people

All mentions

Unknown speaker

paper · 2026-04-10

Recommended

“We evaluate TrACE against greedy decoding and fixed-budget self-consistency (SC-4, SC-8) on two benchmarks spanning single-step reasoning (GSM8K, n=50) and multi-step household navigation (MiniHouse, n=30), using a Qwen 2.5 3B Instruct model running on CPU.”

TrACE: Adaptive Compute for LLM Agents via Inter-Rollout Agreement ↗