absorb.md

Qwen 25 3b Instruct

1 mentions across 0 people

Unknown speaker
paper · 2026-04-10
Recommended

We evaluate TrACE against greedy decoding and fixed-budget self-consistency (SC-4, SC-8) on two benchmarks spanning single-step reasoning (GSM8K, n=50) and multi-step household navigation (MiniHouse, n=30), using a Qwen 2.5 3B Instruct model running on CPU.

TrACE: Adaptive Compute for LLM Agents via Inter-Rollout Agreement