Diffumask
1 mentions across 1 person
All mentions
Wes Roth
Recommendedpaper · 2026-04-10
“We present DiffuMask, a diffusion-based framework integrating hierarchical shot-level and token-level pruning signals, that enables rapid and parallel prompt pruning via iterative mask prediction.”
DiffuMask: Enhancing LLM Efficiency through Diffusion-Based Prompt Pruning ↗