Fp16tuned Lr Schedule Applied To Int8int6 Qat
1 mentions across 0 people
All mentions
Unknown speaker
Recommendedpaper · 2026-05-26
“Practical recommendation: at sub-100M scale, tune the LR schedule once at FP16 and apply unchanged to INT8/INT6 QAT”
LR Schedule Is Bit-Width-Agnostic for Sub-100M QAT — Except INT4 Above 50M Param ↗