absorb.md

Machine Learning Theory

No compiled wiki article for this topic yet. Raw entries below are the source material — a wiki article can be generated on demand from /admin/triggers.

Order-Optimal Sequential 1-Bit Mean Estimation Achieved Despite Quantization Constraints

This paper introduces a novel adaptive mean estimator that achieves order-optimal sample complexity for 1-bit mean estimation across all tail regimes (k > 1). The method utilizes randomized threshold queries and demonstrates that the performance matches unquantized methods with an additional localiz

Spectral Wasserstein Flow for Gradient Normalization in Deep Learning

This paper introduces Spectral Wasserstein distances as a framework to analyze gradient normalization techniques like Muon in deep learning. It establishes a connection between these distances and various normalization schemes, including Schatten-type norms. The framework provides theoretical founda