Machine Learning Theory
No compiled wiki article for this topic yet. Raw entries below are the source material — a wiki article can be generated on demand from /admin/triggers.
All entries on this topic (2)
Order-Optimal Sequential 1-Bit Mean Estimation Achieved Despite Quantization Constraints
This paper introduces a novel adaptive mean estimator that achieves order-optimal sample complexity for 1-bit mean estimation across all tail regimes (k > 1). The method utilizes randomized threshold queries and demonstrates that the performance matches unquantized methods with an additional localiz…
Spectral Wasserstein Flow for Gradient Normalization in Deep Learning
This paper introduces Spectral Wasserstein distances as a framework to analyze gradient normalization techniques like Muon in deep learning. It establishes a connection between these distances and various normalization schemes, including Schatten-type norms. The framework provides theoretical founda…