Poster Session 4 · Thursday, December 4, 2025 4:30 PM → 7:30 PM
#3308
Simple and Optimal Sublinear Algorithms for Mean Estimation
Abstract
We study the sublinear multivariate mean estimation problem in -dimensional Euclidean space. Specifically, we aim to find the mean of a ground point set , which minimizes the sum of squared Euclidean distances of the points in to .
We first show that a multiplicative approximation to can be found with probability using many independent uniform random samples, and provide a matching lower bound.
Furthermore, we give two estimators with optimal sample complexity that can be computed in optimal running time for extracting a suitable approximate mean:
- The coordinate-wise median of sample means of sample size . As a corollary, we also show improved convergence rates for this estimator for estimating means of multivariate distributions.
- The geometric median of sample means of sample size . To compute a solution efficiently, we design a novel and simple gradient descent algorithm that is significantly faster for our specific setting than all other known algorithms for computing geometric medians.
In addition, we propose an order statistics approach that is empirically competitive with these algorithms, has an optimal sample complexity and matches the running time up to lower order terms.
We finally provide an extensive experimental evaluation among several estimators which concludes that the geometric-median-of-means-based approach is typically the most competitive in practice.