Universally Consistent K-Sample Tests via Dependence Measures
Shows that any consistent measure of statistical dependence gives rise to a consistent k-sample test, unifying a family of methods under a single framework.
Abstract
The K-sample testing problem involves determining whether K groups of data points are each drawn from the same distribution. Analysis of variance is arguably the most classical method to test mean differences, along with several recent methods to test distributional differences. In this paper, we demonstrate the existence of a transformation that allows K-sample testing to be carried out using any dependence measure. Consequently, universally consistent K-sample testing can be achieved using a universally consistent dependence measure, such as distance correlation and the Hilbert-Schmidt independence criterion. This enables a wide range of dependence measures to be easily applied to K-sample testing.
@article{panda2025universally,
title = {Universally Consistent {{K-sample}} Tests via Dependence Measures},
author = {Panda, Sambit and Shen, Cencheng and Perry, Ronan and Zorn, Jelle and Lutz, Antoine and Priebe, Carey E. and Vogelstein, Joshua T.},
year = 2025,
month = jan,
journal = {Statistics \& Probability Letters},
volume = {216},
pages = {110278},
issn = {0167-7152},
doi = {10.1016/j.spl.2024.110278},
}