论文标题
在(随机)成对的中值
On Medians of (Randomized) Pairwise Means
论文作者
论文摘要
最近在Lugosi&Mendelson(2016)中引入的锦标赛程序提供了一种吸引人的替代方案,从理论角度,至少是机器学习中经验风险最小化的原则。中位数(MOM)的统计学习基本上是将培训数据分割为相等大小的块,并比较每个数据块上每对候选决策规则的统计性能:在大多数块上的表现最高,将其宣布为获胜者。在非参数回归的背景下,赢得所有决斗的功能已被证明超过了经验风险最小化的W.R.T.在最小假设下的平均平方误差,同时表现出鲁棒性。本文的目的是扩展这种方法以解决其他学习问题,特别是在该问题上,绩效标准以对成对的观测值的期望形式,而不是对一个观察结果进行,就像成对排名,群集或度量学习中一样。确切地说,在这里证明,当块是通过独立采样而没有替换方案而不是简单的分割的独立采样来构建块时,妈妈实现的界限基本上是保守的。这些结果接下来扩展到风险与成对损失功能相关的情况,其经验对应物的形式为$ u $统计的形式。除了确保提出的学习/估计方法表现的理论结果之外,一些数值实验还提供了其在实践中相关性的经验证据。
Tournament procedures, recently introduced in Lugosi & Mendelson (2016), offer an appealing alternative, from a theoretical perspective at least, to the principle of Empirical Risk Minimization in machine learning. Statistical learning by Median-of-Means (MoM) basically consists in segmenting the training data into blocks of equal size and comparing the statistical performance of every pair of candidate decision rules on each data block: that with highest performance on the majority of the blocks is declared as the winner. In the context of nonparametric regression, functions having won all their duels have been shown to outperform empirical risk minimizers w.r.t. the mean squared error under minimal assumptions, while exhibiting robustness properties. It is the purpose of this paper to extend this approach in order to address other learning problems, in particular for which the performance criterion takes the form of an expectation over pairs of observations rather than over one single observation, as may be the case in pairwise ranking, clustering or metric learning. Precisely, it is proved here that the bounds achieved by MoM are essentially conserved when the blocks are built by means of independent sampling without replacement schemes instead of a simple segmentation. These results are next extended to situations where the risk is related to a pairwise loss function and its empirical counterpart is of the form of a $U$-statistic. Beyond theoretical results guaranteeing the performance of the learning/estimation methods proposed, some numerical experiments provide empirical evidence of their relevance in practice.