论文标题
纠正学习到级别系统中的选择偏见
Correcting for Selection Bias in Learning-to-rank Systems
论文作者
论文摘要
现代推荐系统收集的点击数据是观察数据的重要来源,可用于训练学习级(LTR)系统。但是,这些数据遭受了许多偏见,可能导致LTR系统的性能差。此类系统中偏差校正的最新方法主要集中在位置偏差上,即使在用户的查询时,即使不是最相关的结果,也更可能单击排名更高的结果(例如,顶级搜索引擎结果)的事实。对纠正选择偏差的关注减少了,这是因为单击文档首先反映了向用户显示了哪些文档的原因。在这里,我们提出了新的反事实方法,以适应Heckman的两阶段方法,并解释LTR系统中的选择和位置偏差。我们的经验评估表明,与现有的无偏LTR算法相比,我们提出的方法对噪声更为强大,并且具有更好的准确性,尤其是在中等到没有位置偏见的情况下。
Click data collected by modern recommendation systems are an important source of observational data that can be utilized to train learning-to-rank (LTR) systems. However, these data suffer from a number of biases that can result in poor performance for LTR systems. Recent methods for bias correction in such systems mostly focus on position bias, the fact that higher ranked results (e.g., top search engine results) are more likely to be clicked even if they are not the most relevant results given a user's query. Less attention has been paid to correcting for selection bias, which occurs because clicked documents are reflective of what documents have been shown to the user in the first place. Here, we propose new counterfactual approaches which adapt Heckman's two-stage method and accounts for selection and position bias in LTR systems. Our empirical evaluation shows that our proposed methods are much more robust to noise and have better accuracy compared to existing unbiased LTR algorithms, especially when there is moderate to no position bias.