培训课程以重新排列开放域名

论文标题

培训课程以重新排列开放域名

Training Curricula for Open Domain Answer Re-Ranking

论文作者

MacAvaney, Sean, Nardini, Franco Maria, Perego, Raffaele, Tonellotto, Nicola, Goharian, Nazli, Frieder, Ophir

论文摘要

在以精确的为导向的任务中，例如答案排名，比检索所有相关答案更重要的是，对许多相关答案进行排名更重要。因此，一个良好的排名策略将是学习如何首先识别最简单的正确答案（即，将高级评分分配给具有通常表明相关特征的答案，并且对那些没有特征的人的排名得分很低），然后再将更复杂的逻辑纳入处理困难的情况（例如，语义匹配或推理或推理））。在这项工作中，我们将此想法应用于使用课程学习对神经答案排名者的培训。我们提出了几种启发式方法来估计给定培训样本的难度。我们表明，拟议的启发式方法可用于建立培训课程，该课程在培训过程的早期就降低了重量的困难样本。随着训练过程的进行，无论难度如何，我们的方法都会逐渐转向所有样品加权。我们对三个答案排名数据集的拟议想法进行了全面评估。结果表明，我们的方法会带来两种领先的神经排名架构，即Bert和Consknrm的出色表现，即使用方向和成对损失。当应用于基于BERT的排名时，我们的方法的MRR提高了4％，P@1提高了9％（与未经课程训练的模型相比）。这导致模型可以实现与更昂贵的最先进技术相当的性能。

In precision-oriented tasks like answer ranking, it is more important to rank many relevant answers highly than to retrieve all relevant answers. It follows that a good ranking strategy would be to learn how to identify the easiest correct answers first (i.e., assign a high ranking score to answers that have characteristics that usually indicate relevance, and a low ranking score to those with characteristics that do not), before incorporating more complex logic to handle difficult cases (e.g., semantic matching or reasoning). In this work, we apply this idea to the training of neural answer rankers using curriculum learning. We propose several heuristics to estimate the difficulty of a given training sample. We show that the proposed heuristics can be used to build a training curriculum that down-weights difficult samples early in the training process. As the training process progresses, our approach gradually shifts to weighting all samples equally, regardless of difficulty. We present a comprehensive evaluation of our proposed idea on three answer ranking datasets. Results show that our approach leads to superior performance of two leading neural ranking architectures, namely BERT and ConvKNRM, using both pointwise and pairwise losses. When applied to a BERT-based ranker, our method yields up to a 4% improvement in MRR and a 9% improvement in P@1 (compared to the model trained without a curriculum). This results in models that can achieve comparable performance to more expensive state-of-the-art techniques.

下载PDF全文

下载文献需遵守相关版权规定

论文标题