稀疏的专注内存网络，用于单击速率预测长序列

论文标题

稀疏的专注内存网络，用于单击速率预测长序列

Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences

论文作者

Lin, Qianying, Zhou, Wen-Ji, Wang, Yanshi, Da, Qing, Chen, Qing-Guo, Wang, Bing

论文摘要

顺序推荐通过其历史互动来预测用户的下一个行为。推荐更长的序列可以提高建议精度并提高个性化程度。随着序列的延长，现有作品尚未解决以下两个主要挑战。首先，在序列长度增加时，很难对远程内部序列依赖性进行建模。其次，它需要有效的内存和计算速度。在本文中，我们提出了一个稀疏的细心内存（SAM）网络，以进行长期顺序的用户行为建模。 SAM支持对用户行为序列的有效培训和实时推断，其长度为数千。在SAM中，我们将目标项目建模为查询，而长序列为知识数据库，在该数据库中，前者不断从后者中引起相关信息。 SAM同时模拟了目标序列依赖性和远程内部依赖性，其复杂性和O（1）顺序更新数量，只能通过具有O（l^2）复杂性的自我注意机制来实现。广泛的经验结果表明，我们提出的解决方案不仅在长期用户行为建模中而且在短序列建模中也有效。 SAM按照长度为1000的序列实施，成功地部署在最大的国际电子商务平台之一上。此推论时间在30毫秒内，在线A/B测试的点击率提高了7.30％。据我们所知，这是第一个端到端的长时间用户序列建模框架，它以上述效率程度对序列和目标序列依赖性进行建模，并成功地部署在大型实时工业推荐系统上。

Sequential recommendation predicts users' next behaviors with their historical interactions. Recommending with longer sequences improves recommendation accuracy and increases the degree of personalization. As sequences get longer, existing works have not yet addressed the following two main challenges. Firstly, modeling long-range intra-sequence dependency is difficult with increasing sequence lengths. Secondly, it requires efficient memory and computational speeds. In this paper, we propose a Sparse Attentive Memory (SAM) network for long sequential user behavior modeling. SAM supports efficient training and real-time inference for user behavior sequences with lengths on the scale of thousands. In SAM, we model the target item as the query and the long sequence as the knowledge database, where the former continuously elicits relevant information from the latter. SAM simultaneously models target-sequence dependencies and long-range intra-sequence dependencies with O(L) complexity and O(1) number of sequential updates, which can only be achieved by the self-attention mechanism with O(L^2) complexity. Extensive empirical results demonstrate that our proposed solution is effective not only in long user behavior modeling but also on short sequences modeling. Implemented on sequences of length 1000, SAM is successfully deployed on one of the largest international E-commerce platforms. This inference time is within 30ms, with a substantial 7.30% click-through rate improvement for the online A/B test. To the best of our knowledge, it is the first end-to-end long user sequence modeling framework that models intra-sequence and target-sequence dependencies with the aforementioned degree of efficiency and successfully deployed on a large-scale real-time industrial recommender system.

下载PDF全文

下载文献需遵守相关版权规定

论文标题