论文标题

DCN V2:改进了深层网络和网络规模学习的实用课程,以对系统进行排名

DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems

论文作者

Wang, Ruoxi, Shivanna, Rakesh, Cheng, Derek Z., Jain, Sagar, Lin, Dong, Hong, Lichan, Chi, Ed H.

论文摘要

学习有效的功能杂交是构建推荐系统的关键。但是,稀疏和较大的特征空间需要详尽的搜索才能识别有效的十字架。提出了Deep&Cross Network(DCN),以自动有效地学习有限的预测特征相互作用。不幸的是,在提供数十亿个培训示例的网络尺度流量的模型中,DCN在学习更多预测性功能相互作用时在其跨网络中表现不佳。尽管取得了重大的研究进展,但许多深度学习模型仍然依赖传统的喂养前馈神经网络来学习特征杂交的效率低下。 鉴于DCN和现有功能交互学习方法的利弊,我们提出了改进的框架DCN-V2,以使DCN在大规模的工业环境中更加实用。在一项针对广泛的超参数搜索和模型调整的全面实验研究中,我们观察到DCN-V2在流行的基准数据集上的所有最新算法都超过了所有最新算法。改进的DCN-V2具有更大的表现力,但在功能交互学习中仍然具有成本效益,尤其是当与低级体系结构的混合物结合在一起时。 DCN-V2很简单,很容易被用作构件,并且在许多网络规模的学习中提供了显着的离线准确性和在线业务指标的增长,以在Google上对系统进行排名。

Learning effective feature crosses is the key behind building recommender systems. However, the sparse and large feature space requires exhaustive search to identify effective crosses. Deep & Cross Network (DCN) was proposed to automatically and efficiently learn bounded-degree predictive feature interactions. Unfortunately, in models that serve web-scale traffic with billions of training examples, DCN showed limited expressiveness in its cross network at learning more predictive feature interactions. Despite significant research progress made, many deep learning models in production still rely on traditional feed-forward neural networks to learn feature crosses inefficiently. In light of the pros/cons of DCN and existing feature interaction learning approaches, we propose an improved framework DCN-V2 to make DCN more practical in large-scale industrial settings. In a comprehensive experimental study with extensive hyper-parameter search and model tuning, we observed that DCN-V2 approaches outperform all the state-of-the-art algorithms on popular benchmark datasets. The improved DCN-V2 is more expressive yet remains cost efficient at feature interaction learning, especially when coupled with a mixture of low-rank architecture. DCN-V2 is simple, can be easily adopted as building blocks, and has delivered significant offline accuracy and online business metrics gains across many web-scale learning to rank systems at Google.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源