Paper title
Optimizing Revenue while showing Relevant Assortments at Scale
Paper authors
Paper abstract
Scalable real-time assortment optimization has become essential in e-commerce operations due to the need for personalization and the availability of a large variety of items. While this is tractable when the assortment choices are simple, the optimization process becomes difficult when constraints are imposed on the collection of relevant assortments based on insights from store managers and historically well-performing assortments. We design fast and flexible algorithms based on variations of binary search that find the (approximately) optimal assortment in this difficult regime. In particular, we revisit the problem of large-scale assortment optimization under the multinomial logit choice model without any assumptions on the structure of the feasible assortments. We speed up the comparison steps using advances in similarity search in the field of information retrieval/machine learning. For an arbitrary collection of assortments, our algorithms can find a solution in time that is sub-linear in the number of assortments, and for the simpler case of cardinality constraints, in time that is linear in the number of items (existing methods are quadratic or worse). Empirical validations using a real-world dataset (in addition to experiments using semi-synthetic data based on the Billion Prices dataset and several retail transaction datasets) show that our algorithms are competitive even when the number of items is $\sim 10^5$ ($10\times$ larger instances than previously studied).
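To make the binary-search idea concrete, here is a minimal sketch for the cardinality-constrained case. It is an illustration under stated assumptions, not the paper's algorithm: it assumes the standard multinomial logit revenue $R(S) = \sum_{i \in S} p_i v_i / (1 + \sum_{i \in S} v_i)$ with the outside option's weight normalized to 1, and it uses the fact that $R(S) \ge K$ holds exactly when $\sum_{i \in S} v_i (p_i - K) \ge K$. The function names, the tolerance `tol`, and the toy data are all hypothetical. The comparison step here sorts the items for simplicity, so it does not reproduce the linear-time bound or the similarity-search speedup described in the abstract.

```python
from itertools import combinations


def mnl_revenue(assortment, prices, weights):
    """Expected revenue of an assortment under MNL (outside-option weight 1)."""
    numerator = sum(prices[i] * weights[i] for i in assortment)
    denominator = 1.0 + sum(weights[i] for i in assortment)
    return numerator / denominator


def best_set_for_threshold(K, prices, weights, capacity):
    """Pick at most `capacity` items maximizing sum of v_i * (p_i - K)."""
    scores = [(weights[i] * (prices[i] - K), i) for i in range(len(prices))]
    # Only items with a positive score can increase the sum.
    positive = sorted((s for s in scores if s[0] > 0), reverse=True)
    chosen = positive[:capacity]
    return [i for _, i in chosen], sum(score for score, _ in chosen)


def binary_search_assortment(prices, weights, capacity, tol=1e-6):
    """Binary search on the revenue level K (illustrative sketch only)."""
    lo, hi = 0.0, max(prices)  # the optimal revenue lies in [0, max price]
    best = []
    while hi - lo > tol:
        K = (lo + hi) / 2
        subset, value = best_set_for_threshold(K, prices, weights, capacity)
        if value >= K:  # some feasible assortment earns at least K
            lo, best = K, subset
        else:  # no feasible assortment earns K
            hi = K
    return best, mnl_revenue(best, prices, weights)


def brute_force_revenue(prices, weights, capacity):
    """Exhaustive check, only usable on tiny instances."""
    items = range(len(prices))
    return max(
        mnl_revenue(subset, prices, weights)
        for size in range(capacity + 1)
        for subset in combinations(items, size)
    )


if __name__ == "__main__":
    prices = [10.0, 8.0, 6.0, 4.0]   # hypothetical prices p_i
    weights = [0.2, 0.5, 1.0, 2.0]   # hypothetical MNL weights v_i
    best, revenue = binary_search_assortment(prices, weights, capacity=2)
    print(sorted(best), round(revenue, 4))
    print(round(brute_force_revenue(prices, weights, capacity=2), 4))
```

Each iteration answers one yes/no question ("can any feasible assortment earn at least $K$?"), which is the comparison step the abstract refers to. For an arbitrary collection of assortments the same question becomes a maximum inner-product query over the collection, which is where a similarity-search index could plausibly be substituted for the sort used here.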