有效计算跨越树分布下的期望

论文标题

有效计算跨越树分布下的期望

Efficient Computation of Expectations under Spanning Tree Distributions

论文作者

Zmigrod, Ran, Vieira, Tim, Cotterell, Ryan

论文摘要

我们为推断跨越树模型提供了一个一般框架。我们提出了针对一阶期望的重要情况和二阶期望的统一算法。我们的算法利用了梯度和期望之间的基本联系，这使我们能够得出有效的算法。这些算法易于使用或不使用自动分化软件实现。我们通过以前的研究的几个\ emph {警告性故事}激发了我们的框架的发展，该研究已经开发了许多用于计算期望及其梯度的效率低下的算法。我们演示了我们的框架如何有效地计算出已知算法的几个数量，包括预期的固定评分，熵和广义期望标准。作为奖励，我们为文献中缺少的数量（包括KL差异）提供了算法。在所有情况下，我们的方法都与现有算法的效率相匹配，在某些情况下，我们的方法将运行时的复杂性降低了句子长度的一倍。我们通过运行时实验验证了框架的实现。我们发现，我们的算法分别比以前的算法要快15和9倍，用于计算香农熵和广义期望目标的梯度。

We give a general framework for inference in spanning tree models. We propose unified algorithms for the important cases of first-order expectations and second-order expectations in edge-factored, non-projective spanning-tree models. Our algorithms exploit a fundamental connection between gradients and expectations, which allows us to derive efficient algorithms. These algorithms are easy to implement with or without automatic differentiation software. We motivate the development of our framework with several \emph{cautionary tales} of previous research, which has developed numerous inefficient algorithms for computing expectations and their gradients. We demonstrate how our framework efficiently computes several quantities with known algorithms, including the expected attachment score, entropy, and generalized expectation criteria. As a bonus, we give algorithms for quantities that are missing in the literature, including the KL divergence. In all cases, our approach matches the efficiency of existing algorithms and, in several cases, reduces the runtime complexity by a factor of the sentence length. We validate the implementation of our framework through runtime experiments. We find our algorithms are up to 15 and 9 times faster than previous algorithms for computing the Shannon entropy and the gradient of the generalized expectation objective, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题