论文标题

使用代数不变的系统发育网络中4节点杂交周期的超快学习

Ultrafast learning of 4-node hybridization cycles in phylogenetic networks using algebraic invariants

论文作者

Wu, Zhaoxing, Solis-Lemus, Claudia

论文摘要

动机:生命树中的基因流量的丰度挑战了可以通过完全分叉的过程来表示进化的观念,因为该过程无法捕获重要的生物学现实,例如杂交,渗入或水平基因转移。基于融资的网络方法越来越流行,但对于大数据而言不可扩展,因为它们需要在网络空间中执行启发式搜索以及可以是NP-HARD的数值优化。 结果:在这里,我们引入了一种新的方法,以基于代数不变性重建系统发育网络。虽然在系统发育中使用代数不变的有悠久的传统,但我们的工作是第一个对一致性因素(在输入基因树中4-Taxon分裂的频率)定义的系统发育不变的人,以识别1级的1级系统发育网络。我们的新型推理方法是无优化的,因为它仅需要评估多项式方程,因此,它绕过网络空间的遍历,从而比最快的网络方法快10倍。我们在各种模拟场景以及估计Canis属的系统发育网络上说明了新方法的准确性和速度。 可用性和实施​​:我们在https://github.com/solislemuslab/phylodiamond.jl上可用的开源公共可用的Julia Package theyodiamond.jl实施新的理论。 联系人:[email protected]

Motivation: The abundance of gene flow in the Tree of Life challenges the notion that evolution can be represented with a fully bifurcating process, as this process cannot capture important biological realities like hybridization, introgression, or horizontal gene transfer. Coalescent-based network methods are increasingly popular, yet not scalable for big data, because they need to perform a heuristic search in the space of networks as well as numerical optimization that can be NP-hard. Results: Here, we introduce a novel method to reconstruct phylogenetic networks based on algebraic invariants. While there is a long tradition of using algebraic invariants in phylogenetics, our work is the first to define phylogenetic invariants on concordance factors (frequencies of 4-taxon splits in the input gene trees) to identify level-1 phylogenetic networks under the multispecies coalescent model. Our novel inference methodology is optimization-free as it only requires the evaluation of polynomial equations, and as such, it bypasses the traversal of network space, yielding a computational speed at least 10 times faster than the fastest-to-date network methods. We illustrate the accuracy and speed of our new method on a variety of simulated scenarios as well as in the estimation of a phylogenetic network for the genus Canis. Availability and Implementation: We implement our novel theory on an open-source publicly available Julia package PhyloDiamond.jl available at https://github.com/solislemuslab/PhyloDiamond.jl with broad applicability within the evolutionary biology community. Contact: [email protected]

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源