论文标题
语言意识的连接路径查询索引
Language-aware Indexing for Conjunctive Path Queries
论文作者
论文摘要
连接路径查询(CPQ)是用于复杂图分析的最常用查询之一。但是,当前的图形索引并未量身定制以完全支持查询语言表达CPQ的功能。因此,当前的方法在\ cpq {}评估期间没有利用大量的修剪机会,从而导致查询处理性能不佳。我们提出了CPQ感知路径索引CPQX,这是针对CPQ表达的第一个路径索引。 CPQX建立在基于路径 - 仿真的结构概念中的源目标顶点对路径对的分区。路径 - 象征性是对路径的等价关系,使该关系引起的每个分区块由图形中的路径组成,相对于CPQ无法区分。图表的这种语言意识分区可以大大降低查询评估的成本。我们提出了支持完整索引生命周期的方法:用我们的索引进行索引构建,维护和查询处理。我们还开发了感兴趣的CPQX,以减少指数规模和指数构造开销,同时加速有关关注查询的查询评估。我们通过对14个真实图的广泛实验证明,我们的方法在最新的方法中以多个数量级加速了查询处理,索引尺寸较小。我们完整的C ++代码库可作为开源进行进一步研究。
Conjunctive path queries (CPQ) are one of the most frequently used queries for complex graph analysis. However, current graph indexes are not tailored to fully support the power of query languages to express CPQs. Consequently, current methods do not take advantage of significant pruning opportunities during \cpq{} evaluation, resulting in poor query processing performance. We propose the CPQ-aware path index CPQx, the first path index tailored to the expressivity of CPQ. CPQx is built on the partition of the set of source-target vertex pairs of paths in a graph based on the structural notion of path-bisimulation. Path-bisimulation is an equivalence relation on paths such that each partition block induced by the relation consists of paths in the graph indistinguishable with respect to CPQs. This language-aware partitioning of the graph can significantly reduce the cost of query evaluation. We present methods to support the full index life cycle: index construction, maintenance, and query processing with our index. We also develop interest-aware CPQx to reduce index size and index construction overhead while accelerating query evaluation for queries of interest. We demonstrate through extensive experiments on 14 real graphs that our methods accelerate query processing by up to multiple orders of magnitude over the state-of-the-art methods, with smaller index sizes. Our complete C++ codebase is available as open source for further research.