论文标题

提供有关受到失败和散漫者影响的查询的见解

Providing Insights for Queries affected by Failures and Stragglers

论文作者

Sundarmurthy, Bruhathi, Deshmukh, Harshad, Koutris, Paris, Naughton, Jeffrey

论文摘要

交互式时间响应对于分析大量数据的用户来说是至关重要的要求。此类分析查询通常在分布式设置中运行,数据分布在数千个节点上以获得高吞吐量。但是,提供实时分析仍然是一个很大的挑战。随着数据分布在数千个节点上,某些必需的节点在查询执行过程中无法使用或非常缓慢的概率很高,并且无法可用,可能会导致执行缓慢甚至失败。数据和用户的庞大幅度增加了资源争夺,这加剧了执行过程中Stragglers和Node失败的现象。在本文中,我们提出了一种新颖的解决方案,以减轻散乱/失败问题,以利用数据的现有有效分区属性,尤其是共敲打数据,并提供了近似答案以及对受到失败/straggler节点影响的查询的置信范围。我们考虑涉及加入,组BY,具有条款和嵌套子量子类的汇总查询。最后,我们通过在TPC-H数据集上进行的广泛实验来验证我们的方法。

Interactive time responses are a crucial requirement for users analyzing large amounts of data. Such analytical queries are typically run in a distributed setting, with data being sharded across thousands of nodes for high throughput. However, providing real-time analytics is still a very big challenge; with data distributed across thousands of nodes, the probability that some of the required nodes are unavailable or very slow during query execution is very high and unavailability may result in slow execution or even failures. The sheer magnitude of data and users increase resource contention and this exacerbates the phenomenon of stragglers and node failures during execution. In this paper, we propose a novel solution to alleviate the straggler/failure problem that exploits existing efficient partitioning properties of the data, particularly, co-hash partitioned data, and provides approximate answers along with confidence bounds to queries affected by failed/straggler nodes. We consider aggregate queries that involve joins, group bys, having clauses and a subclass of nested subqueries. Finally, we validate our approach through extensive experiments on the TPC-H dataset.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源