论文标题
从污染程序中提取清洁性能模型
Extracting Clean Performance Models from Tainted Programs
论文作者
论文摘要
性能模型是了解并行应用的缩放行为的知名工具。他们表达了绩效如何变化为关键执行参数,例如过程数量或输入问题的大小各不相同。除了推理程序行为外,此类模型也可以自动从性能数据中得出。这称为经验性能建模。乍一看,这听起来很简单,但这种方法面临着几个严重的相互关联的挑战,包括昂贵的性能测量,嘈杂的基准数据造成的不准确性以及整体复杂的实验设计,从选择正确的参数开始。考虑的参数越多,需要的实验就越多,噪声的影响越强。在本文中,我们展示了一种从计算机安全范围内借来的污点分析可以大大改善建模过程,降低其成本,提高模型质量,并有助于验证性能模型和实验设置。
Performance models are well-known instruments to understand the scaling behavior of parallel applications. They express how performance changes as key execution parameters, such as the number of processes or the size of the input problem, vary. Besides reasoning about program behavior, such models can also be automatically derived from performance data. This is called empirical performance modeling. While this sounds simple at the first glance, this approach faces several serious interrelated challenges, including expensive performance measurements, inaccuracies inflicted by noisy benchmark data, and overall complex experiment design, starting with the selection of the right parameters. The more parameters one considers, the more experiments are needed and the stronger the impact of noise. In this paper, we show how taint analysis, a technique borrowed from the domain of computer security, can substantially improve the modeling process, lowering its cost, improving model quality, and help validate performance models and experimental setups.