Paper Title

Parameter Sensitivity of Deep-Feature based Evaluation Metrics for Audio Textures

Authors

Chitralekha Gupta, Yize Wei, Zequn Gong, Purnima Kamath, Zhuoyao Li, Lonce Wyse

Abstract

Standard evaluation metrics such as the Inception score and Fréchet Audio Distance provide a general audio quality distance metric between the synthesized audio and reference clean audio. However, the sensitivity of these metrics to variations in the statistical parameters that define an audio texture is not well studied. In this work, we provide a systematic study of the sensitivity of some of the existing audio quality evaluation metrics to parameter variations in audio textures. Furthermore, we also study three more potentially parameter-sensitive metrics for audio texture synthesis, (a) a Gram matrix based distance, (b) an Accumulated Gram metric using a summarized version of the Gram matrices, and (c) a cochlear-model based statistical features metric. These metrics use deep features that summarize the statistics of any given audio texture, thus being inherently sensitive to variations in the statistical parameters that define an audio texture. We study and evaluate the sensitivity of existing standard metrics as well as Gram matrix and cochlear-model based metrics to control-parameter variations in audio textures across a wide range of texture and parameter types, and validate with subjective evaluation. We find that each of the metrics is sensitive to different sets of texture-parameter types. This is the first step towards investigating objective metrics for assessing parameter sensitivity in audio textures.
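
The abstract names a Gram matrix based distance but does not give its formula, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the deep features of a clip are available as a (channels, time) array, uses random arrays as stand-ins for real network activations, and takes the mean squared difference between time-averaged Gram matrices as the distance. The function names `gram_matrix` and `gram_distance` are hypothetical.

```python
import numpy as np


def gram_matrix(features):
    """Time-averaged Gram matrix of a (channels, time) feature map.

    Entry (i, j) is the average over time of channel_i * channel_j,
    so the matrix summarizes channel co-activation statistics.
    """
    features = np.asarray(features, dtype=np.float64)
    n_frames = features.shape[1]
    return features @ features.T / n_frames


def gram_distance(feats_a, feats_b):
    """Mean squared difference between two Gram matrices (an assumed choice)."""
    g_a = gram_matrix(feats_a)
    g_b = gram_matrix(feats_b)
    return float(np.mean((g_a - g_b) ** 2))


# Stand-in "deep features": 8 channels, 256 time frames each.
rng = np.random.default_rng(0)
reference = rng.normal(size=(8, 256))
same_stats = rng.normal(size=(8, 256))      # same statistics, different samples
scaled = 2.0 * rng.normal(size=(8, 256))    # changed statistics (variance x4)

d_same = gram_distance(reference, same_stats)
d_scaled = gram_distance(reference, scaled)
print(f"same statistics:    {d_same:.4f}")
print(f"changed statistics: {d_scaled:.4f}")
```

Because the Gram matrix averages over time, two clips drawn from the same statistics score close to each other even when their samples differ, while a change in the underlying statistics moves the distance. This is the sense in which such a metric can respond to the parameters that define a texture.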
