论文标题

带有Bregman Diverence的阶段检索和对音频信号恢复的应用

Phase retrieval with Bregman divergences and application to audio signal recovery

论文作者

Vial, Pierre-Hugo, Magron, Paul, Oberlin, Thomas, Févotte, Cédric

论文摘要

相位检索(PR)旨在从一组内部产品的大小中恢复信号。此问题出现在许多音频信号处理应用程序中,这些应用程序以短时傅立叶变换幅度或功率谱图运行,并丢弃相位信息。从所得的修改频谱图中恢复缺失相是为了综合时间域信号而进行的。普遍考虑涉及二次损失函数的最小化问题来解决PR。在本文中,我们采用了不同的角度。实际上,二次损失无法正确解释音频的某些知觉属性,并且在许多情况下都首选诸如β-差异之类的替代差异措施。因此,我们将PR作为一个涉及Bregman Diverence的新最小化问题。由于这些差异相对于它们的两个输入参数并不对称,因此它们导致了两个不同的问题。为了优化所得目标,我们基于加速梯度下降和交替的乘数方法得出两种算法。从噪声观测值精确或估算的频谱图中进行的音频信号恢复的实验突出了我们提出的音频恢复方法的潜力。特别是,在非常嘈杂的条件下从频谱图执行PR时,利用这些Bregman差异的某些差异会引起比二次损失更好的性能。

Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. Therefore, we formulate PR as a new minimization problem involving Bregman divergences. Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induce better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源