论文标题
恢复助听器用户深入学习的语音清晰度
Restoring speech intelligibility for hearing aid users with deep learning
论文作者
论文摘要
全球范围内将近十亿人遭受破坏听力损失的困扰。尽管助听器可以部分弥补这一点,但很大一部分用户很难在背景噪音的情况下了解语音。在这里,我们提出了一种基于学习的算法,该算法在维持语音信号的同时有选择地抑制噪声。该算法可恢复助听器用户对正常听力控制受试者的水平的语音清晰度。它由一个深厚的网络组成,该网络在嘈杂的语音信号的大型自定义数据库中进行了培训,并通过新颖的基于深度学习的指标来通过神经体系结构搜索进行了进一步优化。该网络在一系列人类毕业的评估中实现了最先进的定位,跨不同的噪声类别概括,并且与经典的波束形成方法相反,可以在单个麦克风上运行。该系统可以在笔记本电脑上实时运行,这表明可以在几年内实现大规模的助听器筹码部署。因此,基于深度学习的DeNoising具有改善数百万听力障碍者的生活质量的潜力。
Almost half a billion people world-wide suffer from disabling hearing loss. While hearing aids can partially compensate for this, a large proportion of users struggle to understand speech in situations with background noise. Here, we present a deep learning-based algorithm that selectively suppresses noise while maintaining speech signals. The algorithm restores speech intelligibility for hearing aid users to the level of control subjects with normal hearing. It consists of a deep network that is trained on a large custom database of noisy speech signals and is further optimized by a neural architecture search, using a novel deep learning-based metric for speech intelligibility. The network achieves state-of-the-art denoising on a range of human-graded assessments, generalizes across different noise categories and - in contrast to classic beamforming approaches - operates on a single microphone. The system runs in real time on a laptop, suggesting that large-scale deployment on hearing aid chips could be achieved within a few years. Deep learning-based denoising therefore holds the potential to improve the quality of life of millions of hearing impaired people soon.