轻巧的扬声器验证，用于在线识别具有短段的新扬声器的在线识别

论文标题

轻巧的扬声器验证，用于在线识别具有短段的新扬声器的在线识别

Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments

论文作者

Velez, Ivette, Rascon, Caleb, Fuentes-Pineda, Gibran

论文摘要

验证最近提出了是否属于同一扬声器的两个音频段作为执行扬声器标识的灵活方式，因为当新扬声器出现在听觉场景中时，它不需要重新训练。尽管许多当前技术已经达到了很高的性能，但它们需要大量的内存，并且其输入音频段的特定最小长度。这些要求限制了这些技术在服务机器人，物联网和虚拟助理等方案中的适用性，在该方案中，计算资源有限，用户倾向于在短段中讲话。在这项工作中，我们提出了一个基于BLSTM的模型，该模型在使用短输入音频片段时达到与当前最新状态相当的性能水平，同时需要较少的内存量。此外，据我们所知，尚未使用此验证范式进行完整的说话者识别系统。因此，我们基于简单的投票系统提出了一个完整的在线说话者标识符，该标识符表明，与当前的最新技术相比，拟议的基于BLSTM的模型在网上识别说话者方面具有相似的性能。

Verifying if two audio segments belong to the same speaker has been recently put forward as a flexible way to carry out speaker identification, since it does not require to be re-trained when new speakers appear on the auditory scene. Although many of the current techniques have achieved high performances, they require a considerably high amount of memory, and a specific minimum length for their input audio segments. These requirements limit the applicability of these techniques in scenarios such as service robots, internet of things and virtual assistants, where computational resources are limited and the users tend to speak in short segments. In this work we propose a BLSTM-based model that reaches a level of performance comparable to the current state of the art when using short input audio segments, while requiring a considerably less amount of memory. Further, as far as we know, a complete speaker identification system has not been reported using this verification paradigm. Thus, we present a complete online speaker identifier, based on a simple voting system, that shows that the proposed BLSTM-based model achieves a similar performance at identifying speakers online compared to the current state of the art.

下载PDF全文

下载文献需遵守相关版权规定

论文标题