论文标题

使用基于音素识别排名的话语验证来检测文本脚本和配音之间的不匹配

Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking

论文作者

Jeong, Yoonjae, Cho, Hoon-Young

论文摘要

这项研究的目的是检测文本脚本和配音之间的不匹配。为此,我们提出了一种新颖的话语验证(UV)方法,该方法计算了脚本的配音与音素序列之间的对应关系。我们发现,与普通话语相比,夸张的配音的音素识别概率降低了,但它们的排名并未显示出任何重大变化。因此,所提出的方法使用对应于音素序列对应的每个音素段的识别排名,以测量对其相应脚本的配音话语的置信度。实验结果表明,所提出的UV方法的表现优于一种最新方法,该方法使用用于检测语音和转录之间的不匹配的交叉模态注意。

The purpose of this study is to detect the mismatch between text script and voice-over. For this, we present a novel utterance verification (UV) method, which calculates the degree of correspondence between a voice-over and the phoneme sequence of a script. We found that the phoneme recognition probabilities of exaggerated voice-overs decrease compared to ordinary utterances, but their rankings do not demonstrate any significant change. The proposed method, therefore, uses the recognition ranking of each phoneme segment corresponding to a phoneme sequence for measuring the confidence of a voice-over utterance for its corresponding script. The experimental results show that the proposed UV method outperforms a state-of-the-art approach using cross modal attention used for detecting mismatch between speech and transcription.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源