论文标题

自发讲话对话代理如何影响用户行为?

How does a spontaneously speaking conversational agent affect user behavior?

论文作者

Iizuka, Takahisa, Mori, Hiroki

论文摘要

这项研究调查了对会话剂的合成声音对人类互动者的训练对话剂的影响。具体来说,我们假设人类与对话代理相互作用时将表现出更多的社会反应,而对话代理具有综合声音以自发的语音为基础。通常,语音合成器建立在语音语料库上,语音专业人士阅读一组书面句子。综合演讲很清楚,好像新闻报道正在阅读新闻或配音演员在扮演动漫角色。但是,这与我们在日常对话中发表的自发演讲完全不同。语音合成的最新进展使我们能够在自发的语音语料库上建立语音合成器,并以合理的质量获得几乎对话的综合语音。通过利用这些技术,我们检查了人类是否对自发讲对话的代理人产生更多的社会反应。我们与一位对话剂进行了大规模的对话实验,该对话代理人的话语是通过自发言语训练的模型合成的,或者是读语音的。结果表明,与代理商互动的受试者是从自发性演讲中合成的话语往往显示较短的响应时间和更多的回音渠道。问卷的结果表明,与经纪人互动的受试者是从自发演讲中综合的话语,往往将他们与代理商的对话评价为更接近人类的对话。这些结果表明,基于自发言语的语音综合对于实现作为社会演员的会话代理至关重要。

This study investigated the effect of synthetic voice of conversational agent trained with spontaneous speech on human interactants. Specifically, we hypothesized that humans will exhibit more social responses when interacting with conversational agent that has a synthetic voice built on spontaneous speech. Typically, speech synthesizers are built on a speech corpus where voice professionals read a set of written sentences. The synthesized speech is clear as if a newscaster were reading a news or a voice actor were playing an anime character. However, this is quite different from spontaneous speech we speak in everyday conversation. Recent advances in speech synthesis enabled us to build a speech synthesizer on a spontaneous speech corpus, and to obtain a near conversational synthesized speech with reasonable quality. By making use of these technology, we examined whether humans produce more social responses to a spontaneously speaking conversational agent. We conducted a large-scale conversation experiment with a conversational agent whose utterances were synthesized with the model trained either with spontaneous speech or read speech. The result showed that the subjects who interacted with the agent whose utterances were synthesized from spontaneous speech tended to show shorter response time and a larger number of backchannels. The result of a questionnaire showed that subjects who interacted with the agent whose utterances were synthesized from spontaneous speech tended to rate their conversation with the agent as closer to a human conversation. These results suggest that speech synthesis built on spontaneous speech is essential to realize a conversational agent as a social actor.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源