Introducing Latent Timbre Synthesis

Paper title

Introducing Latent Timbre Synthesis

Authors

Tatar, K., Bisig, D., Pasquier, P.

Abstract

We present Latent Timbre Synthesis (LTS), a new audio synthesis method using Deep Learning. The method allows composers and sound designers to interpolate and extrapolate between the timbres of multiple sounds using the latent space of audio frames. We provide the details of two Variational Autoencoder architectures for LTS and compare their advantages and drawbacks. The implementation includes a fully working application with a graphical user interface, called interpolate_two, which enables practitioners to explore the timbre between two audio excerpts of their choice using interpolation and extrapolation in the latent space of audio frames. Our implementation is open-source, and we aim to improve the accessibility of this technology by providing a guide for users with any technical background.
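
The idea of interpolating and extrapolating between two sounds in a latent space can be illustrated with a minimal sketch. It assumes each audio frame has already been encoded into a latent vector and that blending is a plain linear mix; the abstract does not state the exact blending rule, and the name `blend_latents` and the toy numbers are hypothetical, not taken from the authors' implementation.

```python
from typing import List

Frame = List[float]  # one latent vector per audio frame (hypothetical representation)


def blend_latents(z_a: List[Frame], z_b: List[Frame], alpha: float) -> List[Frame]:
    """Blend two equal-length sequences of latent vectors frame by frame.

    alpha = 0 returns z_a and alpha = 1 returns z_b. Values in between
    interpolate, and values outside [0, 1] extrapolate past either sound.
    Linear blending is an assumption made for this sketch.
    """
    if len(z_a) != len(z_b):
        raise ValueError("both excerpts must have the same number of frames")
    blended = []
    for frame_a, frame_b in zip(z_a, z_b):
        if len(frame_a) != len(frame_b):
            raise ValueError("latent vectors must have the same dimension")
        blended.append(
            [(1.0 - alpha) * a + alpha * b for a, b in zip(frame_a, frame_b)]
        )
    return blended


# Toy latent codes standing in for encoder output (made-up numbers).
z_first = [[0.0, 1.0], [2.0, 3.0]]
z_second = [[4.0, 5.0], [6.0, 7.0]]

halfway = blend_latents(z_first, z_second, 0.5)  # interpolation
beyond = blend_latents(z_first, z_second, 1.5)   # extrapolation past the second sound
```

In a complete system, the blended latent vectors would be passed through a decoder to reconstruct audio frames; that step depends on the trained model and is left out here.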
