论文标题
YM2413-MDB:一种带有情感注释的多乐器FM视频游戏音乐数据集
YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations
论文作者
论文摘要
现有的多音乐数据集往往会偏向流行音乐和古典音乐。此外,他们通常缺乏高级注释,例如情感标签。在本文中,我们提出了YM2413-MDB,这是具有多标签情感注释的80年代FM视频游戏音乐数据集。它包括80年代使用基于FM的可编程声音生成器YM2413的SEGA和MSX PC游戏中的669个音频和MIDI文件。收集的游戏音乐由15个单声乐器和一根鼓乐器组成。它们是从YM2413声音芯片的二进制命令中转换的。每首歌都用两个注释者标记为19个情感标签,并通过三个验证者验证以获取精制标签。我们为使用YM2413-MDB提供了情感识别和情感条件的符号音乐的基线模型和结果。
Existing multi-instrumental datasets tend to be biased toward pop and classical music. In addition, they generally lack high-level annotations such as emotion tags. In this paper, we propose YM2413-MDB, an 80s FM video game music dataset with multi-label emotion annotations. It includes 669 audio and MIDI files of music from Sega and MSX PC games in the 80s using YM2413, a programmable sound generator based on FM. The collected game music is arranged with a subset of 15 monophonic instruments and one drum instrument. They were converted from binary commands of the YM2413 sound chip. Each song was labeled with 19 emotion tags by two annotators and validated by three verifiers to obtain refined tags. We provide the baseline models and results for emotion recognition and emotion-conditioned symbolic music generation using YM2413-MDB.