论文标题
自动昆虫识别的声音自适应表示
Adaptive Representations of Sound for Automatic Insect Recognition
论文作者
论文摘要
昆虫是我们生态系统不可或缺的一部分。这些经常小而回避的动物对周围环境有很大的影响,提供了当前的生物多样性和授粉职责的很大一部分,构成了食物链的基础以及许多生物学和生态过程。由于人类影响的因素,随着时间的流逝,人口数量和生物多样性迅速下降。监视这种下降对于有效实施的保护措施变得越来越重要。但是,监视方法通常是侵入性,时间和资源强度,并且容易出现各种偏见。许多昆虫物种会产生特征性的交配声音,可以轻松地检测和记录,而无需大量的成本或精力。使用深度学习方法,可以自动检测并分类从现场记录中的昆虫声音来监测生物多样性和物种分布范围。在这个项目中,我使用现有的昆虫声音(骨翅目和cicadidae)和机器学习方法的数据集实施了此功能,并评估了它们的声学昆虫监测潜力。我将基于常规频谱图的深度学习方法与新的自适应和基于波形的方法叶的性能进行了比较。通过在训练过程中调整其特征提取参数,基于波形的前端比MEL-Spectrogragron前端获得了明显更好的分类性能。该结果令人鼓舞,以实施深度学习技术以自动昆虫的声音识别,尤其是在较大数据集可用的情况下。
Insects are an integral part of our ecosystem. These often small and evasive animals have a big impact on their surroundings, providing a large part of the present biodiversity and pollination duties, forming the foundation of the food chain and many biological and ecological processes. Due to factors of human influence, population numbers and biodiversity have been rapidly declining with time. Monitoring this decline has become increasingly important for conservation measures to be effectively implemented. But monitoring methods are often invasive, time and resource intense, and prone to various biases. Many insect species produce characteristic mating sounds that can easily be detected and recorded without large cost or effort. Using deep learning methods, insect sounds from field recordings could be automatically detected and classified to monitor biodiversity and species distribution ranges. In this project, I implement this using existing datasets of insect sounds (Orthoptera and Cicadidae) and machine learning methods and evaluate their potential for acoustic insect monitoring. I compare the performance of the conventional spectrogram-based deep learning method against the new adaptive and waveform-based approach LEAF. The waveform-based frontend achieved significantly better classification performance than the Mel-spectrogram frontend by adapting its feature extraction parameters during training. This result is encouraging for future implementations of deep learning technology for automatic insect sound recognition, especially if larger datasets become available.