论文标题
使用机器视觉模型自动检测野生动植物贸易
Towards automatic detection of wildlife trade using machine vision models
论文作者
论文摘要
野生动植物的不可持续贸易是影响全球生物多样性危机的主要威胁之一。现在,交易的重要部分是在互联网上,尤其是在数字市场和社交媒体上。由于保护资源有限,因此需要自动化的方法来识别贸易职位。在这里,我们开发了基于深层神经网络的机器视觉模型,目的是自动识别出易于销售的外来宠物动物的图像。为此,为此目的生成了一个新的代表在网络上出售的外来宠物动物的培训数据集。我们培训了24种神经网络模型,这些模型涵盖了五种不同的架构,三种培训方法和两种类型的数据集的组合。具体而言,设置了一部分训练图像以表示负面特征后,模型概括改进了。对模型的内部和未分布数据进行了评估,以测试更广泛的模型适用性。在分布评估内,最高的表现模型在分布数据集中的F-评分达到了0.95以上的F-评分超过0.75至0.87。值得注意的是,特征可视化表明,模型在检测动物所在的周围环境(例如笼子)方面表现良好,因此有助于自动检测非天然环境中动物的图像。拟议的方法可以帮助研究在线野生动植物贸易,但也可以适应数字平台的其他类型的人互动。未来的研究可以使用这些发现来为更多的分类学组构建强大的机器学习模型和新的数据收集管道。
Unsustainable trade in wildlife is one of the major threats affecting the global biodiversity crisis. An important part of the trade now occurs on the internet, especially on digital marketplaces and social media. Automated methods to identify trade posts are needed as resources for conservation are limited. Here, we developed machine vision models based on Deep Neural Networks with the aim to automatically identify images of exotic pet animals for sale. A new training dataset representing exotic pet animals advertised for sale on the web was generated for this purpose. We trained 24 neural-net models spanning a combination of five different architectures, three methods of training and two types of datasets. Specifically, model generalisation improved after setting a portion of the training images to represent negative features. Models were evaluated on both within and out of distribution data to test wider model applicability. The top performing models achieved an f-score of over 0.95 on within distribution evaluation and between 0.75 to 0.87 on the two out of distribution datasets. Notably, feature visualisation indicated that models performed well in detecting the surrounding context (e.g. a cage) in which an animal was located, therefore helping to automatically detect images of animals in non-natural environments. The proposed methods can help investigate the online wildlife trade, but can also be adapted to study other types of people-nature interactions from digital platforms. Future studies can use these findings to build robust machine learning models and new data collection pipelines for more taxonomic groups.