以医疗服装检测为例的合成数据集生成方法的比较

论文标题

以医疗服装检测为例的合成数据集生成方法的比较

Comparison of synthetic dataset generation methods for medical intervention rooms using medical clothing detection as an example

论文作者

Schülein, Patrick, Teufel, Hannah, Vorpahl, Ronja, Emter, Indira, Bukschat, Yannick, Pfister, Marcus, Siebert, Anke, Rathmann, Nils, Diehl, Steffen, Vetter, Marcus

论文摘要

从具有高隐私要求的领域（例如医疗干预空间）获得的真实数据较低，并且收购在法律上很复杂。因此，这项工作提供了一种以医疗服装为例为医疗环境创建合成数据集的方法。目的是缩小合成数据和真实数据之间的现实差距。为此，使用虚幻的引擎插件或Unity比较了3D扫描服装和设计服装的方法。此外，还使用了绿屏和目标域数据集的混合现实数据集。我们的实验表明，设计服装的结构性域随机化以及混合现实数据提供了基线，可在临床目标域的测试数据集上实现72.0％的地图。当使用15％可用的目标域火车数据的15％时，针对100％（660张图像）目标域列车数据的差距几乎可以关闭80.05％的地图（81.95％地图）。最后，我们表明，当使用100％目标域训练数据时，精度可以提高到83.35％的地图。

The availability of real data from areas with high privacy requirements, such as the medical intervention space, is low and the acquisition legally complex. Therefore, this work presents a way to create a synthetic dataset for the medical context, using medical clothing as an example. The goal is to close the reality gap between the synthetic and real data. For this purpose, methods of 3D-scanned clothing and designed clothing are compared in a Domain-Randomization and Structured-Domain-Randomization scenario using an Unreal-Engine plugin or Unity. Additionally a Mixed-Reality dataset in front of a greenscreen and a target domain dataset were used. Our experiments show, that Structured-Domain-Randomization of designed clothing together with Mixed-Reality data provide a baseline achieving 72.0% mAP on a test dataset of the clinical target domain. When additionally using 15% of available target domain train data, the gap towards 100% (660 images) target domain train data could be nearly closed 80.05% mAP (81.95% mAP). Finally we show that when additionally using 100% target domain train data the accuracy could be increased to 83.35% mAP.

下载PDF全文

下载文献需遵守相关版权规定

论文标题