XBougn-Former：朝向变压器中的跨尺度边界建模

论文标题

XBougn-Former：朝向变压器中的跨尺度边界建模

XBound-Former: Toward Cross-scale Boundary Modeling in Transformers

论文作者

Wang, Jiacheng, Chen, Fei, Ma, Yuxi, Wang, Liansheng, Fei, Zhaodong, Shuai, Jianwei, Tang, Xiangdong, Zhou, Qichao, Qin, Jing

论文摘要

皮肤病变从皮肤镜检查图像分割在皮肤癌的定量分析中具有重要意义，这对于皮肤科医生来说，由于固有的问题，即相当大的大小，形状和颜色变化以及含糊不清的边界，这对于皮肤科医生来说也很具挑战性。最近的视觉变压器在通过全球上下文建模来处理变化方面表现出了有希望的性能。尽管如此，他们尚未彻底解决歧义界限的问题，因为他们忽略了边界知识和全球环境的补充用法。在本文中，我们提出了一种新型的跨尺度边界感知变压器，\ textbf {xbound-Former}，以同时解决皮肤病变分割的变化和边界问题。 Xbougn-Former是一个纯粹基于注意力的网络，并通过三个专门设计的学习者获取边界知识。我们在两个皮肤病变数据集（ISIC-2016 \＆ph $^2 $和ISIC-2018）上评估了该模型，我们的模型始终优于其他基于卷积和变压器的模型，尤其是在边界指标上。我们广泛验证具有相似特征的息肉病变分割的概括能力，与最新模型相比，我们的模型也可以产生显着改善。

Skin lesion segmentation from dermoscopy images is of great significance in the quantitative analysis of skin cancers, which is yet challenging even for dermatologists due to the inherent issues, i.e., considerable size, shape and color variation, and ambiguous boundaries. Recent vision transformers have shown promising performance in handling the variation through global context modeling. Still, they have not thoroughly solved the problem of ambiguous boundaries as they ignore the complementary usage of the boundary knowledge and global contexts. In this paper, we propose a novel cross-scale boundary-aware transformer, \textbf{XBound-Former}, to simultaneously address the variation and boundary problems of skin lesion segmentation. XBound-Former is a purely attention-based network and catches boundary knowledge via three specially designed learners. We evaluate the model on two skin lesion datasets, ISIC-2016\&PH$^2$ and ISIC-2018, where our model consistently outperforms other convolution- and transformer-based models, especially on the boundary-wise metrics. We extensively verify the generalization ability of polyp lesion segmentation that has similar characteristics, and our model can also yield significant improvement compared to the latest models.

下载PDF全文

下载文献需遵守相关版权规定

论文标题