通过拓扑调整深图学习结构化的地标检测

论文标题

通过拓扑调整深图学习结构化的地标检测

Structured Landmark Detection via Topology-Adapting Deep Graph Learning

论文作者

Li, Weijian, Lu, Yuhang, Zheng, Kang, Liao, Haofu, Lin, Chihung, Luo, Jiebo, Cheng, Chi-Tung, Xiao, Jing, Lu, Le, Kuo, Chang-Fu, Miao, Shun

论文摘要

图像地标检测旨在自动识别预定义的基准点的位置。尽管该领域最近取得了成功，但尚未充分利用较高阶层的结构建模，以捕获解剖地标之间的隐式或显式关系。在这项工作中，我们提出了一种新的拓扑调整深度图学习方法，以进行准确的解剖面部和医学（例如，手，骨盆）的地标检测。所提出的方法构造图形信号信号利用本地图像特征和全局形状特征。自适应图形拓扑自然会探索和降落在特定于任务的结构上，这些结构是通过两个图形卷积网络（GCN）端到端学习的。在三个公共面部图像数据集（WFLW，300W和COFW-68）以及三个现实世界X射线医疗数据集（头孢符（CephalMetric（Public），手），手和骨盆）上进行了广泛的实验。定量结果与所有研究的数据集中的先前最新方法相比，表明鲁棒性和准确性的表现出色。学识渊博的图形拓扑的定性可视化表明，地标在物理上有可能的连通性。

Image landmark detection aims to automatically identify the locations of predefined fiducial points. Despite recent success in this field, higher-ordered structural modeling to capture implicit or explicit relationships among anatomical landmarks has not been adequately exploited. In this work, we present a new topology-adapting deep graph learning approach for accurate anatomical facial and medical (e.g., hand, pelvis) landmark detection. The proposed method constructs graph signals leveraging both local image features and global shape features. The adaptive graph topology naturally explores and lands on task-specific structures which are learned end-to-end with two Graph Convolutional Networks (GCNs). Extensive experiments are conducted on three public facial image datasets (WFLW, 300W, and COFW-68) as well as three real-world X-ray medical datasets (Cephalometric (public), Hand and Pelvis). Quantitative results comparing with the previous state-of-the-art approaches across all studied datasets indicating the superior performance in both robustness and accuracy. Qualitative visualizations of the learned graph topologies demonstrate a physically plausible connectivity laying behind the landmarks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题