Most medical image segmentation models adopt a U-shaped encoder-decoder structure with skip-connections in between. However, they suffer from two issues that degrade their performance. First, due to the large difference between numbers of convolutions, there are semantic inconsistency and spatial misalignment (collectively referred to as semantic gap) between shallow feature maps in the encoder and deep feature maps in the decoder. When simply concatenating them by skip-connections, some noisy shallow features are introduced into the result feature maps, impairing the feature discriminability and resulting in misclassifications. Second, the locality of convolutions limits models to explicitly capture global dependencies. For the first issue, we propose a novel S emantic C onsistency E nhancement M odule (SCEM) which consists of two sub-modules: S hallow F eature R efinement Trans former (SFRTrans) and D ual-Path P ath C ross C hannel-Attention A ttention (DPCCA). SFRTrans refines shallow feature maps with the high-level semantic guidance offered by deep feature maps in a global modeling manner, towards selectively providing shallow features to the decoder instead of simple concatenation. DPCCA performs channel-attention synergistically on SFRTrans output and deep feature maps to further alleviate the semantic gap from the channel perspective. For the second issue, we incorporate Self-Attention Transformer after a U-Net encoder to enable global context modeling, with the U-Net encoder learning local features and setting priors for the model. With these modules, we construct S emantic-Gap G ap-Bridging B ridging Trans former U- Net (SGBTransNet). We conduct extensive experiments on five datasets of four modalities. Experimental results show that SGBTransNet achieves better or comparable performance than state-of-the-art methods.
基金:
National Natural Science Foundation of China [61375075]; Natural Sci-ence Foundation of Hebei Province of China [F2019201329]; Special Project of Science and Technology Department of Hebei Province [2021H060306]; Innovation Capacity Enhancement Program Science and Technology Platform Project, Hebei Province [22567623H]; Natural Science Foundation Youth Sci-ence Fund Project of Hebei Province [F2024502006]; Fundamental Research Funds for the Central Universities [2024MS138]
第一作者机构:[1]Hebei Univ, Coll Math & Informat Sci, Baoding 071000, Hebei, Peoples R China
通讯作者:
通讯机构:[1]Hebei Univ, Coll Math & Informat Sci, Baoding 071000, Hebei, Peoples R China[2]Hebei Univ, Hebei Key Lab Machine Learning & Computat Intellig, Baoding 071000, Hebei, Peoples R China
推荐引用方式(GB/T 7714):
Zhou Yunlai,Wang Bing,Yang Jie,et al.SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers[J].BIOMEDICAL SIGNAL PROCESSING AND CONTROL.2024,98:doi:10.1016/j.bspc.2024.106746.
APA:
Zhou, Yunlai,Wang, Bing,Yang, Jie,Yang, Ying&Tian, Xuedong.(2024).SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers.BIOMEDICAL SIGNAL PROCESSING AND CONTROL,98,
MLA:
Zhou, Yunlai,et al."SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers".BIOMEDICAL SIGNAL PROCESSING AND CONTROL 98.(2024)