详情页 - 河北大学附属医院知识库

当前位置：首页 > 详情页

SGBTransNet: Bridging the semantic gap in medical image segmentation models using Transformers

文献详情

资源类型：

WOS体系：

收录情况： ◇ SCIE

作者：

机构： [1]Hebei Univ, Coll Math & Informat Sci, Baoding 071000, Hebei, Peoples R China [2]Hebei Univ, Hebei Key Lab Machine Learning & Computat Intellig, Baoding 071000, Hebei, Peoples R China [3]Hebei Univ, Affiliated Hosp, Baoding 071000, Hebei, Peoples R China [4]Hebei Univ, Coll Cyber Secur & Comp, Baoding 071000, Hebei, Peoples R China

出处：

DOI：

ISSN：

关键词： Semantic gap U-shaped model Attention mechanism Transformer Medical image segmentation

摘要：

Most medical image segmentation models adopt a U-shaped encoder-decoder structure with skip-connections in between. However, they suffer from two issues that degrade their performance. First, due to the large difference between numbers of convolutions, there are semantic inconsistency and spatial misalignment (collectively referred to as semantic gap) between shallow feature maps in the encoder and deep feature maps in the decoder. When simply concatenating them by skip-connections, some noisy shallow features are introduced into the result feature maps, impairing the feature discriminability and resulting in misclassifications. Second, the locality of convolutions limits models to explicitly capture global dependencies. For the first issue, we propose a novel S emantic C onsistency E nhancement M odule (SCEM) which consists of two sub-modules: S hallow F eature R efinement Trans former (SFRTrans) and D ual-Path P ath C ross C hannel-Attention A ttention (DPCCA). SFRTrans refines shallow feature maps with the high-level semantic guidance offered by deep feature maps in a global modeling manner, towards selectively providing shallow features to the decoder instead of simple concatenation. DPCCA performs channel-attention synergistically on SFRTrans output and deep feature maps to further alleviate the semantic gap from the channel perspective. For the second issue, we incorporate Self-Attention Transformer after a U-Net encoder to enable global context modeling, with the U-Net encoder learning local features and setting priors for the model. With these modules, we construct S emantic-Gap G ap-Bridging B ridging Trans former U- Net (SGBTransNet). We conduct extensive experiments on five datasets of four modalities. Experimental results show that SGBTransNet achieves better or comparable performance than state-of-the-art methods.

基金：

语种：

WOS：

中科院(CAS)分区：

出版当年[2025]版：

无

最新[2025]版：

大类 | 2 区医学

小类 | 3 区工程：生物医学

JCR分区：

出版当年[2024]版：

无

最新[2023]版：

Q1 ENGINEERING, BIOMEDICAL

影响因子： 4.9 最新[2023版] 4.9 最新五年平均 0 出版当年[2024版] 0 出版当年五年平均 4.9 出版前一年[2023版]

第一作者：

第一作者机构： [1]Hebei Univ, Coll Math & Informat Sci, Baoding 071000, Hebei, Peoples R China

通讯作者：

通讯机构： [1]Hebei Univ, Coll Math & Informat Sci, Baoding 071000, Hebei, Peoples R China [2]Hebei Univ, Hebei Key Lab Machine Learning & Computat Intellig, Baoding 071000, Hebei, Peoples R China

推荐引用方式(GB/T 7714)：

APA：

MLA：

相关文献

[1]A deep model towards accurate boundary location and strong generalization for medical image segmentation [2]LEACS: a learnable and efficient active contour model with space-frequency pooling for medical image segmentation [3]Dual-decoder data decoupling training for semi-supervised medical image segmentation [4]LGI Net: Enhancing local-global information interaction for medical image segmentation [5]基于小波分解和markov场的ct/mri医学影像分割 [6]基于稳健特征统计的医学影像分割算法 [7]Hyperspectral-attention mechanism-based improvement of radiomics prediction method for primary liver cancer [8]ISANET: Non-small cell lung cancer classification and detection based on CNN and attention mechanism [9]DE-UNeXt: Dual encoder UNeXt for intracranial hemorrhage segmentation on a novel HBU CH dataset [10]Brain tumor magnetic resonance image segmentation by a multiscale contextual attention module combined with a deep residual UNet (MCA-ResUNet)