[1] MENG Yanzong, LI Xiaoxia, ZHOU Yingyue, et al. Early esophageal cancer image segmentation based on contextual feature awareness and dual frequency upsampling[J]. Chinese Journal of Medical Physics, 2023, 40(8): 957-963. [doi:10.3969/j.issn.1005-202X.2023.08.006]

Early esophageal cancer image segmentation based on contextual feature awareness and dual frequency upsampling

Chinese Journal of Medical Physics [ISSN: 1005-202X / CN: 44-1351/R]

Volume:
40
Issue:
No. 8, 2023
Pages:
957-963
Section:
Medical Imaging Physics
Publication date:
2023-09-01

Article Information

Title:
Early esophageal cancer image segmentation based on contextual feature awareness and dual frequency upsampling
Article ID:
1005-202X(2023)08-0957-07
Author(s):
MENG Yanzong1, LI Xiaoxia1,2, ZHOU Yingyue1,2, WEN Liming3, QIN Jiamin3, LIU Shuangli1,2
1. School of Information Engineering, Southwest University of Science and Technology, Mianyang 621000, China; 2. Robot Technology Used for Special Environment Key Laboratory of Sichuan Province, Mianyang 621000, China; 3. Department of Gastroenterology, Sichuan Mianyang 404 Hospital, Mianyang 621000, China
Keywords:
early esophageal cancer; contextual feature awareness; attention mechanism; dilated convolution; dual frequency upsampling
CLC number:
R318;R735.1
DOI:
10.3969/j.issn.1005-202X.2023.08.006
Document code:
A
Abstract:
Objective: To address the loss of detail such as lesion edges in early esophageal cancer image segmentation, a U-net-based segmentation network with a contextual feature awareness module and a dual frequency upsampling module is proposed. Methods: The contextual feature awareness module is improved with an attention mechanism and separable dilated convolution to capture global contextual information and extract more feature details. A dual frequency upsampling module is proposed that upsamples the high-frequency and low-frequency components separately, which reduces the jagged-edge artifacts produced by pixel interpolation and the checkerboard effect produced by transposed convolution when either upsampling method is used alone, and so reduces the loss of detail. Results: The mean intersection over union, sensitivity, and specificity of the proposed method reached 80.34%, 87.47%, and 91.53%, respectively. Conclusion: The proposed model outperforms mainstream semantic segmentation models such as nnU-Net, retains more detail, and improves the accuracy of early esophageal cancer image segmentation.
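The three metrics reported in the abstract can be illustrated with a short sketch. It is not the authors' evaluation code: the function names are hypothetical, masks are assumed to be binary (1 = lesion, 0 = background), and the mean intersection over union is assumed to be the average of the lesion and background IoU, since the abstract does not state how the averaging is done.

```python
from typing import Dict, Sequence, Tuple

# A mask is a 2-D grid of 0/1 labels (assumption: 1 = lesion, 0 = background).
Mask = Sequence[Sequence[int]]


def confusion_counts(pred: Mask, truth: Mask) -> Tuple[int, int, int, int]:
    """Count TP, FP, FN, TN over two equally sized binary masks."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of rows")
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        if len(pred_row) != len(truth_row):
            raise ValueError("mask rows must have the same length")
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator/denominator, or 1.0 when the class is absent."""
    return numerator / denominator if denominator else 1.0


def segmentation_metrics(pred: Mask, truth: Mask) -> Dict[str, float]:
    """Compute two-class mIoU, sensitivity, and specificity for one mask pair."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    iou_lesion = _ratio(tp, tp + fp + fn)
    iou_background = _ratio(tn, tn + fp + fn)
    return {
        # Assumption: mIoU averages the lesion and background classes.
        "miou": (iou_lesion + iou_background) / 2,
        # Sensitivity: fraction of true lesion pixels that were detected.
        "sensitivity": _ratio(tp, tp + fn),
        # Specificity: fraction of true background pixels kept as background.
        "specificity": _ratio(tn, tn + fp),
    }


if __name__ == "__main__":
    predicted = [[1, 1, 0], [0, 1, 0]]
    reference = [[1, 0, 0], [0, 1, 1]]
    print(segmentation_metrics(predicted, reference))
```

For the toy masks above there are 2 true positives, 1 false positive, 1 false negative, and 2 true negatives, so the mIoU is 0.5 and both sensitivity and specificity are 2/3. In practice the scores would be aggregated over a whole test set, and the aggregation used in the paper may differ from this per-image form.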

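Memo

Received: 2023-02-11. Funding: National Natural Science Foundation of China (62071399); Sichuan Science and Technology Program, Key R&D Projects (2021YFG0383, 2023YFG0262). About the author: MENG Yanzong, master's student; research interest: semantic segmentation of medical images; E-mail: 953590977@qq.com. Corresponding author: ZHOU Yingyue, associate researcher, PhD; research interest: image processing and analysis; E-mail: 147256027@qq.com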
Last Update: 2023-09-06