融合注意力機(jī)制的改進(jìn)型DeepLabv3十語義分割
Improved DeepLabv 3+ semantic segmentation incorporating attention mechanisms
YANHe,LEIQiuxia,WANG Xu
(Liangjiang College of Artificial Intelligence, Chongqing University of Technology , Chongqing 401135,China) * Corresponding author,E-mail: yanhe@ cqut. edu. cn
Abstract:To address the chalenges of high computational complexity,limited detail extraction,and fuzzy boundaries in the current DeepLabv3+ semantic segmentation network,this study proposes an enhanced DeepLabv 3+ model incorporating attention mechanisms. Specifically,the lightweight MobileNetV2 is employed as the backbone to balance high representational capacity with a significant reduction in model parameters. A parameter-freelightweight atention mechanism(SimAM) is integrated into the lowlevel features of the backbone network to prioritize key features and enhance feature extraction capabilities. Furthermore,the global average pooling in the ASPP module is replaced with Haar Wavelet Transform Downsampling (HWD) to preserve spatial information. An External Attention Mechanism(EANet) is also introduced after the ASPP module to leverage contextual information and achieve multi-scale feature fusion,thereby improving semantic understanding and segmentation accuracy. Experimental results demonstrate that the proposed model achieves a 2.82% improvement in mean Intersection over Union(mIoU) on the VOC2Ol2 dataset compared to the original DeepLabv 3+ model. This research enhances the precision of semantic segmentation and ofers novel insights for advancing applications in computer vision.
Keywords:semantic segmentation;DeepLabv3 + ;Haar wavelet transform downsampling;External Attention(EANet) ;multi-scale integration
1引言
語義分割是計(jì)算機(jī)視覺領(lǐng)域中的一項(xiàng)重要任務(wù),旨在將圖像中的每個(gè)像素分配到不同的語義類別中[1]。(剩余15242字)
-
-
- 光學(xué)精密工程
- 2025年01期
- 基于反格雷碼輔助相移法的高魯棒...
- 基于多位姿標(biāo)靶的激光跟蹤姿態(tài)測(cè)...
- 膜厚監(jiān)控系統(tǒng)準(zhǔn)直聚焦耦合光路的...
- 基于雙投射結(jié)構(gòu)光系統(tǒng)的多尺度點(diǎn)...
- 受激布里淵散射自調(diào) Q 產(chǎn)生高...
- 網(wǎng)格激光高反抑制與中心提取誤差...
- 光纖準(zhǔn)直非球面微透鏡陣列模壓成...
- 鎳渣改質(zhì)及其在磁性復(fù)合流體拋光...
- 面向柔性鉗位型尺致動(dòng)器提速驅(qū)動(dòng)...
- 基于紋理奇異值分解的全參考圖像...
- 融合注意力機(jī)制的改進(jìn)型Deep...
- 多模態(tài)語義交互的文本圖像超分辨...
- 條件擴(kuò)散和多通道高低頻并行的紅...