基于圖像知識增強(qiáng)的中文多模態(tài)反諷檢測方法
Chinese multimodal irony detection method based on image knowledgeenhancement
LI Yueying,CAO Hui, ZHANG Jisai, XIA Xiaotian (KeyLaboratoryofLinguisticandCulturalComputingMinistryofEducation,InstituteofChineseEthnicInformationTechnology, Northwest Minzu University,Lanzhou 73Oo3O,China)
Abstract:With therapid development of social mediaandonlinecontent,theuseof ironyhasbecomecommonin the onlinecommunicationandinformationdisemination.However,thetraditionaltextanalysismethodsoftenfailtocapturethe meaningofironyaccurately,andrelyingsolelyontextualinformationhaslimitationsandisofinstability.Inthispaper,a Chinesemultimodal ironydatasetisconstructed.Thedatasetincludes5964annotateddatasamples,includingtwomodesoftext andimage.Theimagesplayanimportantroleinmultimodalironydetectiontasks.Inordertofullexplorethehidden information inimages,theimagecaptioninggenerationmodelViT-GPT-image-captioningisusedtogeneratethedescription informationoftheimageforimageknowledgeenhancement,soastoenhancetheunderstandingandcognitionoftheimage. Moreover,amultimodalatentionnetwork modelCMANetthatintegratesmodal informationforironydetectionisproposedtoget ridof theinsuffcientinformationcorelationbetweenmodesandlackofdataintheprocessofmulti-modaldatafusion. Experimental verification was performed on the dataset. The results show that the F1 -score of the proposed CMANet model has been improved by 1.49% and its accuracyby 1.89% in comparison with those of the baseline model.
Keywords:multimodal; irony detection; attention mechanism; cross-modal; deep learning; network fusion
0 引言
反諷是一種特殊的情感表達(dá)技巧,讓人難以直接理解表達(dá)者的真實(shí)意圖,達(dá)到委婉而含蓄的表達(dá)目的。(剩余11168字)
-
-
- 現(xiàn)代電子技術(shù)
- 2025年13期
- 基于FNM-Net的輕量級遙感...
- 基于YOLOv8n的輕量化道路...
- 面向復(fù)雜場景目標(biāo)提取的顏色增強(qiáng)...
- 基于改進(jìn)U-Net的細(xì)胞核圖像...
- 基于深度學(xué)習(xí)和Retinex理...
- 基于注意力機(jī)制和ACT網(wǎng)絡(luò)的人...
- 基于改進(jìn)RT-DETR的小目標(biāo)...
- 基于級聯(lián)式逆殘差網(wǎng)絡(luò)的游戲圖像...
- 基于顯著性特征的多視角動作圖像...
- 新能源接人下移動通信傳輸網(wǎng)絡(luò)控...
- 基于FMCW毫米波雷達(dá)遠(yuǎn)于2m...
- 基于樣本重要性的分布式深度學(xué)習(xí)...
- 基于RDMMIMOOFDM雷達(dá)...
- T-BOI:一種融合時間和行為...
- 基于改進(jìn)的灰狼算法優(yōu)化BP神經(jīng)...
- 基于改進(jìn)的ResNet網(wǎng)絡(luò)和特...
- 基于深度特征融合的惡意軟件檢測...
- 融合雙通道特征信息的醫(yī)療短文本...
- 聲源定位系統(tǒng)的廣義二次互相關(guān)算...
- 基于GAIL方法的魚類個體運(yùn)動...
- 基于圖像知識增強(qiáng)的中文多模態(tài)反...
- 基于ZYNQ-7000和AD9...
- 基于自適應(yīng)采樣的全息圖像壓縮感...
- 基于電感電容的鋰離子電池組雙層...
- 基于獨(dú)立線長預(yù)測信息的低功耗驅(qū)...
- 基于YOLOv8的多功能導(dǎo)盲系...