基于特征融合的音頻偽造檢測方法
關(guān)鍵詞:音頻深度偽造檢測;深度學(xué)習(xí);特征融合;聲碼器偽跡
中圖分類號:TN912.3 文獻(xiàn)標(biāo)志碼:A 文章編號:1001-3695(2025)07-025-2109-07
doi:10.19734/j.issn.1001-3695.2024.11.0460
Abstract:Advancements inartificialinteligence have madedistinguishingsynthesized speech fromgenuinespeech increasinglychallenging,complicating audio deepfake detection.Existing methods often exhibit low acuracy,poor generalization, and weakrobustness.Thisstudy proposed MFF-STViT,amethod integratingthreeaudio features with vocoderartifactfeatures through anovelfeature fusionmoduletoenhance representation.The fused features were processdusing animproved Transformer model,STViT,toreduce redundancyand improve detectionperformance.Onthe ASVspoof2019LA testset,the method reduced the equal error rate(EER)by 71.38% on average. On the ASVspoof2O21 LA dataset, it achieved average reductions of 44.41% in EERand 18.11% intheminimum tandem detection cost function(min-tDCF).For the ASVspoof2021 DF dataset, the average EER decreased by 57.81% ,with reductions exceeding 80% in specific partitions. These findings demonstrate the efectiveness of MFF-STViT in improving accuracy,generalization,and robustness.
Keywords:audio deepfake detection;deep learning;feature fusion;vocoder artifacts
0 引言
近年來,自動說話人確認(rèn)(automaticspeakerverification,ASV)系統(tǒng)因其采集方式簡便、特異性高、成本低等優(yōu)點被廣泛應(yīng)用于語音郵件、電話銀行、呼叫中心、生物特征認(rèn)證、法醫(yī)應(yīng)用等領(lǐng)域[1]。(剩余19472字)
-
-
- 計算機(jī)應(yīng)用研究
- 2025年07期
- 多模態(tài)行人重識別研究綜述...
- 語義通信在邊緣算力網(wǎng)絡(luò)中的應(yīng)用...
- 基于同態(tài)加密和零知識證明的區(qū)塊...
- HyperledgerFabr...
- PMoE:在P-tuning中...
- 基于大語言模型的多任務(wù)生成式重...
- 基于圖文對比融合的圖像人物情感...
- 基于深度特征交互與層次化多模態(tài)...
- 反向聚焦細(xì)粒度多模態(tài)語義對齊的...
- 基于CLIP文本特征增強(qiáng)的剪紙...
- 基于完整超圖神經(jīng)網(wǎng)絡(luò)的捆綁推薦...
- 基于高階鄰域信息交互的自監(jiān)督異...
- 基于超圖和分層頻譜濾波器的序列...
- 針對圖像指代分割的訓(xùn)練后量化策...
- 基于信息互補(bǔ)與交叉注意力的跨模...
- 基于強(qiáng)化學(xué)習(xí)協(xié)同進(jìn)化算法求解柔...
- 融合實體鄰域信息的時序知識圖譜...
- 互補(bǔ)盲點策略和U型Transf...
- SP-POMDP:堆疊物體抓取...
- 基于果蠅協(xié)同算法求解雙目標(biāo)混裝...
- 優(yōu)化時間窗改進(jìn)Dijkstra...
- 帶頻繁區(qū)域的空間并置模式挖掘方...
- 輔助任務(wù)增強(qiáng)的知識追蹤方法...
- 基于沖突避讓的多智能體有效旁路...
- 基于特征融合的音頻偽造檢測方法...
- 基于多視圖舌象特征融合的中醫(yī)證...
- 多元異構(gòu)耦合網(wǎng)絡(luò)中競爭性輿情信...
- 基于增強(qiáng)控制流圖與孿生網(wǎng)絡(luò)架構(gòu)...
- 獎勵回溯DQN驅(qū)動的多QoS工...
- 基于QUIC的擁塞控制算法動態(tài)...
- CN2Conv:面向物聯(lián)網(wǎng)設(shè)備...
- 面向物流數(shù)據(jù)共享的可撤銷屬性加...
- 一種具有多級安全目標(biāo)的動態(tài)對稱...
- 基于雙向數(shù)據(jù)流分析與圖抽象嵌入...
- 基于比特切片技術(shù)與指令集的LE...
- 基于隨機(jī)投影與改進(jìn)min-ma...
- 結(jié)合自適應(yīng)局部圖卷積與多尺度時...
- 基于圖元變換的建筑彩繪紋樣圖像...
- 雙流特征增強(qiáng)與融合的弱監(jiān)督時序...
- 多尺度降噪自編碼器的遮擋行人重...
- 基于深度正則化的三維高斯人體重...
- 基于雙曲空間的無監(jiān)督視頻異常檢...