高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

融合动态伪影追踪与空频交互分析的跨域伪造检测

李子龙 杨高明 韩栋宇 方贤进

李子龙, 杨高明, 韩栋宇, 方贤进. 融合动态伪影追踪与空频交互分析的跨域伪造检测[J]. 电子与信息学报. doi: 10.11999/JEIT251290
引用本文: 李子龙, 杨高明, 韩栋宇, 方贤进. 融合动态伪影追踪与空频交互分析的跨域伪造检测[J]. 电子与信息学报. doi: 10.11999/JEIT251290
LI Zilong, YANG Gaoming, HAN Dongyu, FANG Xianjin. Cross-Domain Deepfake Detection with Dynamic Artifacts Tracking and Spatial-Frequency Interaction Analysis[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT251290
Citation: LI Zilong, YANG Gaoming, HAN Dongyu, FANG Xianjin. Cross-Domain Deepfake Detection with Dynamic Artifacts Tracking and Spatial-Frequency Interaction Analysis[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT251290

融合动态伪影追踪与空频交互分析的跨域伪造检测

doi: 10.11999/JEIT251290 cstr: 32379.14.JEIT251290
基金项目: 国家自然科学基金(52374155),安徽自然科学基金(2308085MF218),安徽高等学校科学研究(2022AH040113),安徽理工大学医学专项培育项目(YZ2023H2B007)
详细信息
    作者简介:

    李子龙:男,硕士生,研究方向为深度伪造检测

    杨高明:男,教授,研究方向为网络与信息安全、深度伪造检测

    韩栋宇:男,硕士生,研究方向为人脸伪造检测

    方贤进:男,教授,研究方向为网络与信息安全、智能计算

    通讯作者:

    杨高明 gmyang@aust.edu.cn

  • 中图分类号: TP391.41

Cross-Domain Deepfake Detection with Dynamic Artifacts Tracking and Spatial-Frequency Interaction Analysis

Funds: The National Natural Science Foundation of China (52374155), Anhui Provincial Natural Science Foundation (2308085MF218), Natural Science Research Project of Colleges and Universities in Anhui Province (2022AH040113), Medical Special Cultivation Project of Anhui University of Science and Technology (YZ2023H2B007)
  • 摘要: 针对跨域深度伪造检测中存在依赖静态伪影与固定频段、局限单一分析域、全局关联能力不足的问题,该文提出一种融合动态伪影追踪与空频交互分析的金字塔式交互双流网络(PIDSNet)。首先,通过多分支特征提取模块与频谱卷积模块实现伪影特征的动态挖掘,降低对固定参数和频段的依赖,显著提高伪影特征的自适应捕捉能力。其次,通过融合金字塔挤压注意力模块和多头自注意力机制,实现全局特征与局部特征提取的平衡。最后,在空域和频域分别构建高斯金字塔与拉普拉斯金字塔,多层次提取高频信息和低频信息并实现跨域特征融合,构建新型空频特征动态交互机制。实验结果表明,在包含25种生成对抗网络和扩散模型的伪造数据集中平均准确度提升7.4%,为深度伪造检测的跨域泛化性研究提供了新的方案。
  • 图  1  金字塔式交互双流网络(PIDSNet)

    图  2  多分支特征提取模块(MBFE)和频域特征提取模块(FDFE)

    图  3  金字塔空频交互模块(PSFI)

    图  4  多头金字塔挤压注意力模块(MHPSA)

    图  5  Grad-CAM可视化PIDSNet所提取的特征

    表  1  ProGAN测试集评估结果(%)

    MethodsSettingsProGANSettingsProGAN
    InputAcc.A.P.InputAcc.A.P.
    Wang[26]2类64.692.74类90.999.4
    F3Net[16]2类97.098.54类98.4100.0
    Frank[27]2类85.781.34类89.885.2
    BiHPF[28]2类87.487.44类90.286.2
    FrePGAN[29]2类98.299.14类98.599.9
    LGrad[13]2类98.899.24类98.6100.0
    FreqNet[15]2类99.699.94类99.8100.0
    PIDSNet2类99.799.94类99.9100.0
    下载: 导出CSV

    表  2  ForenSynths测试集2类训练设置评估(%)

    MethodsStyleGANStyleGAN2CycleGANStarGANMean
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    Wang[26]52.882.875.796.658.681.551.274.359.683.8
    F3Net[16]84.599.582.299.881.289.7100.0100.087.097.3
    Frank[27]73.168.575.070.986.580.885.077.079.974.3
    BiHPF[28]71.674.177.081.186.086.693.880.882.180.7
    FrePGAN[29]80.892.072.294.069.170.398.5100.080.289.1
    LGrad[13]88.397.687.597.582.790.796.597.788.895.9
    FreqNet[15]88.498.985.898.188.799.895.5100.089.699.2
    PIDSNet90.598.492.799.390.499.297.1100.092.799.2
    下载: 导出CSV

    表  3  ForenSynths测试集4类训练设置评估(%)

    MethodsStyleGANStyleGAN2CycleGANStarGANMean
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    Wang[26]63.891.476.497.572.788.663.890.869.292.1
    F3Net[16]92.699.788.099.876.484.399.599.889.195.9
    Frank[27]74.572.073.171.475.571.299.599.580.778.5
    BiHPF[28]76.975.176.274.781.978.994.494.482.480.8
    FrePGAN[29]80.789.684.198.671.174.499.9100.084.090.7
    LGrad[13]89.594.890.697.584.594.094.897.689.996.0
    FreqNet[15]90.299.788.099.595.899.685.799.890.099.7
    PIDSNet91.499.997.899.891.598.8100.0100.095.299.6
    下载: 导出CSV

    表  4  GANGen数据集4类训练设置评估(%)

    MethodsAttGANBEGANCramerGANInfoMaxGANMMDGAN
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    Wang[26]51.183.750.244.981.597.571.194.772.994.4
    F3Net[16]85.294.887.197.589.599.867.183.173.799.6
    LGrad[13]68.693.869.989.250.354.071.182.057.567.3
    FreqNet[15]88.698.197.599.493.597.892.196.691.797.6
    PIDSNet95.999.399.5100.099.099.998.199.698.099.6
    下载: 导出CSV

    表  5  GANGen数据集4类训练设置评估(%)

    MethodsRelGANS3GANSNGANSTGAN9 GANs Mean
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    Wang[26]53.382.155.266.162.790.463.092.762.382.9
    F3Net[16]98.8100.065.470.051.693.660.399.975.493.1
    LGrad[13]89.199.178.586.078.087.454.868.068.680.8
    FreqNet[15]97.999.584.588.781.795.395.499.191.496.9
    PIDSNet100.0100.080.587.194.598.494.1100.095.598.2
    下载: 导出CSV

    表  6  DiffusionForensics数据集4类训练设置评估(%)

    MethodsLDMPNDMVQ-DiffusionStable Diffusion V1Stable Diffusion V2Mean
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    F3Net[16]100.0100.072.899.599.999.973.497.299.8100.089.299.3
    LGrad[13]99.7100.069.598.596.2100.090.499.497.1100.090.699.6
    Ojha[11]82.297.175.392.583.597.756.490.471.592.473.894.0
    FreqNet[15]94.299.278.899.797.8100.062.493.478.092.182.296.9
    PIDSNet99.7100.079.399.998.099.899.4100.099.8100.095.498.2
    下载: 导出CSV

    表  7  Ojha数据集4类训练设置评估(%)

    MethodsGlide100_10Glide100_27Glide50_27LDM100LDM200LDM200_cfg
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    F3Net[16]88.395.487.094.588.595.474.184.073.483.380.789.1
    LGrad[13]89.494.987.493.290.795.194.899.294.299.195.999.2
    Ojha[11]90.197.090.797.291.197.490.597.090.297.177.388.6
    FreqNet[15]78.889.376.286.978.088.594.399.393.899.293.098.8
    PIDSNet96.599.595.199.195.999.296.599.696.799.596.099.4
    下载: 导出CSV

    表  8  各数据集4类训练设置评估(%)

    MethodsParamsForenSynthsGANGenDiffusionForensicsOjhaMean
    Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.Acc.A.P.
    Wang[26]18.6M73.593.562.382.9------
    F3Net[16]48.9M91.096.775.493.189.299.382.090.382.994.4
    LGrad[13]46.6M91.696.868.680.890.699.692.196.883.291.6
    FreqNet[15]2.1M91.999.791.496.982.296.985.793.788.396.7
    PIDSNet2.4M96.199.795.598.295.498.296.199.495.798.7
    下载: 导出CSV

    表  9  ForenSynths数据集消融实验(%)

    MBFE FDFE PSFI MHPSA mean Acc.
    91.3
    90.9
    87.1
    92.7
    92.8
    95.2
    下载: 导出CSV

    表  10  金字塔类型的消融实验(%)

    Spatial DomainFrequency Domainmean Acc.
    92.7
    Gaussian Pyramid93.1
    Laplacian Pyramid93.9
    Gaussian PyramidLaplacian Pyramid95.2
    下载: 导出CSV

    表  11  金字塔层数的消融实验(%)

    the number of pyramid levelsmean Acc.
    190.3
    290.9
    391.9
    495.2
    592.6
    下载: 导出CSV
  • [1] LIU Zhian, LI Maomao, ZHANG Yong, et al. Fine-grained face swapping via regional GAN inversion[C]. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023: 8578–8587. doi: 10.1109/CVPR52729.2023.00829.
    [2] ZHAO Wenliang, RAO Yongming, SHI Weikang, et al. DiffSwap: High-fidelity and controllable face swapping via 3D-aware masked diffusion[C]. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023: 8568–8577. doi: 10.1109/CVPR52729.2023.00828.
    [3] BALIAH S, LIN Qinliang, LIAO Shengcai, et al. Realistic and efficient face swapping: A unified approach with diffusion models[C]. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, USA, 2025: 1062–1071. doi: 10.1109/WACV61041.2025.00112.
    [4] YUAN Shuaiwei, DONG Junyu, and LI Yuezun. Where the devil hides: Deepfake detectors can no longer be trusted[C]. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA, 2025: 8764–8774. doi: 10.1109/CVPR52734.2025.00819.
    [5] HUANG Zhenglin, HU Jinwei, LI Xiangtai, et al. SIDA: Social media image deepfake detection, localization and explanation with large multimodal model[C]. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA, 2025: 28831–28841. doi: 10.1109/CVPR52734.2025.02685.
    [6] 丁峰, 匡仁盛, 周越, 等. 深度伪造及其取证技术综述[J]. 中国图象图形学报, 2024, 29(2): 295–317. doi: 10.11834/jig.230088.

    DING Feng, KUANG Rensheng, ZHOU Yue, et al. A survey of deepfake and related digital forensics[J]. Journal of Image and Graphics, 2024, 29(2): 295–317. doi: 10.11834/jig.230088.
    [7] CAO Junyi, MA Chao, YAO Taiping, et al. End-to-end reconstruction-classification learning for face forgery detection[C]. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022: 4113–4122. doi: 10.1109/CVPR52688.2022.00408.
    [8] SHIOHARA K and YAMASAKI T. Detecting deepfakes with self-blended images[C]. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, 2022: 18720–18729. doi: 10.1109/CVPR52688.2022.01816.
    [9] 张晶, 许盼, 刘文君, 等. 多样性负实例生成的跨域人脸伪造检测[J]. 中国图象图形学报, 2025, 30(2): 421–434 doi: 10.11834/jig.240160.

    ZHANG Jing, XU Pan, LIU Wenjun, et al. Negative instance generation for cross-domain facial forgery detection[J]. Journal of Image and Graphics, 2025, 30(2): 421–434. doi: 10.11834/jig.240160.
    [10] YAN Zhiyuan, LUO Yuhao, LYU Siwei, et al. Transcending forgery specificity with latent space augmentation for generalizable deepfake detection[C]. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2024: 8984–8994. doi: 10.1109/CVPR52733.2024.00858.
    [11] OJHA U, LI Yuheng, and LEE Y J. Towards universal fake image detectors that generalize across generative models[C]. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023: 24480–24489. doi: 10.1109/CVPR52729.2023.02345.
    [12] KASHIANI H, TALEMI N A, and AFGHAH F. FreqDebias: Towards generalizable deepfake detection via consistency-driven frequency debiasing[C]. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA, 2025: 8775–8785. doi: 10.1109/CVPR52734.2025.00820.
    [13] TAN Chuangchuang, ZHAO Yao, WEI Shikui, et al. Learning on gradients: Generalized artifacts representation for GAN-generated images detection[C]. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 2023: 12105–12114. doi: 10.1109/CVPR52729.2023.01165.
    [14] TAN Chuangchuang, LIU Huan, ZHAO Yao, et al. Rethinking the up-sampling operations in CNN-based generative network for generalizable deepfake detection[C]. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2024: 28130–28139. doi: 10.1109/CVPR52733.2024.02657.
    [15] TAN Chuangchuang, ZHAO Yao, WEI Shikui, et al. Frequency-aware deepfake detection: Improving generalizability through frequency space domain learning[C]. Proceedings of the 38th AAAI Conference on Artificial Intelligence, Vancouver, Canada, 2024: 5052–5060. doi: 10.1609/aaai.v38i5.28310.
    [16] QIAN Yuyang, YIN Guojun, SHENG Lu, et al. Thinking in frequency: Face forgery detection by mining frequency-aware clues[C]. Proceedings of the 16th European Conference on Computer Vision, Glasgow, UK, 2020: 86–103. doi: 10.1007/978-3-030-58610-2_6.
    [17] BINH L M and WOO S. ADD: Frequency attention and multi-view based knowledge distillation to detect low-quality compressed deepfake images[C]. Proceedings of the 36th AAAI Conference on Artificial Intelligence, Washington, USA, 2022: 122–130. doi: 10.1609/aaai.v36i1.19886.
    [18] WANG Bo, WU Xiaohan, WANG Fei, et al. Spatial-frequency feature fusion based deepfake detection through knowledge distillation[J]. Engineering Applications of Artificial Intelligence, 2024, 133: 108341. doi: 10.1016/j.engappai.2024.108341.
    [19] 孙磊, 张洪蒙, 毛秀青, 等. 基于超分辨率重建的强压缩深度伪造视频检测[J]. 电子与信息学报, 2021, 43(10): 2967–2975. doi: 10.11999/JEIT200531.

    SUN Lei, ZHANG Hongmeng, MAO Xiuqing, et al. Super-resolution reconstruction detection method for deepfake hard compressed videos[J]. Journal of Electronics & Information Technology, 2021, 43(10): 2967–2975. doi: 10.11999/JEIT200531.
    [20] 王艳, 孙钦东, 荣东柱, 等. 伪影间共性机理驱动的多域感知社交网络深度伪造视频检测[J]. 电子与信息学报, 2024, 46(9): 3713–3721. doi: 10.11999/JEIT240025.

    WANG Yan, SUN Qindong, RONG Dongzhu, et al. Deepfake video detection on social networks using multi-domain aware driven by common mechanism analysis between artifacts[J]. Journal of Electronics & Information Technology, 2024, 46(9): 3713–3721. doi: 10.11999/JEIT240025.
    [21] WANG Zhendong, BAO Jianmin, ZHOU Wengang, et al. Dire for diffusion-generated image detection[C]. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2023: 22445–22455. doi: 10.1109/ICCV51070.2023.02051.
    [22] HOODA A, MANGAOKAR N, FENG R, et al. D4: Detection of adversarial diffusion deepfakes using disjoint ensembles[C]. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, USA, 2024: 3812–3822. doi: 10.1109/WACV57701.2024.00377.
    [23] LIU Baoping, LIU Bo, DING Ming, et al. Detection of diffusion model-generated faces by assessing smoothness and noise tolerance[C]. 2024 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), Toronto, Canada, 2024: 1–6. doi: 10.1109/BMSB62888.2024.10608232.
    [24] ZHANG Hu, ZU Keke, LU Jian, et al. EPSANet: An efficient pyramid squeeze attention block on convolutional neural network[C]. Proceedings of the 16th Asian Conference on Computer Vision, Macao, China, 2023: 1161–1177. doi: 10.1007/978-3-031-26313-2_33.
    [25] COOLEY J W, LEWIS P A W, and WELCH P D. The fast Fourier transform and its applications[J]. IEEE Transactions on Education, 1969, 12(1): 27–34. doi: 10.1109/TE.1969.4320436.
    [26] WANG Shengyu, WANG O, ZHANG R, et al. CNN-generated images are surprisingly easy to spot. for now[C]. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, USA, 2020: 8695–8704. doi: 10.1109/CVPR42600.2020.00872.
    [27] FRANK J, EISENHOFER T, SCHÖNHERR L, et al. Leveraging frequency analysis for deep fake image recognition[C]. Proceedings of the 37th International Conference on Machine Learning, 2020: 3247–3258. (查阅网上资料, 未找到对应的出版地信息, 请确认).
    [28] JEONG Y, KIM D, MIN S, et al. BiHPF: Bilateral high-pass filters for robust deepfake detection[C]. 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, USA, 2022: 48–57. doi: 10.1109/WACV51458.2022.00293.
    [29] JEONG Y, KIM D, RO Y, et al. FrePGAN: Robust deepfake detection using frequency-level perturbations[C]. Proceedings of the 36th AAAI Conference on Artificial Intelligence, Washington, USA, 2022: 1060–1068. doi: 10.1609/aaai.v36i1.19990. (查阅网上资料,无法确认出版地信息是否正确).
  • 加载中
图(5) / 表(11)
计量
  • 文章访问数:  17
  • HTML全文浏览量:  9
  • PDF下载量:  3
  • 被引次数: 0
出版历程
  • 修回日期:  2026-05-12
  • 录用日期:  2026-05-12
  • 网络出版日期:  2026-06-05

目录

    /

    返回文章
    返回