高级搜索

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名
邮箱
手机号码
标题
留言内容
验证码

基于多视角特征提取和双边缘对比学习的图像篡改检测算法

徐壮 叶子奕 潘恩康 刘春晓

徐壮, 叶子奕, 潘恩康, 刘春晓. 基于多视角特征提取和双边缘对比学习的图像篡改检测算法[J]. 电子与信息学报. doi: 10.11999/JEIT251271
引用本文: 徐壮, 叶子奕, 潘恩康, 刘春晓. 基于多视角特征提取和双边缘对比学习的图像篡改检测算法[J]. 电子与信息学报. doi: 10.11999/JEIT251271
XU Zhuang, YE Ziyi, PAN Enkang, LIU Chunxiao. A Multi-View Feature Extraction and Dual-Edge Contrastive Learning Approach for Image Forgery Detection[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT251271
Citation: XU Zhuang, YE Ziyi, PAN Enkang, LIU Chunxiao. A Multi-View Feature Extraction and Dual-Edge Contrastive Learning Approach for Image Forgery Detection[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT251271

基于多视角特征提取和双边缘对比学习的图像篡改检测算法

doi: 10.11999/JEIT251271 cstr: 32379.14.JEIT251271
基金项目: 国家自然科学基金(61976188),浙江省自然科学基金(LY24F020004),国家级大学生创新训练计划(202510353027),浙江省大学生创新训练计划(S202510353076)
详细信息
    作者简介:

    徐壮:徐 壮:男,硕士生,研究方向为视觉安全与深度学习

    叶子奕:女,本科生,研究方向为视觉安全与深度学习

    潘恩康:男,本科生,研究方向为视觉安全与深度学习

    刘春晓:男,副教授,研究方向为计算机视觉与计算机图形学、机器学习与智能系统、视觉安全与隐私保护

    通讯作者:

    刘春晓 cxliu@mail.zjgsu.edu.cn

  • 中图分类号: TP393.08

A Multi-View Feature Extraction and Dual-Edge Contrastive Learning Approach for Image Forgery Detection

Funds: National Natural Science Foundation of China (61976188), Zhejiang Provincial Natural Science Foundation of China (LY24F020004), National College Students Innovation and Entrepreneurship Training Program (202510353027), Zhejiang Provincial College Students Innovation and Entrepreneurship Training Program (S202510353076)
  • 摘要: 图像篡改检测技术在新闻审查和司法鉴定等领域具有重要应用价值。针对现有方法在逐像素分类问题定义下存在的标签冲突问题,以及篡改线索挖掘多集中于空间域而忽略其它视角特征的问题,本文提出了一种基于图像内不一致性的图像篡改检测改进模型,及其基于多视角特征提取和双边缘对比学习的图像篡改检测算法。算法基于模型的思路实现。提出的模型针对局限性,能够有效避免标签冲突问题,增强对于篡改线索的挖掘力度,提升了泛化能力,克服了现有方法存在的问题。实验结果表明,本文方法在pF1与pIoU指标上相比现有主流方法平均提升了26.0%和10.1%。
  • 图  1  逐像素分类问题定义方式下标签冲突示例

    图  2  两种问题定义方式示例

    图  3  算法整体框架

    图  4  双边缘掩码求取示例

    图  5  可视化展示结果

    表  1  训练数据集信息

    数据集名称 图片数量 拼接 拷贝移动 后处理 物体移除
    CASIA-v2[27] 5105
    SP-COCO[29] 200k - -
    CM-COCO[29] 200k - -
    CM-RAISE[29] 200k - -
    CM-C-RAISE[29] 200k - -
    IMD2020[30] 2010
    下载: 导出CSV

    表  2  测试数据集信息

    数据集名称 图片数量 拼接 拷贝移动 后处理 物体移除
    NIST[23] 564
    Columbia[24] 180 - - -
    COVERAGE[25] 100 - -
    DSO[26] 95 - -
    CASIA-v1[27] 920
    下载: 导出CSV

    表  3  算法的整体性能表现(%)

    方法NISTColumbiaCOVERAGEDSOCASIA-v1平均指标
    pF1pIoUpF1pIoUpF1pIoUpF1pIoUpF1pIoUpF1pIoU
    MVSS-Net[19](ICCV 2021)35.6226.6277.7168.5450.7039.1440.4127.8058.6748.6852.6242.16
    PSCC-Net[6](TCSVT 2022)40.3431.4288.2082.1145.9434.5041.5828.6457.7247.6954.7644.87
    CAT-Net[17](CVPR 2022)43.1235.5495.4893.1851.9444.1530.4620.5981.5275.2460.5053.74
    TruFor[21](CVPR 2023)44.5538.0697.9193.0654.5747.2241.7532.4483.4078.2664.4457.81
    CoDE[14](TIFS 2024)42.0333.9088.1284.4146.4436.2140.7430.0072.3363.7457.9349.65
    SparseViT[9](AAAI 2025)43.1135.5297.4795.8158.3251.2639.7529.9283.0877.5464.3558.01
    FMAE[22](AAAI 2025)47.0539.2193.5490.2665.4257.1552.4340.3975.3768.1366.7659.03
    Mesorch[8](AAAI 2025)47.6540.4497.0995.1863.4256.3342.3532.5384.7279.2467.0560.74
    SFIRE[12](AAAI 2025)48.8840.7497.9294.5464.9655.2756.3647.4433.1426.1160.2552.82
    MPC[13](TIFS 2025)47.1339.1596.2394.6163.5954.8650.8139.2975.1769.6266.5959.51
    本文方法74.5647.0497.7495.5086.2868.5677.9254.5685.8568.7584.4766.88
    注:表中粗体表示最优值,下划线表示次优值。
    下载: 导出CSV

    表  4  主要模块的消融实验结果(%)

    空间分支 噪声分支 双边缘对比学习策略 NIST COVERAGE DSO 平均指标
    - - - 73.01 78.56 76.05 75.87
    - - 73.14 82.68 77.05 77.62
    - 74.03 84.03 78.12 78.73
    74.56 86.28 77.92 79.59
    注:表中粗体表示最优值。
    下载: 导出CSV

    表  5  噪声提取分支的消融实验结果(%)

    移除特定噪声分支拼接拷贝移动移除平均指标
    Noiseprint++70.0673.5969.5571.07
    SRM卷积74.3372.5671.0272.64
    Bayar卷积72.5670.2872.9271.92
    最大池化75.1572.8870.3472.79
    平均池化残差74.5073.2271.2572.99
    傅里叶变换72.5671.3370.5671.48
    不移除75.5873.9773.3474.56
    下载: 导出CSV

    表  6  边缘宽度的消融实验结果(%)

    边缘宽度NISTCOVERAGEDSO平均指标
    174.3385.5678.0279.30
    374.5686.2877.9279.59
    573.1584.8876.3478.12
    772.5682.3475.5676.82
    注:表中粗体表示最优值。
    下载: 导出CSV
  • [1] FARID H and LYU Siwei. Higher-order wavelet statistics and their application to digital forensics[C]. Proceedings of 2003 Conference on Computer Vision and Pattern Recognition Workshop, Madison, USA, 2003: 94. doi: 10.1109/CVPRW.2003.10093.
    [2] NIU Yakun, TONDI B, ZHAO Yao, et al. Image splicing detection, localization and attribution via JPEG primary quantization matrix estimation and clustering[J]. IEEE Transactions on Information Forensics and Security, 2021, 16: 5397–5412. doi: 10.1109/TIFS.2021.3129654.
    [3] PYATYKH S, HESSER J, and ZHENG Lei. Image noise level estimation by principal component analysis[J]. IEEE Transactions on Image Processing, 2013, 22(2): 687–699. doi: 10.1109/TIP.2012.2221728.
    [4] ZORAN D and WEISS Y. Scale invariance and noise in natural images[C]. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan, 2009: 2209–2216. doi: 10.1109/ICCV.2009.5459476.
    [5] 毕秀丽, 魏杨, 肖斌, 等. 基于级联卷积神经网络的图像篡改检测算法[J]. 电子与信息学报, 2019, 41(12): 2987–2994. doi: 10.11999/JEIT190043.

    BI Xiuli, WEI Yang, XIAO Bin, et al. Image forgery detection algorithm based on cascaded convolutional neural network[J]. Journal of Electronics & Information Technology, 2019, 41(12): 2987–2994. doi: 10.11999/JEIT190043.
    [6] LIU Xiaohong, LIU Yaojie, CHEN Jun, et al. PSCC-Net: Progressive spatio-channel correlation network for image manipulation detection and localization[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(11): 7505–7517. doi: 10.1109/TCSVT.2022.3189545.
    [7] QU Chenfan, ZHONG Yiwu, LIU Chongyu, et al. Towards modern image manipulation localization: A large-scale dataset and novel methods[C]. Proceedings of 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, USA, 2024: 10781–10790. doi: 10.1109/CVPR52733.2024.01025.
    [8] ZHU Xuekang, MA Xiaochen, SU Lei, et al. Mesoscopic insights: Orchestrating multi-scale & hybrid architecture for image manipulation localization[C]. Proceedings of the 39th AAAI Conference on Artificial Intelligence, Philadelphia, USA, 2025: 11022–11030. doi: 10.1609/aaai.v39i10.33198.
    [9] SU Lei, MA Xiaochen, ZHU Xuekang, et al. Can we get rid of handcrafted feature extractors? SparseViT: Nonsemantics-centered, parameter-efficient image manipulation localization through spare-coding transformer[C]. Proceedings of the 39th AAAI Conference on Artificial Intelligence, Philadelphia, USA, 2025: 7024–7032. doi: 10.1609/aaai.v39i7.32754.
    [10] 李树原, 严彩萍, 李红. 用于图像篡改检测的混合Transformer网络[J]. 计算机辅助设计与图形学学报, 2024, 36(12): 2010–2019. doi: 10.3724/SP.J.1089.2024.20099.

    LI Shuyuan, YAN Caiping, and LI Hong. A hybrid Transformer network for image splicing forgery detection[J]. Journal of Computer-Aided Design & Computer Graphics, 2024, 36(12): 2010–2019. doi: 10.3724/SP.J.1089.2024.20099.
    [11] KONG Chenqi, LUO Anwei, WANG Shiqi, et al. Pixel-inconsistency modeling for image manipulation localization[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025, 47(6): 4455–4472. doi: 10.1109/TPAMI.2025.3541028.
    [12] KWON M J, LEE W, NAM S H, et al. SAFIRE: Segment any forged image region[C]. Proceedings of the 39th AAAI Conference on Artificial Intelligence, Philadelphia, USA, 2025: 4437–4445. doi: 10.1609/aaai.v39i4.32467.
    [13] LOU Zijie, CAO Gang, GUO Kun, et al. Exploring multi-view pixel contrast for general and robust image forgery localization[J]. IEEE Transactions on Information Forensics and Security, 2025, 20: 2329–2341. doi: 10.1109/TIFS.2025.3541957.
    [14] PENG Rongxuan, TAN Shunquan, MO Xianbo, et al. Employing reinforcement learning to construct a decision-making environment for image forgery localization[J]. IEEE Transactions on Information Forensics and Security, 2024, 19: 4820–4834. doi: 10.1109/TIFS.2024.3381470.
    [15] 马杰, 钟斌斌, 焦亚男. 基于极坐标正弦变换的Copy-move篡改检测[J]. 电子与信息学报, 2020, 42(5): 1172–1178. doi: 10.11999/JEIT190481.

    MA Jie, ZHONG Binbin, and JIAO Yanan. Copy-move forgeries detection based on polar sine transform[J]. Journal of Electronics & Information Technology, 2020, 42(5): 1172–1178. doi: 10.11999/JEIT190481.
    [16] 王青, 张荣. 基于DCT系数双量化映射关系的图像盲取证算法[J]. 电子与信息学报, 2014, 36(9): 2068–2074. doi: 10.3724/SP.J.1146.2013.01488.

    WANG Qing and ZHANG Rong. Exposing digital image forgeries based on double quantization mapping relation of DCT coefficient[J]. Journal of Electronics & Information Technology, 2014, 36(9): 2068–2074. doi: 10.3724/SP.J.1146.2013.01488.
    [17] KWON M J, YU I J, NAM S H, et al. CAT-Net: Compression artifact tracing network for detection and localization of image splicing[C]. Proceedings of 2021 IEEE Winter Conference on Applications of Computer Vision, Waikoloa, USA, 2021: 375–384. doi: 10.1109/WACV48630.2021.00042.
    [18] WU Yue, ABDALMAGEED W, and NATARAJAN P. ManTra-Net: Manipulation tracing network for detection and localization of image forgeries with anomalous features[C]. Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019: 9535–9544. doi: 10.1109/CVPR.2019.00977.
    [19] DONG Chengbo, CHEN Xinru, HU Ruohan, et al. MVSS-Net: Multi-view multi-scale supervised networks for image manipulation detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(3): 3539–3553. doi: 10.1109/TPAMI.2022.3180556.
    [20] COZZOLINO D and VERDOLIVA L. Noiseprint: A CNN-based camera model fingerprint[J]. IEEE Transactions on Information Forensics and Security, 2020, 15: 144–159. doi: 10.1109/TIFS.2019.2916364.
    [21] GUILLARO F, COZZOLINO D, SUD A, et al. TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization[C]. Proceedings of 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, Canada, 2023: 20606–20615. doi: 10.1109/CVPR52729.2023.01974.
    [22] ZHU Jiaying, LI Dong, FU Xueyang, et al. A lottery ticket hypothesis approach with sparse fine-tuning and MAE for image forgery detection and localization[C]. Proceedings of the 39th AAAI Conference on Artificial Intelligence, Philadelphia, USA, 2025: 10968–10976. doi: 10.1609/aaai.v39i10.33192.
    [23] GUAN Haiying, KOZAK M, ROBERTSON E, et al. MFC datasets: Large-scale benchmark datasets for media forensic challenge evaluation[C]. Proceedings of 2019 IEEE Winter Applications of Computer Vision Workshops, Waikoloa, USA, 2019: 63–72. doi: 10.1109/WACVW.2019.00018.
    [24] HSU Y F and CHANG S F. Detecting image splicing using geometry invariants and camera characteristics consistency[C]. Proceedings of 2006 IEEE International Conference on Multimedia and Expo, Toronto, Canada, 2006: 549–552. doi: 10.1109/ICME.2006.262447.
    [25] WEN Bihan, ZHU Ye, SUBRAMANIAN R, et al. COVERAGE — a novel database for copy-move forgery detection[C]. Proceedings of 2016 IEEE International Conference on Image Processing, Phoenix, USA, 2016: 161–165. doi: 10.1109/ICIP.2016.7532339.
    [26] DE CARVALHO T J, RIESS C, ANGELOPOULOU E, et al. Exposing digital image forgeries by illumination color classification[J]. IEEE Transactions on Information Forensics and Security, 2013, 8(7): 1182–1194. doi: 10.1109/TIFS.2013.2265677.
    [27] DONG Jing, WANG Wei, and TAN Tieniu. CASIA image tampering detection evaluation database[C]. Proceedings of 2013 IEEE China Summit and International Conference on Signal and Information Processing, Beijing, China, 2013: 422–426. doi: 10.1109/ChinaSIP.2013.6625374.
    [28] WANG Jingdong, SUN Ke, CHENG Tianheng, et al. Deep high-resolution representation learning for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(10): 3349–3364. doi: 10.1109/TPAMI.2020.2983686.
    [29] KWON M J, NAM S H, YU I J, et al. Learning JPEG compression artifacts for image manipulation detection and localization[J]. International Journal of Computer Vision, 2022, 130(8): 1875–1895. doi: 10.1007/s11263-022-01617-5.
    [30] NOVOZÁMSKÝ A, MAHDIAN B, and SAIC S. IMD2020: A large-scale annotated dataset tailored for detecting manipulated images[C]. Proceedings of 2020 IEEE Winter Applications of Computer Vision Workshops, Snowmass, USA, 2020: 71–80. doi: 10.1109/WACVW50321.2020.9096940.
  • 加载中
图(5) / 表(6)
计量
  • 文章访问数:  16
  • HTML全文浏览量:  6
  • PDF下载量:  1
  • 被引次数: 0
出版历程
  • 修回日期:  2026-05-12
  • 录用日期:  2026-05-12
  • 网络出版日期:  2026-05-27

目录

    /

    返回文章
    返回