Information and Communications Technology and Policy

Information and Communications Technology and Policy, 2025, Vol. 51, Issue 10: 73-86. doi: 10.12267/j.issn.2096-5931.2025.10.011

A review of multimodal deepfake detection technology

WANG Ling1, YAN Kun2, NIE Peng2   

  1. Telecommunications Science and Technology Research Institute, Beijing 100191, China
  2. Intellectual Property and Innovation Development Center, China Academy of Information and Communications Technology, Beijing 100191, China
  • Received: 2025-05-10  Online: 2025-10-25  Published: 2025-11-06

Abstract:

The rapid development of deepfake technology has deepened the crisis of social trust and heightened security threats, and its abuse has spread from fake news and identity fraud to a wider range of scenarios. To meet these challenges, deepfake detection has gradually evolved from single-modal methods to multimodal fusion detection, which significantly improves detection accuracy and robustness by integrating multi-source information such as audio and visual signals. This paper first analyzes the characteristics and application scenarios of multimodal datasets. It then classifies and describes the technical methodology within a detection-localization-interpretation framework, evaluates the practical performance of existing detection platforms, and finally discusses future research directions. The aim is to construct a technical map of multimodal deepfake detection and to provide theoretical support and practical reference for the development of the field.

Key words: deepfake detection, multimodal deepfake detection, datasets

CLC Number: