Hybrid deepfake detection using cnn for spatial analysis and lstm for temporal consistency| International Journal of Innovative Science and Research Technology

Hybrid Deepfake Detection Using CNN for Spatial Analysis and LSTM for Temporal Consistency

Authors : Lakshmi Venkata Manikanta Maguluri; Hema Naga Vamsi Kothamasu; Shiny Duela Johnson

Volume/Issue : Volume 10 - 2025, Issue 5 - May

Google Scholar : https://tinyurl.com/bdkk24k9

Scribd : https://tinyurl.com/2vvwk8b7

DOI : https://doi.org/10.38124/ijisrt/25may346

PlumX Metrics

Semantic Scholar

ResearchGate

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Abstract : Deepfake technology, driven by advancements in artificial intelligence, enables the creation of highly realistic manipulated videos, posing significant threats to security, privacy, and misinformation. Traditional detection methods struggle to keep pace with the evolving sophistication of deepfake techniques. This study proposes a hybrid deep learning approach that leverages Convolutional Neural Networks (CNN) for feature extraction and Long Short-Term Memory (LSTM) networks for temporal sequence analysis to enhance deepfake detection accuracy. The CNN model captures spatial inconsistencies and artifacts in individual frames, while the LSTM network analyzes sequential dependencies to detect temporal anomalies indicative of deepfakes. Experimental evaluations on benchmark datasets demonstrate the effectiveness of the approach, achieving high accuracy in distinguishing real from fake videos. The proposed model offers a robust and scalable solution for deepfake detection, contributing to the fight against digital media manipulation and misinformation.

Keywords : Deepfake Detection, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Artificial Intelligence, Digital Media Forensics, Misinformation, Temporal Analysis, Feature Extraction, Fake Video Identification.

References :

U. Masud, M. Sadiq, S. Masood, M. Ahmad, A. El-Latif, and A. Ahmed, "LW-DeepFakeNet: A Lightweight Time Distributed CNN-LSTM Network for Real-Time DeepFake Video Detection," Signal, Image and Video Processing, pp. 1–9, 2023.
Y. Patel, S. Tanwar, P. Bhattacharya, R. Gupta, T. Alsuwian, and I. E. Davidson, "An Improved Dense CNN Architecture for Deepfake Image Detection," IEEE Access, vol. 11, pp. 22081–22095, 2023.
V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.
K. Warke, N. Dalavi, and S. Nahar, "DeepFake Detection Through Deep Learning Using ResNext CNN and LSTM," IEEE Transactions on Neural Networks and Learning Systems, vol. 10, no. 5, pp. 1–10, 2023.
G. H. Ishrak, Z. Mahmud, M. Z. A. Z. Farabe, T. K. Tinni, T. Reza, and M. Z. Parvez, "Explainable Deepfake Video Detection Using Convolutional Neural Network and CapsuleNet," arXiv preprint arXiv:2404.12841, 2024.
U. Masud, M. Sadiq, S. Masood, M. Ahmad, A. El-Latif, and A. Ahmed, "LW-DeepFakeNet: A Lightweight Time Distributed CNN-LSTM Network for Real-Time DeepFake Video Detection," Signal, Image and Video Processing, pp. 1–9, 2023.
Y. Patel, S. Tanwar, P. Bhattacharya, R. Gupta, T. Alsuwian, and I. E. Davidson, "An Improved Dense CNN Architecture for Deepfake Image Detection," IEEE Access, vol. 11, pp. 22081–22095, 2023.
V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.
K. Warke, N. Dalavi, and S. Nahar, "DeepFake Detection Through Deep Learning Using ResNext CNN and LSTM," IEEE Transactions on Neural Networks and Learning Systems, vol. 10, no. 5, pp. 1–10, 2023.
G. H. Ishrak, Z. Mahmud, M. Z. A. Z. Farabe, T. K. Tinni, T. Reza, and M. Z. Parvez, "Explainable Deepfake Video Detection Using Convolutional Neural Network and CapsuleNet," arXiv preprint arXiv:2404.12841, 2024.
V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.

Deepfake technology, driven by advancements in artificial intelligence, enables the creation of highly realistic manipulated videos, posing significant threats to security, privacy, and misinformation. Traditional detection methods struggle to keep pace with the evolving sophistication of deepfake techniques. This study proposes a hybrid deep learning approach that leverages Convolutional Neural Networks (CNN) for feature extraction and Long Short-Term Memory (LSTM) networks for temporal sequence analysis to enhance deepfake detection accuracy. The CNN model captures spatial inconsistencies and artifacts in individual frames, while the LSTM network analyzes sequential dependencies to detect temporal anomalies indicative of deepfakes. Experimental evaluations on benchmark datasets demonstrate the effectiveness of the approach, achieving high accuracy in distinguishing real from fake videos. The proposed model offers a robust and scalable solution for deepfake detection, contributing to the fight against digital media manipulation and misinformation.

CALL FOR PAPERS

Paper Submission Last Date
31 - July - 2025

Video Explanation for Published paper

CALL FOR PAPERS

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.