Hybrid Deepfake Detection Using CNN for Spatial Analysis and LSTM for Temporal Consistency


Authors : Lakshmi Venkata Manikanta Maguluri; Hema Naga Vamsi Kothamasu; Shiny Duela Johnson

Volume/Issue : Volume 10 - 2025, Issue 5 - May


Google Scholar : https://tinyurl.com/bdkk24k9

Scribd : https://tinyurl.com/2vvwk8b7

DOI : https://doi.org/10.38124/ijisrt/25may346

Google Scholar

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Note : Google Scholar may take 15 to 20 days to display the article.


Abstract : Deepfake technology, driven by advancements in artificial intelligence, enables the creation of highly realistic manipulated videos, posing significant threats to security, privacy, and misinformation. Traditional detection methods struggle to keep pace with the evolving sophistication of deepfake techniques. This study proposes a hybrid deep learning approach that leverages Convolutional Neural Networks (CNN) for feature extraction and Long Short-Term Memory (LSTM) networks for temporal sequence analysis to enhance deepfake detection accuracy. The CNN model captures spatial inconsistencies and artifacts in individual frames, while the LSTM network analyzes sequential dependencies to detect temporal anomalies indicative of deepfakes. Experimental evaluations on benchmark datasets demonstrate the effectiveness of the approach, achieving high accuracy in distinguishing real from fake videos. The proposed model offers a robust and scalable solution for deepfake detection, contributing to the fight against digital media manipulation and misinformation.

Keywords : Deepfake Detection, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Artificial Intelligence, Digital Media Forensics, Misinformation, Temporal Analysis, Feature Extraction, Fake Video Identification.

References :

  1. U. Masud, M. Sadiq, S. Masood, M. Ahmad, A. El-Latif, and A. Ahmed, "LW-DeepFakeNet: A Lightweight Time Distributed CNN-LSTM Network for Real-Time DeepFake Video Detection," Signal, Image and Video Processing, pp. 1–9, 2023.
  2. Y. Patel, S. Tanwar, P. Bhattacharya, R. Gupta, T. Alsuwian, and I. E. Davidson, "An Improved Dense CNN Architecture for Deepfake Image Detection," IEEE Access, vol. 11, pp. 22081–22095, 2023.
  3. V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.
  4. K. Warke, N. Dalavi, and S. Nahar, "DeepFake Detection Through Deep Learning Using ResNext CNN and LSTM," IEEE Transactions on Neural Networks and Learning Systems, vol. 10, no. 5, pp. 1–10, 2023.
  5. G. H. Ishrak, Z. Mahmud, M. Z. A. Z. Farabe, T. K. Tinni, T. Reza, and M. Z. Parvez, "Explainable Deepfake Video Detection Using Convolutional Neural Network and CapsuleNet," arXiv preprint arXiv:2404.12841, 2024.
  6. U. Masud, M. Sadiq, S. Masood, M. Ahmad, A. El-Latif, and A. Ahmed, "LW-DeepFakeNet: A Lightweight Time Distributed CNN-LSTM Network for Real-Time DeepFake Video Detection," Signal, Image and Video Processing, pp. 1–9, 2023.
  7. Y. Patel, S. Tanwar, P. Bhattacharya, R. Gupta, T. Alsuwian, and I. E. Davidson, "An Improved Dense CNN Architecture for Deepfake Image Detection," IEEE Access, vol. 11, pp. 22081–22095, 2023.
  8. V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.
  9. K. Warke, N. Dalavi, and S. Nahar, "DeepFake Detection Through Deep Learning Using ResNext CNN and LSTM," IEEE Transactions on Neural Networks and Learning Systems, vol. 10, no. 5, pp. 1–10, 2023.
  10. G. H. Ishrak, Z. Mahmud, M. Z. A. Z. Farabe, T. K. Tinni, T. Reza, and M. Z. Parvez, "Explainable Deepfake Video Detection Using Convolutional Neural Network and CapsuleNet," arXiv preprint arXiv:2404.12841, 2024.
  11. V. N. Tran, S. H. Lee, H. S. Le, and K. R. Kwon, "High Performance Deepfake Video Detection on CNN-Based with Attention Target-Specific Regions and Manual Distillation Extraction," Applied Sciences, vol. 11, no. 16, pp. 76–78, 2021.

Deepfake technology, driven by advancements in artificial intelligence, enables the creation of highly realistic manipulated videos, posing significant threats to security, privacy, and misinformation. Traditional detection methods struggle to keep pace with the evolving sophistication of deepfake techniques. This study proposes a hybrid deep learning approach that leverages Convolutional Neural Networks (CNN) for feature extraction and Long Short-Term Memory (LSTM) networks for temporal sequence analysis to enhance deepfake detection accuracy. The CNN model captures spatial inconsistencies and artifacts in individual frames, while the LSTM network analyzes sequential dependencies to detect temporal anomalies indicative of deepfakes. Experimental evaluations on benchmark datasets demonstrate the effectiveness of the approach, achieving high accuracy in distinguishing real from fake videos. The proposed model offers a robust and scalable solution for deepfake detection, contributing to the fight against digital media manipulation and misinformation.

Keywords : Deepfake Detection, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Artificial Intelligence, Digital Media Forensics, Misinformation, Temporal Analysis, Feature Extraction, Fake Video Identification.

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe