HPD: A Hybrid ML System for Real-Time Phishing Website Detection


Authors : Padma Priya S.; Swetha R.; Thirishya M.

Volume/Issue : ICMST-2025

Google Scholar : https://tinyurl.com/26ss6fp9

Scribd : https://tinyurl.com/2u8rzndy

DOI : https://doi.org/10.38124/ijisrt/25nov754

Abstract : Phishing remains one of the most widespread and evolving cyber threats, deceiving users through fake websites that mimic legitimate platforms to steal sensitive information. Traditional detection methods such as blacklists, signature- based filters, and manual verification fail to recognize newly emerging or obfuscated phishing sites. To overcome these challenges, this paper presents HPD (Hybrid Phishing Detection), a real-time hybrid machine learning framework that integrates heuristic analysis with advanced classification algorithms to improve accuracy and response time. The system extracts and analyzes multiple feature sets including lexical features from URLs, host-based parameters from domain registration data, and content-based attributes from webpage HTML structure and scripts. These features are processed using a hybrid ensemble model combining Random Forest, Support Vector Machine (SVM), and Logistic Regression, ensuring higher detection precision and reduced false positives. Experimental analysis using benchmark phishing datasets demonstrates that HPD achieves more than 97% accuracy, outperforming traditional single-model systems. The lightweight and scalable design allows deployment as a browser extension or through RESTful APIs for real-time threat detection. By enabling adaptive learning and integration with cloud-based threat intelligence, HPD offers a proactive and reliable solution for combating modern phishing attacks and enhancing web security.

Keywords : Phishing Detection, Cybersecurity, Hybrid Machine Learning, Real-Time System, Ensemble Learning, URL Analysis, Website Security.

Phishing remains one of the most widespread and evolving cyber threats, deceiving users through fake websites that mimic legitimate platforms to steal sensitive information. Traditional detection methods such as blacklists, signature- based filters, and manual verification fail to recognize newly emerging or obfuscated phishing sites. To overcome these challenges, this paper presents HPD (Hybrid Phishing Detection), a real-time hybrid machine learning framework that integrates heuristic analysis with advanced classification algorithms to improve accuracy and response time. The system extracts and analyzes multiple feature sets including lexical features from URLs, host-based parameters from domain registration data, and content-based attributes from webpage HTML structure and scripts. These features are processed using a hybrid ensemble model combining Random Forest, Support Vector Machine (SVM), and Logistic Regression, ensuring higher detection precision and reduced false positives. Experimental analysis using benchmark phishing datasets demonstrates that HPD achieves more than 97% accuracy, outperforming traditional single-model systems. The lightweight and scalable design allows deployment as a browser extension or through RESTful APIs for real-time threat detection. By enabling adaptive learning and integration with cloud-based threat intelligence, HPD offers a proactive and reliable solution for combating modern phishing attacks and enhancing web security.

Keywords : Phishing Detection, Cybersecurity, Hybrid Machine Learning, Real-Time System, Ensemble Learning, URL Analysis, Website Security.

CALL FOR PAPERS


Paper Submission Last Date
30 - November - 2025

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe