Authors :
K. Sai Pragna; Yukta Paranjpe; Suraj Mallapur; Mukesh Marwade; Gayathri K.; Bharani Kumar Depuru; Srija Depuru; Bhargavi Depuru
Volume/Issue :
Volume 11 - 2026, Issue 4 - April
Google Scholar :
https://tinyurl.com/y6evcr3d
Scribd :
https://tinyurl.com/4t8pw4f8
DOI :
https://doi.org/10.38124/ijisrt/26apr1604
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Road traffic accidents remain a critical challenge due to the complex interaction of traffic conditions, signal
operations, and environmental factors. This paper presents a machine-learning-based approach to accident-risk
classification using traffic and signal data. The proposed system analyses a structured dataset comprising multiple features
related to traffic flow, road conditions, and contextual variables to predict accident-prone scenarios. A comprehensive data
preprocessing pipeline is implemented, including handling missing values, categorical encoding, feature scaling, and class
imbalance. Multiple classification models are evaluated to identify the most effective approach for risk prediction. The
results indicate that ensemble-based models achieve superior performance in capturing complex patterns within the data.
The final model is integrated into a deployment-ready framework that enables real-time accident-risk prediction via an
interactive interface. The proposed system supports proactive traffic management by identifying high-risk conditions in
advance, thereby improving road safety and decision-making.
Keywords :
Road Traffic Accidents, Accident Risk Prediction, Machine Learning, Ensemble Learning, Traffic Data Analysis, Intelligent Transportation Systems.
References :
- World Health Organization, "Global Status Report on Road Safety 2023," WHO, Geneva, 2023. Available: https://www.who.int/publications/i/item/9789240086517
- Federal Highway Administration (FHWA), "Traffic Detector Handbook, Third Edition," Publication No. FHWA-HRT-06-108, U.S. Department of Transportation, 2006.
- National Highway Traffic Safety Administration (NHTSA), "Fatality Analysis Reporting System (FARS)," U.S. Department of Transportation. Available: https://www.nhtsa.gov/research-data/fatality-analysis-reporting-system-fars
- N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: Synthetic Minority Over-sampling Technique," Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, 2002.
- T. Fawcett, "An Introduction to ROC Analysis," Pattern Recognition Letters, vol. 27, no. 8, pp. 861–874, 2006.
- S. Studer, T. B. Bui, C. Drescher, A. Hanuschkin, L. Winkler, S. Peters, and K.-R. Müller, "Towards CRISP-ML(Q): A Machine Learning Process Model with Quality Assurance Methodology," Machine Learning and Knowledge Extraction, vol. 3, no. 2, pp. 392–413, 2021.
- M. Hussain, B. Pu, and H. Hussain, "Real-Time Crash Risk Forecasting for Signalised Intersections Using Loop Detector Data," Scientific Reports, vol. 14, 2024, doi: 10.1038/s41598-024-XXXXX.
- Y. Zhang, X. Yao, H. Wang, and J. Wu, "Real-Time Traffic Conflict Prediction for Signalised Intersections Using Video-Derived Trajectories and Deep Learning with SHAP Interpretability," Transportation Research Part C, 2024.
- L. Breiman, "Random Forests," Machine Learning, vol. 45, no. 1, pp. 5–32, 2001.
- T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, 2016, pp. 785–794.
- S. M. Lundberg and S.-I. Lee, "A Unified Approach to Interpreting Model Predictions," in Advances in Neural Information Processing Systems (NeurIPS), 2017, pp. 4765–4774.
- S. Ahmed, M. A. Hossain, M. M. I. Bhuiyan, and S. K. Ray, "A Comparative Study of Machine Learning Algorithms to Predict Road Accident Severity," in Proceedings of the 20th International Conference on Ubiquitous Computing and Communications (IUCC/CIT/DSCI/SmartCNS), IEEE, 2021, pp. 390–397.
- F. N. Ogwueleka, S. Misra, T. C. Ogwueleka, and L. Fernandez-Sanz, "An Artificial Neural Network Model for Road Accident Prediction: A Case Study of a Developing Country," Acta Polytechnica Hungarica, vol. 11, no. 5, pp. 177–197, 2014.
- G. Shiran, R. Imaninasab, and R. Khayamim, "Crash Severity Analysis of Highways Based on Multinomial Logistic Regression Model, Decision Tree Techniques, and Artificial Neural Network: A Modelling Comparison," Sustainability, vol. 13, no. 10, p. 5670, 2021.
- A. M. Amiri, A. Sadri, N. Nadimi, and M. Shams, "A Comparison Between an Artificial Neural Network and a Hybrid Intelligent Genetic Algorithm in Predicting the Severity of Fixed Object Crashes Among Elderly Drivers," Accident Analysis & Prevention, vol. 138, art. 105468, 2020.
- S. Mokhtarimousavi, J. C. Anderson, A. Azizinamini, and M. Hadi, "Improved Support Vector Machine Models for Work Zone Crash Injury Severity Prediction and Analysis," Transportation Research Record, vol. 2673, no. 11, pp. 680–692, 2019.
- Z. Li, P. Liu, W. Wang, and C. Xu, "Using Support Vector Machine Models for Crash Injury Severity Analysis," Accident Analysis & Prevention, vol. 45, pp. 478–486, 2012.
- M. Yan and Y. Shen, "Traffic Accident Severity Prediction Based on Random Forest," Sustainability, vol. 14, no. 3, p. 1729, 2022.
- A. B. Parsa, A. Movahedi, H. Taghipour, S. Derrible, and A. K. Mohammadian, "Toward Safer Highways: Application of XGBoost and SHAP for Real-Time Accident Detection and Feature Analysis," Accident Analysis & Prevention, vol. 136, art. 105405, 2020.
- Y. Qu, Z. Lin, H. Li, and X. Zhang, "Feature Recognition of Urban Road Traffic Accidents Based on GA-XGBoost in the Context of Big Data," IEEE Access, vol. 7, pp. 170106–170115, 2019.
- Z. Ma, G. Mei, and S. Cuomo, "An Analytic Framework Using Deep Learning for the Prediction of Traffic Accident Injury Severity Based on Contributing Factors," Accident Analysis & Prevention, vol. 160, art. 106322, 2021.
- V. Adewopo, N. Elsayed, Z. Elsayed, M. Ozer, V. Wangia-Anderson, and A. Abdelgawad, "AI on the Road: A Comprehensive Analysis of Traffic Accidents and Accident Detection Systems in Smart Cities," arXiv:2307.12128, 2023.
- Y. Li, M. Li, J. Yuan, J. Lu, and M. Abdel-Aty, "Analysis and Prediction of Intersection Traffic Violations Using Automated Enforcement System Data," Accident Analysis & Prevention, vol. 162, art. 106422, 2021.
- B. Sharma, V. K. Katiyar, and K. Kumar, "Traffic Accident Prediction Model Using Support Vector Machines with a Gaussian Kernel," in Proceedings of the Fifth International Conference on Soft Computing for Problem Solving (SocProS), Springer, Singapore, 2016, pp. 1–10.
- N. Formosa, M. Quddus, S. Ison, M. Abdel-Aty, and J. Yuan, "Predicting Real-Time Traffic Conflicts Using Deep Learning," Accident Analysis & Prevention, vol. 136, art. 105429, 2020.
- H. Ren, Y. Song, J. Wang, Y. Hu, and J. Lei, "A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction," in Proceedings of the 21st International Conference on Intelligent Transportation Systems (ITSC), IEEE, 2018, pp. 3346–3351.
- Z. Yuan, X. Zhou, and T. Yang, "Hetero-ConvLSTM: A Deep Learning Approach to Traffic Accident Prediction on Heterogeneous Spatiotemporal Data," in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ACM, New York, 2018, pp. 984–992.
- L. Lin, Q. Wang, and A. W. Sadek, "A Novel Variable Selection Method Based on a Frequent Pattern Tree for Real-Time Traffic Accident Risk Prediction," Transportation Research Part C: Emerging Technologies, vol. 55, pp. 444–459, 2015.
- C. C. Ihueze and U. O. Onwurah, "Road Traffic Accidents Prediction Modelling: An Analysis of Anambra State, Nigeria," Accident Analysis & Prevention, vol. 112, pp. 21–29, 2018.
Road traffic accidents remain a critical challenge due to the complex interaction of traffic conditions, signal
operations, and environmental factors. This paper presents a machine-learning-based approach to accident-risk
classification using traffic and signal data. The proposed system analyses a structured dataset comprising multiple features
related to traffic flow, road conditions, and contextual variables to predict accident-prone scenarios. A comprehensive data
preprocessing pipeline is implemented, including handling missing values, categorical encoding, feature scaling, and class
imbalance. Multiple classification models are evaluated to identify the most effective approach for risk prediction. The
results indicate that ensemble-based models achieve superior performance in capturing complex patterns within the data.
The final model is integrated into a deployment-ready framework that enables real-time accident-risk prediction via an
interactive interface. The proposed system supports proactive traffic management by identifying high-risk conditions in
advance, thereby improving road safety and decision-making.
Keywords :
Road Traffic Accidents, Accident Risk Prediction, Machine Learning, Ensemble Learning, Traffic Data Analysis, Intelligent Transportation Systems.