Authors :
Amera M. Brash; Dr. Mohamed Dweib
Volume/Issue :
Volume 10 - 2025, Issue 1 - January
Google Scholar :
https://tinyurl.com/4swmfajd
Scribd :
https://tinyurl.com/565uf49b
DOI :
https://doi.org/10.5281/zenodo.14769413
Abstract :
This study investigates estimating ten-year Coronary Heart Disease risk (CHD). It found that age and systolic
blood pressure play crucial roles in making these predictions. Among various models tested, the Support Vector Machine
(SVM) classifier performed best, showing high accuracy and F1 score. Although decision tree demonstrated slightly better
accuracy, the SVM outperformed other models in most metrics. Balancing the dataset using SMOTE improved sensitivity.
The study suggests that with more data, especially from minority groups, the models could become even more accurate in
predicting CHD risk.
Keywords :
Coronary Heart Disease, AI, Machine Learning, Deep Learning, Exploratory Data Analysis, Diagnostic Algorithms Logistic regression, K-NN, Decision Trees, SVM, Random Forest.
References :
- World Health Organization Cardiovascular Diseases (CVDs) [(accessed on 10 January 2022)]. Available online: https://www.afro.who.int/health-topics/cardiovascular-diseases
- [(accessed on 10 January 2022)]. Available online: https://www.heart.org/en/health-topics/high-blood-pressure/why-high-blood-pressure-is-a-silent-killer/know-your-risk-factors-for-high-blood-pressure
- Alaa, A. M., Bolton, T., Di Angelantonio, E., Rudd, J. H. F., & van der Schaar, M. (2019). Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants. PloS One, 14(5), e0213653. https://doi.org/10.1371/journal.pone.0213653
- Palaniappan, S., & Awang, R. (2008). Intelligent heart disease prediction system using data mining techniques. 2008 IEEE/ACS International Conference on Computer Systems and Applications.
- Kumar Dubey, A., Choudhary, K., & Sharma, R. (2021). Predicting heart disease based on influential features with machine learning. Intelligent Automation & Soft Computing, 30(3), 929–943. https://doi.org/10.32604/iasc.2021.018382
- (N.d.). Nih.gov. Retrieved October 15, 2024, from https://www.ncbi.nlm.nih.gov/pmc/
- Sk, K. B., Roja, Priya, S. S., Dalavi, L., Vellela, S. S., & Reddy, V. (2023). Coronary heart disease prediction and classification using hybrid machine learning algorithms. 2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA), 7, 1–7.
- https://doi.org/10.1109/ICIDCA56705.2023.10099579
- Melillo, P., De Luca, N., Bracale, M., & Pecchia, L. (2013). Classification tree for risk assessment in patients suffering from congestive heart failure via long-term heart rate variability. IEEE Journal of Biomedical and Health Informatics, 17(3), 727–733. https://doi.org/10.1109/JBHI.2012.2236579
- Rahhal, M. M. A., Bazi, Y., AlHichri, H., Alajlan, N., Melgani, F., & Yager, R. R. (2016). Deep learning approach for active classification of electrocardiogram signals. Information Sciences, 345, 340–354. https://doi.org/10.1016/j.ins.2016.01.082
- Guidi, G., Pettenati, M. C., Melillo, P., & Iadanza, E. (2014). A machine learning system to improve heart failure patient assistance. IEEE Journal of Biomedical and Health Informatics, 18(6), 1750–1756. https://doi.org/10.1109/jbhi.2014.2337752
- Keogh, E., & Mueen, A. (2017). Curse of dimensionality. In Encyclopedia of Machine Learning and Data Mining (pp. 314–315). Springer US.
- Wettschereck, D., & Dietterich, T. G. (1995). Machine Learning, 19(1), 5–27. https://doi.org/10.1023/a:1022603022740
- Wettschereck, D., Aha, D. W., & Mohri, T. (1997). Artificial Intelligence Review, 11(1/5), 273–314. https://doi.org/10.1023/a:1006593614256
- Dun, B., Wang, E., & Majumder, S. (2016). Heart disease diagnosis on medical data using ensemble learning. Proceedings of the International Conference on Big Data, 1–6.
- Singh, R. S., Saini, B. S., & Sunkaria, R. K. (2018). Detection of coronary artery disease by reduced features and extreme learning machine. Medicine and Pharmacy Reports, 91(2), 166–175. https://doi.org/10.15386/cjmed-882
- Yaghouby, F., Ayatollahi, A., & Soleimani, R. (2009). Classification of cardiac abnormalities using reduced features of heart rate variability signal. World Applied Sciences Journal, 6(11), 1547–1554.
- Asl, B. M., Setarehdan, S. K., & Mohebbi, M. (2008). Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal. Artificial Intelligence in Medicine, 44(1), 51–64. https://doi.org/10.1016/j.artmed.2008.04.007
- Guyon, I., Gunn, S., Nikravesh, M., & Zadeh, L. A. (2008). Feature extraction: Foundations and applications. Springer.
- Rajagopal, R., & Ranganathan, V. (2017). Evaluation of effect of unsupervised dimensionality reduction techniques on automated arrhythmia classification. Biomedical Signal Processing and Control, 34, 1–8. https://doi.org/10.1016/j.bspc.2016.12.017
- Kamencay, P., Hudec, R., Benco, M., & Zachariasova, M. (2013). Feature extraction for object recognition using PCA-KNN with application to medical image analysis. 2013 36th International Conference on Telecommunications and Signal Processing (TSP).
- UCI Machine Learning Repository. (2020). Heart disease data set. http://archive.ics.uci.edu/ml/datasets/heart+disease
- Khan, S. S., & Quadri, S. M. K. (2016). Prediction of angiographic disease status using rule-based data mining techniques. Research Trend. Retrieved from http://www.researchtrend.net
- Throughout life, heart attacks are twice as common in men than women. (2016, November 8). Harvard Health. https://www.health.harvard.edu/heart-health/throughout-life-heart-attacks-are-twice-as-common-in-men-than-women
- Trigka, M., & Dritsas, E. (2023). Long-term coronary artery disease risk prediction with machine learning models. Sensors (Basel, Switzerland), 23(3), 1193. https://doi.org/10.3390/s23031193
- Hammoud, A., Karaki, A., Tafreshi, R., Abdulla, S., & Wahid, M. (2024). Coronary heart disease prediction: A comparative study of machine learning algorithms. Journal of Advances in Information Technology, 15(1), 27–32. https://doi.org/10.12720/jait.15.1.27-32
This study investigates estimating ten-year Coronary Heart Disease risk (CHD). It found that age and systolic
blood pressure play crucial roles in making these predictions. Among various models tested, the Support Vector Machine
(SVM) classifier performed best, showing high accuracy and F1 score. Although decision tree demonstrated slightly better
accuracy, the SVM outperformed other models in most metrics. Balancing the dataset using SMOTE improved sensitivity.
The study suggests that with more data, especially from minority groups, the models could become even more accurate in
predicting CHD risk.
Keywords :
Coronary Heart Disease, AI, Machine Learning, Deep Learning, Exploratory Data Analysis, Diagnostic Algorithms Logistic regression, K-NN, Decision Trees, SVM, Random Forest.