Estimating Ten-Year Coronary Heart Disease Risk through Machine Learning Techniques


Authors : Amera M. Brash; Dr. Mohamed Dweib

Volume/Issue : Volume 10 - 2025, Issue 1 - January


Google Scholar : https://tinyurl.com/4swmfajd

Scribd : https://tinyurl.com/565uf49b

DOI : https://doi.org/10.5281/zenodo.14769413


Abstract : This study investigates estimating ten-year Coronary Heart Disease risk (CHD). It found that age and systolic blood pressure play crucial roles in making these predictions. Among various models tested, the Support Vector Machine (SVM) classifier performed best, showing high accuracy and F1 score. Although decision tree demonstrated slightly better accuracy, the SVM outperformed other models in most metrics. Balancing the dataset using SMOTE improved sensitivity. The study suggests that with more data, especially from minority groups, the models could become even more accurate in predicting CHD risk.

Keywords : Coronary Heart Disease, AI, Machine Learning, Deep Learning, Exploratory Data Analysis, Diagnostic Algorithms Logistic regression, K-NN, Decision Trees, SVM, Random Forest.

References :

  1. World Health Organization Cardiovascular Diseases (CVDs) [(accessed on 10 January 2022)]. Available online: https://www.afro.who.int/health-topics/cardiovascular-diseases
  2. [(accessed on 10 January 2022)]. Available online: https://www.heart.org/en/health-topics/high-blood-pressure/why-high-blood-pressure-is-a-silent-killer/know-your-risk-factors-for-high-blood-pressure
  3. Alaa, A. M., Bolton, T., Di Angelantonio, E., Rudd, J. H. F., & van der Schaar, M. (2019). Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants. PloS One, 14(5), e0213653. https://doi.org/10.1371/journal.pone.0213653
  4. Palaniappan, S., & Awang, R. (2008). Intelligent heart disease prediction system using data mining techniques. 2008 IEEE/ACS International Conference on Computer Systems and Applications.
  5. Kumar Dubey, A., Choudhary, K., & Sharma, R. (2021). Predicting heart disease based on influential features with machine learning. Intelligent Automation & Soft Computing, 30(3), 929–943. https://doi.org/10.32604/iasc.2021.018382
  6. (N.d.). Nih.gov. Retrieved October 15, 2024, from https://www.ncbi.nlm.nih.gov/pmc/
  7. Sk, K. B., Roja, Priya, S. S., Dalavi, L., Vellela, S. S., & Reddy, V. (2023). Coronary heart disease prediction and classification using hybrid machine learning algorithms. 2023 International Conference on Innovative Data Communication Technologies and Application (ICIDCA), 7, 1–7.
  8. https://doi.org/10.1109/ICIDCA56705.2023.10099579
  9. Melillo, P., De Luca, N., Bracale, M., & Pecchia, L. (2013). Classification tree for risk assessment in patients suffering from congestive heart failure via long-term heart rate variability. IEEE Journal of Biomedical and Health Informatics, 17(3), 727–733. https://doi.org/10.1109/JBHI.2012.2236579
  10. Rahhal, M. M. A., Bazi, Y., AlHichri, H., Alajlan, N., Melgani, F., & Yager, R. R. (2016). Deep learning approach for active classification of electrocardiogram signals. Information Sciences, 345, 340–354. https://doi.org/10.1016/j.ins.2016.01.082
  11. Guidi, G., Pettenati, M. C., Melillo, P., & Iadanza, E. (2014). A machine learning system to improve heart failure patient assistance. IEEE Journal of Biomedical and Health Informatics, 18(6), 1750–1756. https://doi.org/10.1109/jbhi.2014.2337752
  12. Keogh, E., & Mueen, A. (2017). Curse of dimensionality. In Encyclopedia of Machine Learning and Data Mining (pp. 314–315). Springer US.
  13. Wettschereck, D., & Dietterich, T. G. (1995). Machine Learning, 19(1), 5–27. https://doi.org/10.1023/a:1022603022740
  14. Wettschereck, D., Aha, D. W., & Mohri, T. (1997). Artificial Intelligence Review, 11(1/5), 273–314. https://doi.org/10.1023/a:1006593614256
  15. Dun, B., Wang, E., & Majumder, S. (2016). Heart disease diagnosis on medical data using ensemble learning. Proceedings of the International Conference on Big Data, 1–6.
  16. Singh, R. S., Saini, B. S., & Sunkaria, R. K. (2018). Detection of coronary artery disease by reduced features and extreme learning machine. Medicine and Pharmacy Reports, 91(2), 166–175. https://doi.org/10.15386/cjmed-882
  17. Yaghouby, F., Ayatollahi, A., & Soleimani, R. (2009). Classification of cardiac abnormalities using reduced features of heart rate variability signal. World Applied Sciences Journal, 6(11), 1547–1554.
  18. Asl, B. M., Setarehdan, S. K., & Mohebbi, M. (2008). Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal. Artificial Intelligence in Medicine, 44(1), 51–64. https://doi.org/10.1016/j.artmed.2008.04.007
  19. Guyon, I., Gunn, S., Nikravesh, M., & Zadeh, L. A. (2008). Feature extraction: Foundations and applications. Springer.
  20. Rajagopal, R., & Ranganathan, V. (2017). Evaluation of effect of unsupervised dimensionality reduction techniques on automated arrhythmia classification. Biomedical Signal Processing and Control, 34, 1–8. https://doi.org/10.1016/j.bspc.2016.12.017
  21. Kamencay, P., Hudec, R., Benco, M., & Zachariasova, M. (2013). Feature extraction for object recognition using PCA-KNN with application to medical image analysis. 2013 36th International Conference on Telecommunications and Signal Processing (TSP).
  22. UCI Machine Learning Repository. (2020). Heart disease data set. http://archive.ics.uci.edu/ml/datasets/heart+disease
  23. Khan, S. S., & Quadri, S. M. K. (2016). Prediction of angiographic disease status using rule-based data mining techniques. Research Trend. Retrieved from http://www.researchtrend.net
  24. Throughout life, heart attacks are twice as common in men than women. (2016, November 8). Harvard Health. https://www.health.harvard.edu/heart-health/throughout-life-heart-attacks-are-twice-as-common-in-men-than-women
  25. Trigka, M., & Dritsas, E. (2023). Long-term coronary artery disease risk prediction with machine learning models. Sensors (Basel, Switzerland), 23(3), 1193. https://doi.org/10.3390/s23031193
  26. Hammoud, A., Karaki, A., Tafreshi, R., Abdulla, S., & Wahid, M. (2024). Coronary heart disease prediction: A comparative study of machine learning algorithms. Journal of Advances in Information Technology, 15(1), 27–32. https://doi.org/10.12720/jait.15.1.27-32

This study investigates estimating ten-year Coronary Heart Disease risk (CHD). It found that age and systolic blood pressure play crucial roles in making these predictions. Among various models tested, the Support Vector Machine (SVM) classifier performed best, showing high accuracy and F1 score. Although decision tree demonstrated slightly better accuracy, the SVM outperformed other models in most metrics. Balancing the dataset using SMOTE improved sensitivity. The study suggests that with more data, especially from minority groups, the models could become even more accurate in predicting CHD risk.

Keywords : Coronary Heart Disease, AI, Machine Learning, Deep Learning, Exploratory Data Analysis, Diagnostic Algorithms Logistic regression, K-NN, Decision Trees, SVM, Random Forest.

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe