Authors :
İlker Çakar; Muhammed Kürşad UÇAR
Volume/Issue :
Volume 9 - 2024, Issue 10 - October
Google Scholar :
https://tinyurl.com/mtw5m2rc
Scribd :
https://tinyurl.com/bdh93szh
DOI :
https://doi.org/10.38124/ijisrt/IJISRT24OCT1557
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Breast cancer is among the most prevalent
diseases encountered among women worldwide. Early
diagnosis of breast cancer is crucial for the treatment of
the disease. Detecting the disease at an early stage
prevents deaths resulting from the condition. Recently,
computer-aided systems have been developed to ensure
early-stage diagnosis and accuracy of breast cancer.
Computer-aided systems developed with machine
learning approaches significantly contribute to the
process of diagnosing breast cancer. The aim of this study
is to propose a new classification system based on
machine learning algorithms developed for the diagnosis
of breast cancer. In this study, sub-data sets were created
by reducing features, and data cleaning processes were
applied. After these procedures, stages such as feature
selection and feature extraction were applied. In this
study, classification processes such as Ensemble, k-
Nearest Neighbors (kNN), Support Vector Machines
(SVMs), and Hybrid Artificial Intelligence were used in
line with machine learning. With the obtained results, a
Breast Cancer diagnosis algorithm was created.
Performance evaluation criteria such as accuracy rate,
specificity, sensitivity, kappa number and F-Measure
were applied to the created algorithms. In the results
obtained in this study, the highest accuracy rate was
found to be 99.3% with the Ensemble method, the highest
specificity rate was 98.7% with the Ensemble method,
and the highest sensitivity rate was found to be 100%
with many methods. In light of these results, it was
observed that the machine learning algorithms used in
this study, implemented in the Matlab environment, were
effective. Consequently, it was proven that higher
accuracy, specificity, and sensitivity rates can be found
with different machine learning techniques. This also
demonstrates that the study in our article is a reliable one
in detecting diseased and healthy individuals in the
diagnosis of breast cancer, showing that it is a more
applicable and feasible study in the healthcare field.
Keywords :
Breast Cancer Diagnosis; Machine Learning; Ensemble Method; Performance Review; Hybrit Artificial Intelligence.
References :
- “What Is Breast Cancer? | CDC.” Accessed: Jan. 14, 2024. [Online]. Available: https://www.cdc.gov/cancer/breast/basic_info/what-is-breast-cancer.htm
- M. Z. Islam, M. M. Islam, and A. Asraf, “A combined deep CNN-LSTM network for the detection of novel coronavirus (COVID-19) using X-ray images,” Informatics Med. Unlocked, vol. 20, Jan. 2020, doi: 10.1016/j.imu.2020.100412.
- “Breast Cancer Facts and Statistics 2024.” Accessed: Jan. 15, 2024. [Online]. Available: https://www.breastcancer.org/facts-statistics?gad_source=1&gclid=CjwKCAiAzJOtBhALEiwAtwj8tlbQuo59n0mvpqVNs4YuzG07eSYQa53w4PbnkQQYEyqnfQyC5Nq41hoCSMIQAvD_BwE
- “What Is Breast Cancer? | American Cancer Society | American Cancer Society.” Accessed: Jan. 15, 2024. [Online]. Available: https://www.cancer.org/cancer/types/breast-cancer/about/what-is-breast-cancer.html
- M. Fan et al., “Joint Prediction of Breast Cancer Histological Grade and Ki-67 Expression Level Based on DCE-MRI and DWI Radiomics,” IEEE J. Biomed. Heal. Informatics, vol. 24, no. 6, pp. 1632–1642, Jun. 2020, doi: 10.1109/JBHI.2019.2956351.
- F. Bray, J. Ferlay, I. Soerjomataram, R. L. Siegel, L. A. Torre, and A. Jemal, “Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” CA. Cancer J. Clin., vol. 68, no. 6, pp. 394–424, Nov. 2018, doi: 10.3322/caac.21492.
- “Breast cancer.” Accessed: Jan. 15, 2024. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/breast-cancer
- “Content on Early Breast Cancer.” Accessed: Jan. 26, 2024. [Online]. Available: https://www.webmd.com/breast-cancer/toc-early-breast-cancer
- “Breast Cancer Treatment Options - National Breast Cancer Foundation.” Accessed: Jan. 15, 2024. [Online]. Available: https://www.nationalbreastcancer.org/breast-cancer-treatment/
- V. Chaurasia, S. Pal, and B. B. Tiwari, “Prediction of benign and malignant breast cancer using data mining techniques,” J. Algorithms Comput. Technol., vol. 12, no. 2, pp. 119–126, Jun. 2018, doi: 10.1177/1748301818756225.
- K. Cheng, J. Wang, J. Liu, X. Zhang, Y. Shen, and H. Su, “Public health implications of computer-aided diagnosis and treatment technologies in breast cancer care,” AIMS Public Heal., vol. 10, no. 4, p. 867, 2023, doi: 10.3934/PUBLICHEALTH.2023057.
- E. H. Houssein, M. M. Emam, A. A. Ali, and P. N. Suganthan, “Deep and machine learning techniques for medical imaging-based breast cancer: A comprehensive review,” Apr. 01, 2021, Elsevier Ltd. doi: 10.1016/j.eswa.2020.114161.
- V. J. Kadam, S. M. Jadhav, and K. Vijayakumar, “Breast Cancer Diagnosis Using Feature Ensemble Learning Based on Stacked Sparse Autoencoders and Softmax Regression,” J. Med. Syst., vol. 43, no. 8, Aug. 2019, doi: 10.1007/s10916-019-1397-z.
- S. I. Ayon, M. M. Islam, and M. R. Hossain, “Coronary Artery Heart Disease Prediction: A Comparative Study of Computational Intelligence Techniques,” IETE J. Res., vol. 68, no. 4, pp. 2488–2507, 2022, doi: 10.1080/03772063.2020.1713916.
- L. J. Muhammad, M. M. Islam, S. S. Usman, and S. I. Ayon, “Predictive Data Mining Models for Novel Coronavirus (COVID-19) Infected Patients’ Recovery,” SN Comput. Sci., vol. 1, no. 4, Jul. 2020, doi: 10.1007/s42979-020-00216-w.
- M. R. Haque, M. M. Islam, H. Iqbal, M. Sumon Reza, and M. K. Hasan, “Performance Evaluation of Random Forests and Artificial Neural Networks for the Classification of Liver Disorder.”
- S. Islam Ayon and M. Milon Islam, “Diabetes Prediction: A Deep Learning Approach,” Int. J. Inf. Eng. Electron. Bus., vol. 11, no. 2, pp. 21–27, Mar. 2019, doi: 10.5815/ijieeb.2019.02.03.
- M. F. Ak, “A comparative analysis of breast cancer detection and diagnosis using data visualization and machine learning applications,” Healthc., vol. 8, no. 2, 2020, doi: 10.3390/healthcare8020111.
- R. C. Conceição et al., “Classification of breast tumor models with a prototype microwave imaging system,” Med. Phys., vol. 47, no. 4, pp. 1860–1870, Apr. 2020, doi: 10.1002/mp.14064.
- D. Muduli, R. Dash, and B. Majhi, “Automated breast cancer detection in digital mammograms: A moth flame optimization based ELM approach,” Biomed. Signal Process. Control, vol. 59, May 2020, doi: 10.1016/j.bspc.2020.101912.
- Z. Huang and D. Chen, “A Breast Cancer Diagnosis Method Based on VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm,” IEEE Access, vol. 10, pp. 3284–3293, 2022, doi: 10.1109/ACCESS.2021.3139595.
- A. Kumar Jakhar, M. Singh, and A. Gupta, “SELF: A Stacked-based Ensemble Learning Framework for Breast Cancer Classiication SELF: A Stacked-based Ensemble Learning Framework for Breast Cancer Classification,” 2022, doi: 10.21203/rs.3.rs-2013877/v1.
- A. F. M. Agarap, “On breast cancer detection: An application of machine learning algorithms on the Wisconsin diagnostic dataset,” in ACM International Conference Proceeding Series, Association for Computing Machinery, Feb. 2018, pp. 5–9. doi: 10.1145/3184066.3184080.
- A. U. Haq et al., “Detection of Breast Cancer through Clinical Data Using Supervised and Unsupervised Feature Selection Techniques,” IEEE Access, vol. 9, pp. 22090–22105, 2021, doi: 10.1109/ACCESS.2021.3055806.
- U. Naseem et al., “An Automatic Detection of Breast Cancer Diagnosis and Prognosis Based on Machine Learning Using Ensemble of Classifiers,” IEEE Access, vol. 10, pp. 78242–78252, 2022, doi: 10.1109/ACCESS.2022.3174599.
- S. Alghunaim and H. H. Al-Baity, “On the Scalability of Machine-Learning Algorithms for Breast Cancer Prediction in Big Data Context,” IEEE Access, vol. 7, pp. 91535–91546, 2019, doi: 10.1109/ACCESS.2019.2927080.
- M. K. Uçar, M. R. Bozkurt, C. Bilgin, and K. Polat, “Automatic detection of respiratory arrests in OSA patients using PPG and machine learning techniques,” Neural Comput. Appl., vol. 28, no. 10, pp. 2931–2945, Oct. 2017, doi: 10.1007/s00521-016-2617-9.
- M. K. Uçar, M. R. Bozkurt, C. Bilgin, and K. Polat, “Automatic sleep staging in obstructive sleep apnea patients using photoplethysmography, heart rate variability signal and machine learning techniques,” Neural Comput. Appl., vol. 29, no. 8, pp. 1–16, Apr. 2018, doi: 10.1007/s00521-016-2365-x.
- M. D. Ganggayah, N. A. Taib, Y. C. Har, P. Lio, and S. K. Dhillon, “Predicting factors for survival of breast cancer patients using machine learning techniques,” BMC Med. Inform. Decis. Mak., vol. 19, no. 1, pp. 1–17, 2019, doi: 10.1186/s12911-019-0801-4.
- D. A. Omondiagbe, S. Veeramani, and A. S. Sidhu, “Machine Learning Classification Techniques for Breast Cancer Diagnosis,” IOP Conf. Ser. Mater. Sci. Eng., vol. 495, no. 1, 2019, doi: 10.1088/1757-899X/495/1/012033.
- V. Chaurasia and S. Pal, “Applications of Machine Learning Techniques to Predict Diagnostic Breast Cancer,” SN Comput. Sci., vol. 1, no. 5, 2020, doi: 10.1007/s42979-020-00296-8.
- Y. Khourdifi and M. Bahaj, “Applying best machine learning algorithms for breast cancer prediction and classification,” 2018 Int. Conf. Electron. Control. Optim. Comput. Sci. ICECOCS 2018, pp. 1–5, 2019, doi: 10.1109/ICECOCS.2018.8610632.
- H. Asri, H. Mousannif, H. Al Moatassime, and T. Noel, “Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis,” Procedia Comput. Sci., vol. 83, no. Fams, pp. 1064–1069, 2016, doi: 10.1016/j.procs.2016.04.224.
- E. A. Bayrak, P. Kirci, and T. Ensari, “Comparison of machine learning methods for breast cancer diagnosis,” 2019 Sci. Meet. Electr. Biomed. Eng. Comput. Sci. EBBT 2019, pp. 4–6, 2019, doi: 10.1109/EBBT.2019.8741990.
- Y. Shinde, A. Kenchappagol, and S. Mishra, “Comparative Study of Machine Learning Algorithms for Breast Cancer Classification,” Smart Innov. Syst. Technol., vol. 286, pp. 545–554, 2022, doi: 10.1007/978-981-16-9873-6_49.
- “Breast Cancer.” Accessed: May 08, 2024. [Online]. Available: https://www.kaggle.com/datasets/reihanenamdari/breast-cancer/data
- M. K. UÇAR, M. R. BOZKURT, and C. BİLGİN, “Elektrokardiyogram Sinyalinin Uyku / Uyanıklık Evreleri için İstatistiksel Olarak İncelenmesi,” Süleyman Demirel Üniversitesi Fen Bilim. Enstitüsü Derg., vol. 24, no. 2, pp. 502–507, Aug. 2020, doi: 10.19113/sdufenbed.555651.
- E. Kartal, “S ı n ı fland ı rmaya Dayal ı Makine Ö ğ renmesi Teknikleri ve Kardiyolojik Risk De ğ erlendirmesine İ li ş kin Bir Uygulama,” no. January, 2015.
- M. K. UÇAR, “Eta Correlation Coefficient Based Feature Selection Algorithm for Machine Learning: E-Score Feature Selection Algorithm,” J. Intell. Syst. Theory Appl., vol. 2, no. 1, pp. 7–12, 2019, doi: 10.38016/jista.498799.
- “‘Ensemble Karar Ağaçları-Ensemble Decision Trees (DT) / Regresyon Algoritması’ by İLYAS BERK FIRSAT on Prezi.” Accessed: Mar. 28, 2024. [Online]. Available: https://prezi.com/p/vzfhtellvaz4/ensemble-karar-agaclar-ensemble-decision-trees-dt-regresyon-algoritmas/
- “Makine Öğrenimi Bölüm-5 (Karar Ağaçları).” Accessed: Mar. 27, 2024. [Online]. Available: https://www.linkedin.com/pulse/makine-öğrenimi-bölüm-5-karar-ağaçları-eyüp-kaan-ülgen/?originalSubdomain=tr
- “KNN (K-En Yakin Komsu). KNN algoritması, son derece basit… | by ABDULLAH ATCILI | Machine Learning Turkiye | Medium.” Accessed: Mar. 28, 2024. [Online]. Available: https://medium.com/machine-learning-türkiye/knn-k-en-yakın-komşu-7a037f056116
- “k-en Yakın Komşu Algoritması ve Bir Uygulama (Kredi Riskini Sınıflandırma) - PDF Free Download.” Accessed: Mar. 27, 2024. [Online]. Available: https://docplayer.biz.tr/15448102-K-en-yakin-komsu-algoritmasi-ve-bir-uygulama-kredi-riskini-siniflandirma.html
- B. H. Aymen Fathalla Alhasadi, “PREDICTING BREAST CANCER BY USING ARTIFICIAL NEURAL NETWORK A MASTER’S THESIS,” 2016.
- M. Kürşad, “OBSTRÜKTİF UYKU APNE TEŞHİSİ İÇİN MAKİNE ÖĞRENMESİ TABANLI YENİ BİR YÖNTEM GELİŞTİRİLMESİ DOKTORA TEZİ.”
- Y. Wang, Y. M. Chu, A. Thaljaoui, Y. A. Khan, W. Chammam, and S. Z. Abbas, “A multi-feature hybrid classification data mining technique for human-emotion,” BioData Min., vol. 14, no. 1, Dec. 2021, doi: 10.1186/s13040-021-00254-x.
- S. Sharma, A. Aggarwal, and T. Choudhury, “Breast Cancer Detection Using Machine Learning Algorithms,” Proc. Int. Conf. Comput. Tech. Electron. Mech. Syst. CTEMS 2018, no. Ml, pp. 114–118, 2018, doi: 10.1109/CTEMS.2018.8769187.
- M. Kaya Keleş, “Breast cancer prediction and detection using data mining classification algorithms: A comparative study,” Teh. Vjesn., vol. 26, no. 1, pp. 149–155, 2019, doi: 10.17559/TV-20180417102943.
- S. Bharati, M. A. Rahman, and P. Podder, “Breast cancer prediction applying different classification algorithm with comparative analysis using WEKA,” 4th Int. Conf. Electr. Eng. Inf. Commun. Technol. iCEEiCT 2018, pp. 581–584, 2019, doi: 10.1109/CEEICT.2018.8628084.
- A. Bharat, N. Pooja, and R. A. Reddy, “Using Machine Learning algorithms for breast cancer risk prediction and diagnosis,” 2018 IEEE 3rd Int. Conf. Circuits, Control. Commun. Comput. I4C 2018, no. x, pp. 1–4, 2018, doi: 10.1109/CIMCA.2018.8739696.
- N. Fatima, L. Liu, S. Hong, and H. Ahmed, “Prediction of Breast Cancer, Comparative Review of Machine Learning Techniques, and Their Analysis,” IEEE Access, vol. 8, pp. 150360–150376, 2020, doi: 10.1109/ACCESS.2020.3016715.
- M. Abdar et al., “A new nested ensemble technique for automated diagnosis of breast cancer,” Pattern Recognit. Lett., vol. 132, pp. 123–131, Apr. 2020, doi: 10.1016/j.patrec.2018.11.004.
- A. LG and E. AT, “Using Three Machine Learning Techniques for Predicting Breast Cancer Recurrence,” J. Heal. Med. Informatics, vol. 04, no. 02, 2013, doi: 10.4172/2157-7420.1000124.
- S. K. Maliha, R. R. Ema, S. K. Ghosh, H. Ahmed, M. R. J. Mollick, and T. Islam, “Cancer Disease Prediction Using Naive Bayes,K-Nearest Neighbor and J48 algorithm,” 2019 10th Int. Conf. Comput. Commun. Netw. Technol. ICCCNT 2019, pp. 1–7, 2019, doi: 10.1109/ICCCNT45670.2019.8944686.
- M. H. Memon, J. P. Li, A. U. Haq, M. H. Memon, W. Zhou, and R. Lacuesta, “Breast Cancer Detection in the IOT Health Environment Using Modified Recursive Feature Selection,” Wirel. Commun. Mob. Comput., vol. 2019, 2019, doi: 10.1155/2019/5176705.
- A. A. Said, L. A. Abd-Elmegid, S. Kholeif, and A. A. Gaber, “Classification based on clustering model for predicting main outcomes of breast cancer using Hyper-Parameters Optimization,” Int. J. Adv. Comput. Sci. Appl., vol. 9, no. 12, pp. 268–273, 2018, doi: 10.14569/IJACSA.2018.091239.
- P. Israni, “Breast cancer diagnosis (BCD) model using machine learning,” Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 10, pp. 4456–4463, 2019, doi: 10.35940/ijitee.J9973.0881019.
- S. N. Singh and S. Thakral, “Using data mining tools for breast cancer prediction and analysis,” 2018 4th Int. Conf. Comput. Commun. Autom. ICCCA 2018, pp. 1–4, 2018, doi: 10.1109/CCAA.2018.8777713.
- A. Al Bataineh, “A comparative analysis of nonlinear machine learning algorithms for breast cancer detection,” Int. J. Mach. Learn. Comput., vol. 9, no. 3, pp. 248–254, Jun. 2019, doi: 10.18178/ijmlc.2019.9.3.794.
Breast cancer is among the most prevalent
diseases encountered among women worldwide. Early
diagnosis of breast cancer is crucial for the treatment of
the disease. Detecting the disease at an early stage
prevents deaths resulting from the condition. Recently,
computer-aided systems have been developed to ensure
early-stage diagnosis and accuracy of breast cancer.
Computer-aided systems developed with machine
learning approaches significantly contribute to the
process of diagnosing breast cancer. The aim of this study
is to propose a new classification system based on
machine learning algorithms developed for the diagnosis
of breast cancer. In this study, sub-data sets were created
by reducing features, and data cleaning processes were
applied. After these procedures, stages such as feature
selection and feature extraction were applied. In this
study, classification processes such as Ensemble, k-
Nearest Neighbors (kNN), Support Vector Machines
(SVMs), and Hybrid Artificial Intelligence were used in
line with machine learning. With the obtained results, a
Breast Cancer diagnosis algorithm was created.
Performance evaluation criteria such as accuracy rate,
specificity, sensitivity, kappa number and F-Measure
were applied to the created algorithms. In the results
obtained in this study, the highest accuracy rate was
found to be 99.3% with the Ensemble method, the highest
specificity rate was 98.7% with the Ensemble method,
and the highest sensitivity rate was found to be 100%
with many methods. In light of these results, it was
observed that the machine learning algorithms used in
this study, implemented in the Matlab environment, were
effective. Consequently, it was proven that higher
accuracy, specificity, and sensitivity rates can be found
with different machine learning techniques. This also
demonstrates that the study in our article is a reliable one
in detecting diseased and healthy individuals in the
diagnosis of breast cancer, showing that it is a more
applicable and feasible study in the healthcare field.
Keywords :
Breast Cancer Diagnosis; Machine Learning; Ensemble Method; Performance Review; Hybrit Artificial Intelligence.