Authors :
Abbas Sani; Bachcha Lal Pal; Ajay Singh Dhabariya; Faisal Rasheed; Asifa Shah; Usman Haruna; Babangida Salis Mu'az; Jamilu Habu
Volume/Issue :
Volume 9 - 2024, Issue 9 - September
Google Scholar :
https://tinyurl.com/4jmmh38f
Scribd :
https://tinyurl.com/5n6zxex3
DOI :
https://doi.org/10.38124/ijisrt/IJISRT24SEP367
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
This study provides a methodical overview of
deep learning (DL) applications in data mining,
encompassing the datasets, methods, and methodologies
used in various fields. Through the use of targeted
keywords in numerous scientific archives, a significant
number of papers was found, sorted, and examined in
order to chart the development of deep learning in data
mining from its birth to the present state. The fully draws
attention to the rising number of papers, which indicates
that there is increased interest in using DL to difficult
data processing tasks.
The incorporation of deep learning techniques is the
main emphasis of the paper's discussion of the history and
relevant work in machine learning and data mining. It
investigates the use of DL in several application areas,
including the detection of financial trouble, the analysis of
crime data, and educational data mining, showcasing the
versatility of these methods across industries.
The methodology section details the data different
collection process and also the systematic approach used
to review and analyze the literature. The paper provides
an in-depth analysis of different data mining techniques,
including classification, clustering, regression, and
dimensionality reduction, and presents example use cases
for each one among them.
Furthermore, the paper examines the role of deep
learning in enhancing data mining tasks, offering insights
into the architectures and configurations of neural
networks. It presents a comparative study of machine
learning and deep learning, figuring out the advantages of
DL in handling complex and unstructured data.
At the end, the paper concludes that future
directions for research, emphasizing the potential of DL
to address challenges in big data analytics and the need
for continued exploration of its applications in data
mining.
Keywords :
Deep Learning, Data Mining, Machine Learning, Neural Networks, Big Data, Systematic Review.
References :
- Abbas, S., Pal, B. L., S., A., R., F., S., A., U., H., Mua’az, B., & A. Y., A. (2024). Comprehensive Review on Natural Language Generation for Automated Report Writing in Finance. British Journal of Computer, Networking and Information Technology, 7(3), 85–93. https://doi.org/10.52589/BJCNIT-ELBOL7TY
- Abdullah, D. A., & AL-Anber, N. J. (2023). Implement data mining and deep learning techniques to detect financial distress. 020009. https://doi.org/10.1063/5.0119272
- Alzubaidi, L., Zhang, J., Humaidi, A. J., Al-Dujaili, A., Duan, Y., Al-Shamma, O., Santamaría, J., Fadhel, M. A., Al-Amidie, M., & Farhan, L. (2021). Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. Journal of Big Data, 8(1), 53. https://doi.org/10.1186/s40537-021-00444-8
- Ateş, E. C. (2021). Big Data, Data Mining, Machine Learning, and Deep Learning Concepts in Crime Data. Journal of Penal Law & Criminology, 293–319. https://doi.org/10.26650/JPLC2020-813328
- Azure. (2024, January 19). Deep learning vs. Machine learning in Azure Machine Learning [Online post]. https://learn.microsoft.com/en-us/azure/machine-learning/concept-deep-learning-vs-machine-learning?view=azureml-api-2
- Baruah, A. J., Goswami, J., Bora, D. J., & Baruah, S. (2022). A Comparative Research of Different Classification Algorithms. In J. S. Raj, R. Palanisamy, I. Perikos, & Y. Shi (Eds.), Intelligent Sustainable Systems (Vol. 213, pp. 631–646). Springer Singapore. https://doi.org/10.1007/978-981-16-2422-3_50
- Bhurre, S., Raikwar, S., Prajapat, S., & Pathak, D. (2024). Analyzing and Comparing Clustering Algorithms for Student Academic Data. In N. Naik, P. Jenkins, P. Grace, L. Yang, & S. Prajapat (Eds.), Advances in Computational Intelligence Systems (Vol. 1453, pp. 640–651). Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-47508-5_49
- Brownlee, jason. (2021, January 1). 7 Time Series Datasets for Machine Learning [Online post].
- Chahal*, A., & Gulia, P. (2019). Machine Learning and Deep Learning. International Journal of Innovative Technology and Exploring Engineering, 8(12), 4910–4914. https://doi.org/10.35940/ijitee.L3550.1081219
- Chen, B., Haas, P., & Scheuermann, P. (2002). A new two-phase sampling based algorithm for discovering association rules. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 462–468. https://doi.org/10.1145/775047.775114
- Cohen, I. (2024, January 2). What is Anomaly Detection? Examining the Essentials [Online post]. https://www.anodot.com/blog/what-is-anomaly-detection/
- Denfeld, Q. E., Burger, D., & Lee, C. S. (2023). Survival analysis 101: An easy start guide to analysing time-to-event data. European Journal of Cardiovascular Nursing, 22(3), 332–337. https://doi.org/10.1093/eurjcn/zvad023
- El Guabassi, I., Bousalem, Z., Marah, R., & Qazdar, A. (2021). Forecasting Students’ Academic Performance Using Different Regression Algorithms. In S. Motahhir & B. Bossoufi (Eds.), Digital Technologies and Applications (Vol. 211, pp. 221–231). Springer International Publishing. https://doi.org/10.1007/978-3-030-73882-2_21
- Erlandsson, F., Bródka, P., Borg, A., & Johnson, H. (2016). Finding Influential Users in Social Media Using Association Rule Learning. Entropy, 18(5), 164. https://doi.org/10.3390/e18050164
- Gaurav. (2024). 5 Regression Algorithms You Should Know: Introductory Guide [Online post]. https://www.analyticsvidhya.com/blog/2021/05/5-regression-algorithms-you-should-know-introductory-guide/
- Guruvayur, R. G., & R, Dr. R. (2017). A DETAILED STUDY ON MACHINE LEARNING TECHNIQUES FOR DATA MINING. IEEE. https://telcobuddy.ai/img/resources/3.pdf
- Hernández-Blanco, A., Herrera-Flores, B., Tomás, D., & Navarro-Colorado, B. (2019). A Systematic Review of Deep Learning Approaches to Educational Data Mining. Complexity, 2019(1), 1306039. https://doi.org/10.1155/2019/1306039
- Mahawar, K., & Rattan, P. (2024). Empowering education: Harnessing ensemble machine learning approach and ACO-DT classifier for early student academic performance prediction. Education and Information Technologies. https://doi.org/10.1007/s10639-024-12976-6
- ohri, ajay. (2021, February 3). Text Mining Algorithms: A Comprehensive Overview (2021) [Online post]. https://u-next.com/blogs/data-science/text-mining-algorithms/
- Rodriguez, M. Z., Comin, C. H., Casanova, D., Bruno, O. M., Amancio, D. R., Costa, L. D. F., & Rodrigues, F. A. (2019). Clustering algorithms: A comparative approach. PLOS ONE, 14(1), e0210236. https://doi.org/10.1371/journal.pone.0210236
- Rosebrock, A. (2020, March 2). Anomaly detection with Keras, TensorFlow, and Deep Learning [Online post]. https://pyimagesearch.com/2020/03/02/anomaly-detection-with-keras-tensorflow-and-deep-learning/
- Sorzano, C. O. S., Vargas, J., & Montano, A. P. (2014). A survey of dimensionality reduction techniques (arXiv:1403.2877). arXiv. http://arxiv.org/abs/1403.2877
- Srikant, R., & Agrawal, R. (1996). Mining sequential patterns: Generalizations and performance improvements. In P. Apers, M. Bouzeghoub, & G. Gardarin (Eds.), Advances in Database Technology—EDBT ’96 (Vol. 1057, pp. 1–17). Springer Berlin Heidelberg. https://doi.org/10.1007/BFb0014140
- Talaei Khoei, T., Ould Slimane, H., & Kaabouch, N. (2023). Deep learning: Systematic review, models, challenges, and research directions. Neural Computing and Applications, 35(31), 23103–23124. https://doi.org/10.1007/s00521-023-08957-4
- Wiegrebe, S., Kopper, P., Sonabend, R., Bischl, B., & Bender, A. (2024). Deep learning for survival analysis: A review. Artificial Intelligence Review, 57(3), 65. https://doi.org/10.1007/s10462-023-10681-3
- Yin, H., Aryani, A., Petrie, S., Nambissan, A., Astudillo, A., & Cao, S. (2024). A Rapid Review of Clustering Algorithms (arXiv:2401.07389). arXiv. http://arxiv.org/abs/2401.07389
- Yosef, A., Roth, I., Shnaider, E., Baranes, A., & Schneider, M. (2024). Horizontal Learning Approach to Discover Association Rules. Computers, 13(3), 62. https://doi.org/10.3390/computers13030062
This study provides a methodical overview of
deep learning (DL) applications in data mining,
encompassing the datasets, methods, and methodologies
used in various fields. Through the use of targeted
keywords in numerous scientific archives, a significant
number of papers was found, sorted, and examined in
order to chart the development of deep learning in data
mining from its birth to the present state. The fully draws
attention to the rising number of papers, which indicates
that there is increased interest in using DL to difficult
data processing tasks.
The incorporation of deep learning techniques is the
main emphasis of the paper's discussion of the history and
relevant work in machine learning and data mining. It
investigates the use of DL in several application areas,
including the detection of financial trouble, the analysis of
crime data, and educational data mining, showcasing the
versatility of these methods across industries.
The methodology section details the data different
collection process and also the systematic approach used
to review and analyze the literature. The paper provides
an in-depth analysis of different data mining techniques,
including classification, clustering, regression, and
dimensionality reduction, and presents example use cases
for each one among them.
Furthermore, the paper examines the role of deep
learning in enhancing data mining tasks, offering insights
into the architectures and configurations of neural
networks. It presents a comparative study of machine
learning and deep learning, figuring out the advantages of
DL in handling complex and unstructured data.
At the end, the paper concludes that future
directions for research, emphasizing the potential of DL
to address challenges in big data analytics and the need
for continued exploration of its applications in data
mining.
Keywords :
Deep Learning, Data Mining, Machine Learning, Neural Networks, Big Data, Systematic Review.