Enhancing Flood Management Through Machine Learning: A Comprehensive Analysis of the CatBoost Application


Authors : Ogundolie O. I.; Olabiyisi S. O.; Ganiyu R. A; Jeremiah Y. S; Ogundolie F. A.

Volume/Issue : Volume 9 - 2024, Issue 6 - June


Google Scholar : https://tinyurl.com/44dpcynn

Scribd : https://tinyurl.com/2tu8pbna

DOI : https://doi.org/10.38124/ijisrt/IJISRT24JUN1770

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.


Abstract : River flooding is a major natural disaster that has caused enormous damage to our environment, infrastructure and human life. River flooding has led to flooding in river basins which has disrupted human activities and fatalities. This study is a review of river basin flooding, the impact of machine learning techniques in flood prediction in river basins, flood management in the past and the impact of machine learning in flood management. This review further examined how the Categorical boosting algorithm (CatBoost) which is a machine learning technique, could improve flood prediction in river basins and its applications in flood management. Several case studies of how CatBoost models have been used to predict flooding and enhance early warning systems were also reviewed in this study. CatBoost has been recognized to be excellent in working on categorical variables making it efficient in handling datasets with complex relationships. This makes it applicable for flood prediction in river basins considering the factors involved in flooding. CatBoost's effectiveness in flood forecasting and flood susceptibility modelling was demonstrated in some case studies. CatBoost has the potential to change flood management, minimize the disastrous impacts of floods, and enhance sustainable development, regardless of its limits. The review highlights the importance of machine learning to improve flood protection and the need for concerted efforts to get beyond implementation obstacles and take full advantage of CatBoost's flood management capabilities.

Keywords : Flooding, CatBoost, Flood Management, Machine Learning.

References :

  1. Abdi-Dehkordi, M., Bozorg-Haddad, O., Salavitabar, A., Mohammad-Azari, S., and Goharian, E. (2021). Development of flood mitigation strategies toward sustainable development. Natural hazards, 108(3), 2543-2567.
  2. Abedi, R., Costache, R., Shafizadeh-Moghadam, H., and Pham, Q. B. (2022). Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees. Geocarto International, 37(19), 5479-5496.
  3. Ajibade, F. O., Ajibade, T. F., Idowu, T. E., Nwogwu, N. A., Adelodun, B., Lasisi, K. H., and Adewumi, J. R. (2021). Flood‐prone area mapping using GIS‐based analytical hierarchy frameworks for Ibadan city, Nigeria. Journal of Multi‐Criteria Decision Analysis, 28(5-6), 283-295.
  4. Al-Kindi, K. M., and Alabri, Z. (2024). Investigating the Role of the Key Conditioning Factors in Flood Susceptibility Mapping Through Machine Learning Approaches. Earth Systems and Environment, 1-19.
  5. Ansari, M. S., Warner, J., Sukhwani, V., and Shaw, R. (2022). Implications of flood risk reduction interventions on community resilience: An assessment of community perception in Bangladesh. Climate, 10(2), 20.
  6. Boone, T., Ganeshan, R., Jain, A., and Sanders, N. R. (2019). Forecasting sales in the supply chain: Consumer analytics in the big data era. International journal of forecasting, 35(1), 170-180.
  7. Canadell, J. G., Monteiro, P. M., Costa, M. H., Cotrim da Cunha, L., Cox, P. M., Eliseev, A. V., and Zickfeld, K. (2023). Intergovernmental Panel on Climate Change (IPCC). Global carbon and other biogeochemical cycles and feedbacks. In Climate change 2021: The physical science basis. Contribution of working group I to the Sixth Assessment Report of the intergovernmental panel on climate change (pp. 673-816). Cambridge University Press.
  8. Chakraborty, S. K., and Chakraborty, S. K. (2021). Land-use changes: floodplains, dams, and reservoirs–integrated river basin management. Riverine Ecology Volume 2: Biodiversity Conservation, Conflicts and Resolution, 531-607.
  9. Chandrathilake, T. (2022). The need for ecohydrological research in tropical forests for healthy watersheds. Journal of Tropical Forestry and Environment, 12(02).
  10. Dierauer, J. R., Allen, D. M., and Whitfield, P. H. (2021). Climate change impacts on snow and streamflow drought regimes in four ecoregions of British Columbia. Canadian Water Resources Journal/Revue canadienne des ressources hydriques, 46(4), 168-193.
  11. Dorogush, A. V., Ershov, V., and Gulin, A. (2018). CatBoost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363.
  12. Glago, F. J. (2021). Flood disaster hazards; causes, impacts and management: a state-of-the-art review. Natural hazards-impacts, adjustments and resilience, 29-37.
  13. Hammami, S., Zouhri, L., Souissi, D., Souei, A., Zghibi, A., Marzougui, A., and Dlala, M. (2019). Application of the GIS based multi-criteria decision analysis and analytical hierarchy process (AHP) in the flood susceptibility mapping (Tunisia). Arabian Journal of Geosciences, 12, 1-16.
  14. Handayani, W., Chigbu, U. E., Rudiarto, I., and Putri, I. H. S. (2020). Urbanization and Increasing flood risk in the Northern Coast of Central Java—Indonesia: An assessment towards better land use policy and flood management. Land, 9(10), 343.
  15. Hooper, B. P., and Lloyd, G. J. (2011). Report on IWRM in transboundary basins. UNEP-DHI Cent. Water Environ.
  16. Huang, G., Wu, L., Ma, X., Zhang, W., Fan, J., Yu, X., Zeng, W. and Zhou, H., (2019). Evaluation of CatBoost method for prediction of reference evapotranspiration in humid regions. Journal of Hydrology, 574, 1029-1041.
  17. Jia, H., Chen, F., Pan, D., Du, E., Wang, L., Wang, N.,andYang, A. (2022). Flood risk management in the Yangtze River basin—Comparison of 1998 and 2020 events. International Journal of Disaster Risk Reduction, 68, 102724.
  18. Jonkman, S. N. (2005). Global perspectives on loss of human life caused by floods. Natural hazards, 34(2), 151-175.
  19. Kulkarni, C. S. (2022) Advancing Gradient Boosting: A Comprehensive Evaluation of the CatBoost Algorithm for Predictive Modeling. J Artif Intell Mach Learn & Data Sci 2022, 1(1), 54-57.
  20. Kumar, V., Kedam, N., Sharma, K. V., Khedher, K. M., and Alluqmani, A. E. (2023). A comparison of machine learning models for predicting rainfall in urban metropolitan Cities. Sustainability, 15(18), 13724.
  21. Kumar, V., Kedam, N., Sharma, K. V., Mehta, D. J., and Caloiero, T. (2023b). Advanced machine learning techniques to improve hydrological prediction: A comparative analysis of streamflow prediction models. Water, 15(14), 2572.
  22. Maskrey, S. A., Mount, N. J., & Thorne, C. R. (2022). Doing flood risk modelling differently: Evaluating the potential for participatory techniques to broaden flood risk management decision‐making. Journal of Flood Risk Management, 15(1), e12757.
  23. Merz, B., Blöschl, G., Vorogushyn, S., Dottori, F., Aerts, J. C., Bates, P., and Macdonald, E. (2021). Causes, impacts and patterns of disastrous river floods. Nature Reviews EarthandEnvironment, 2(9), 592-609.
  24. Molinari, D., Scorzini, A. R., Arrighi, C., Carisi, F., Castelli, F., Domeneghetti, A., ... and Ballio, F. (2020). Are flood damage models converging to “reality”? Lessons learnt from a blind test. Natural Hazards and Earth System Sciences, 20(11), 2997-3017.
  25. Mosavi, A., Ozturk, P.,and Chau, K. W. (2018). Flood prediction using machine learning models: Literature review. Water, 10(11), 1536.
  26. Mushtaq, B., Bandh, S. A., Shafi, S., Mushtaq, B., Bandh, S. A., and Shafi, S. (2020). Management of water resources. Environmental Management: Environmental Issues, Awareness and Abatement, 1-46.
  27. Nasirzadehdizaji, R., and Akyüz, D. E. (2022). Application of swat hydrological model to assess the impacts of land use change on sediment loads. International Journal of Agriculture Environment and Food Sciences, 6(1), 108-120.
  28. Nguyen, H. D., Dang, D. K., Nguyen, N. Y., Pham Van, C., Van Nguyen, T. T., Nguyen, Q. H., and Bui, Q. T. (2024). Integration of machine learning and hydrodynamic modeling to solve the extrapolation problem in flood depth estimation. Journal of Water and Climate Change, 15(1), 284-304.
  29. Peretz T. (2018). Mastering The New Generation of Gradient Boosting. https://towardsdatascience.com/https-medium-com-talperetz24-mastering-the-new-generation-of-gradient-boosting-db04062a7ea2 .Accessed on 29th December 2021
  30. Petrucci, O. (2022). Factors leading to the occurrence of flood fatalities: a systematic review of research papers published between 2010 and 2020. Natural hazards and earth system sciences, 22(1), 71-83.
  31. Prokešová, R., Horáčková, Š., and Snopková, Z. (2022). Surface runoff response to long-term land use changes: Spatial rearrangement of runoff-generating areas reveals a shift in flash flood drivers. Science of the Total Environment, 815, 151591.
  32. Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., and Gulin, A. (2018). CatBoost: unbiased boosting with categorical features. In Advances in neural information processing systems 6638-6648.
  33. Qian, Q. and Eslamian, S. (2022). Impact of urbanization on flooding. In Flood Handbook (pp. 97-112). CRC Press.
  34. Rajkhowa, S., and Sarma, J. (2021). Climate change and flood risk, global climate change. In Global climate change (pp. 321-339). Elsevier.
  35. Roy, P., Pal, S. C., Chakrabortty, R., Chowdhuri, I., Malik, S., and Das, B. (2020). Threats of climate and land use change on future flood susceptibility. Journal of Cleaner Production, 272, 122757.
  36. Saber, M., Boulmaiz, T., Guermoui, M., Abdrabo, K. I., Kantoush, S. A., Sumi, T., ... and Mabrouk, E. (2022). Examining LightGBM and CatBoost models for Wadi flash flood susceptibility prediction. Geocarto International, 37(25), 7462-7487.
  37. Saber, M., Boulmaiz, T., Guermoui, M., Abdrabo, K. I., Kantoush, S. A., Sumi, T., ... and Mabrouk, E. (2023). Enhancing flood risk assessment through integration of ensemble learning approaches and physical-based hydrological modeling. Geomatics, Natural Hazards and Risk, 14(1), 2203798.
  38. Saravanan, S., Abijith, D., Reddy, N. M., Parthasarathy, K. S. S., Janardhanam, N., Sathiyamurthi, S., and Sivakumar, V. (2023). Flood susceptibility mapping using machine learning boosting algorithms techniques in Idukki district of Kerala India. Urban Climate, 49, 101503.
  39. Sedighkia, M., Jahanshahloo, M., and Datta, B. (2023). Evaluating minimum environmental flow requirements in rivers: a combined decision-tree approach integrating hydrological, physical habitat, and water quality indexes. Journal of Sustainable Water in the Built Environment, 9(4), 04023006.
  40. Seydi, S. T., Kanani-Sadat, Y., Hasanlou, M., Sahraei, R., Chanussot, J., and Amani, M. (2022a). Comparison of machine learning algorithms for flood susceptibility mapping. Remote Sensing, 15(1), 192.
  41. Shrivastava, P., Smith, M. S., O’Brien, K., and Zsolnai, L. (2020). Transforming sustainability science to generate positive social and environmental change globally. One Earth, 2(4), 329-340.
  42. Szczepanek, R. (2022). Daily streamflow forecasting in the mountainous catchment using XGBoost, LightGBM and CatBoost. Hydrology, 9(12), 226.
  43. Tanoue, M., Taguchi, R., Nakata, S., Watanabe, S., Fujimori, S., and Hirabayashi, Y. (2020). Estimation of direct and indirect economic losses caused by a flood with long‐lasting inundation: application to the 2011 Thailand flood. Water Resources Research, 56(5), e2019WR026092.
  44. Tehrany, M. S., Pradhan, B., and Jebur, M. N. (2015). Flood susceptibility analysis and its verification using a novel ensemble support vector machine and frequency ratio method. Stochastic environmental research and risk assessment, 29(4), 1149-1165.
  45. Tounsi, A. (2023). Enhancing Flood Forecasting Systems Through the Integration of Machine Learning and Advanced Sensing and Modeling Techniques (Doctoral dissertation, Stevens Institute of Technology).
  46. Van Ackere, S., Verbeurgt, J., De Sloover, L., Gautama, S., De Wulf, A., & De Maeyer, P. (2019). A review of the internet of floods: Near real-time detection of a flood event and its impact. Water, 11(11), 2275.
  47. Van Phong, T., Nguyen, D. D., and Pham, B. T. (2023). Modeling and Mapping of Flood Susceptibility at Que Son District, Quang Nam Province, Vietnam using CatBoost. In IOP Conference Series: Materials Science and Engineering (Vol. 1289, No. 1, p. 012019). IOP Publishing.
  48. Wagenaar, D., Curran, A., Balbi, M., Bhardwaj, A., Soden, R., Hartato, E., ... and Lallemant, D. (2020). Invited perspectives: How machine learning will change flood risk and impact assessment. Natural hazards and earth system sciences, 20(4), 1149-1161.
  49. Wang, L., Cui, S., Li, Y., Huang, H., Manandhar, B., Nitivattananon, V., Fang, X., and Huang, W. (2022). A review of the flood management: From flood control to flood resilience. Heliyon, 8(11). https://doi.org/10.1016/j.heliyon.2022.e11763
  50. Wu, C., Ju, Y., Yang, S., Zhang, Z., and Chen, Y. (2023). Reconstructing annual XCO2 at a 1 km× 1 km spatial resolution across China from 2012 to 2019 based on a spatial CatBoost method. Environmental Research. [HTML]
  51. Wu, Y., Ju, H., Qi, P., Li, Z., Zhang, G., and Sun, Y. (2023). Increasing flood risk under climate change and social development in the Second Songhua River basin in Northeast China. Journal of Hydrology: Regional Studies, 48, 101459.
  52. Xia, J., and Chen, J. (2021). A new era of flood control strategies from the perspective of managing the 2020 Yangtze River flood. Science China Earth Sciences, 64(1), 1-9.
  53. Xiang, W., Xu, P., Fang, J., Zhao, Q., Gu, Z., and Zhang, Q. (2022). Multi-dimensional data-based medium-and long-term power-load forecasting using double-layer CatBoost. Energy Reports, 8, 8511-8522.
  54. Xu, Y., Rao, C., Xiao, X., and Hu, F. (2023). Novel Early-Warning Model for Customer Churn of Credit Card Based on GSAIBAS-CatBoost. CMES-Computer Modeling in Engineering and Sciences, 137(3).
  55. Yao, S., Du, W., Chen, N., Wang, C., & Chen, Z. (2023). An integrated process-based framework for flood phase segmentation and assessment. International Journal of Geographical Information Science, 37(6), 1315-1337.
  56. Zeleňáková, M., Gaňová, L., Purcz, P., and Satrapa, L. (2015). Methodology of flood risk assessment from flash floods based on hazard and vulnerability of the river basin. Natural Hazards, 79, 2055-2071.
  57. Zhao, D. H., Shen, H. W., Tabios III, G. Q., Lai, J. S., and Tan, W. Y. (1994). Finite-volume two-dimensional unsteady-flow model for river basins. Journal of Hydraulic Engineering, 120(7), 863-883.
  58. Zhao, Z., Wang, H., Zhang, Y., Deng, C., Xie, Q., and Wang, C. (2020). Problems and countermeasures of river management in the process of rapid urbanization in China. Water, 12(8), 2260.
  59. Zhong, W., Zhang, D., Sun, Y., and Wang, Q. (2023). A CatBoost-Based Model for the Intensity Detection of Tropical Cyclones over the Western North Pacific Based on Satellite Cloud Images. Remote Sensing, 15(14), 3510.

River flooding is a major natural disaster that has caused enormous damage to our environment, infrastructure and human life. River flooding has led to flooding in river basins which has disrupted human activities and fatalities. This study is a review of river basin flooding, the impact of machine learning techniques in flood prediction in river basins, flood management in the past and the impact of machine learning in flood management. This review further examined how the Categorical boosting algorithm (CatBoost) which is a machine learning technique, could improve flood prediction in river basins and its applications in flood management. Several case studies of how CatBoost models have been used to predict flooding and enhance early warning systems were also reviewed in this study. CatBoost has been recognized to be excellent in working on categorical variables making it efficient in handling datasets with complex relationships. This makes it applicable for flood prediction in river basins considering the factors involved in flooding. CatBoost's effectiveness in flood forecasting and flood susceptibility modelling was demonstrated in some case studies. CatBoost has the potential to change flood management, minimize the disastrous impacts of floods, and enhance sustainable development, regardless of its limits. The review highlights the importance of machine learning to improve flood protection and the need for concerted efforts to get beyond implementation obstacles and take full advantage of CatBoost's flood management capabilities.

Keywords : Flooding, CatBoost, Flood Management, Machine Learning.

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe