AQIP: Air Quality Index Prediction Using Supervised ML Classifiers


Authors : Nayan Adhikari; Pallabi Ghosh; Abhinaba Bhattacharyya; Siddhartha Chatterjee

Volume/Issue : Volume 10 - 2025, Issue 7 - July


Google Scholar : https://tinyurl.com/y5b2mj5v

Scribd : https://tinyurl.com/mspf79ua

DOI : https://doi.org/10.38124/ijisrt/25jul758

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Note : Google Scholar may take 30 to 40 days to display the article.


Abstract : In current years, Air pollution has emerged as a significant environmental concern. Accuracy modeling the complex relationships between air quality variables using advanced machine learning techniques is a promising area of research. The study aims to evaluate and compare the performance of supervised machine learning methods including Support Vector Regressor (SVR), Random Forest (RF), XGBoost, LightGBM for the prediction of air quality index. For the research, we collect a dataset from Kaggle. To assess the model performance, metrices such as root-mean-square-error (RMSE), Mean Absolute Error (MAE) and coefficient of determination (R2 ) were used. Experimental result showed how LightGBM model outperformed the others in AQI prediction (RMSE = 1.4704, R2 = 0.9987 and MAE = 0.1824). Furthermore, all models were evaluated using these metrices, offering a clear comparison that highlighted the factors contributing to the improved accuracy.

Keywords : Air Quality; Air Pollutant; Support Vector Regressor; Random Forest Regressor; XGBoost; LightGBM; Root-Mean- Squared-Error; Mean Absolute Error; Coefficient of Determination; Supervised Methos.

References :

  1. Liang, Y.-C., Maimury, Y., Chen, A. H.-L., &   Juarez, J. R. C. (2020). Machine Learning-Based Prediction of Air Quality. Applied Sciences, 10(24), 9151.
  2. Natarajan, S. K., Shanmurthy, P., Arockiam, D., Balusamy, B., & Selvarajan, S. (2024). Optimized machine learning model for air quality index prediction in major cities in India. Scientific Reports, 14(1), 6795.
  3. Guo, Z., Jing, X., Ling, Y., Yang, Y., Jing, N., Yuan, R., & Liu, Y. (2024). Optimized air quality management based on air quality index prediction and air pollutants identification in representative cities in China. Scientific Reports, 14(1), 17923.
  4. Patil, R. M., Dinde, H. T., Powar, S. K., & Ganeshkhind, P. M. (2020). A literature review on prediction of air quality index and forecasting ambient air pollutants using machine learning algorithms. International Journal of Innovative Science and Research Technology, 5(8), 1148-1152.
  5. Dragomir, E. G. (2010). Air quality index prediction using K-nearest neighbor technique. Bulletin of PG University of Ploiesti, Series Mathematics, Informatics, Physics, LXII, 1(2010), 103-108.
  6. Mani, G., & Viswanadhapalli, J. K. (2022). Prediction and forecasting of air quality index in Chennai using regression and ARIMA time series models. Journal of Engineering Research, 10(2A), 179-194.
  7. Umoh, M. D., Evans, U. F., & Utting, C. (2024). Air Quality Index Prediction Using Machine Learning Algorithms for Certain Locations in Nigeria. Journal of Environmental Science and Management, 25(2), 98-112.
  8. Pant, A., Sharma, S., & Pant, K. (2023). Evaluation of machine learning algorithms for air quality index (AQI) Prediction. Journal of Reliability and Statistical Studies, 229-242.
  9. Liang, Y.-C., Maimury, Y., Chen, A. H.-L., & Juarez, J. R. C. (2020). Machine Learning-Based Prediction of Air Quality. Applied Sciences, 10(24), 9151.
  10. Castelli, M., Clemente, F. M., Popovič, A., Silva, S., & Vanneschi, L. (2020). A machine learning approach to predict air quality in California. Complexity, 2020(1), 8049504.
  11. Ravindiran, G., Hayder, G., Kanagarathinam, K., Alagumalai, A., & Sonne, C. (2023). Air quality prediction by machine learning models: A predictive study on the indian coastal city of Visakhapatnam. Chemosphere, 338, 139518.
  12. Gupta, N. S., Mohta, Y., Heda, K., Armaan, R., Valarmathi, B., & Arulkumaran, G. (2023). Prediction of air quality index using machine learning techniques: a comparative analysis. Journal of Environmental and Public Health, 2023(1), 4916267.
  13. Avvari, P., Nacham, P., Sasanapuri, S., Mankena, S. R., Kudipudi, P., & Madapati, A. (2023). Air Quality Index Prediction. In E3S Web of Conferences (Vol. 391, p. 01103). EDP Sciences.
  14. Liu, H., Li, Q., Yu, D., & Gu, Y. (2019). Air quality index and air pollutant concentration prediction based on machine learning algorithms. Applied sciences, 9(19), 4069.
  15. Zhou, Y., Wang, W., Wang, K., & Song, J. (2022). Application of LightGBM algorithm in the initial Design of a Library in the cold area of China based on comprehensive performance. Buildings, 12(9), 1309.
  16. Kang, G. K., Gao, J. Z., Chiao, S., Lu, S., & Xie, G. (2018). Air quality prediction: big data and machine learning approaches. Int. J. Environ. Sci. Dev, 9(1), 8-16.
  17. Sharma, R., Shilimkar, G., & Pisal, S. (2021). Air quality prediction by machine learning. Int. J. Sci. Res. Sci. Technol, 8, 486-492.
  18. Amjad, M., Ahmad, I., Ahmad, M., Wróblewski, P., Kamiński, P., & Amjad, U. (2022). Prediction of pile bearing capacity using XGBoost algorithm: modeling and performance evaluation. Applied Sciences, 12(4), 2126.
  19. Singh, M. P., Bisht, N., Choudhary, M., Goswami, A., & Tagore, N. K. (2025). A Web-Based Supervised Machine Learning Model for Air Quality Index and Respiratory Care Prediction. Procedia Computer Science, 258, 1747-1756.
  20. Iskandaryan, D., Ramos, F., & Trilles, S. (2020). Air quality prediction in smart cities using machine learning technologies based on sensor data: a review. Applied Sciences, 10(7), 2401.
  21. Ghosh, P., Hazra, S., & Chatterjee, S. Future Prospects Analysis in Healthcare Management Using Machine Learning Algorithms. the International Journal of Engineering and Science Invention (IJESI), ISSN (online), 2319-6734.
  22. Hazra, S., Mahapatra, S., Chatterjee, S., & Pal, D. (2023). Automated Risk Prediction of Liver Disorders Using Machine Learning. In the proceedings of 1st International conference on Latest Trends on Applied Science, Management, Humanities and Information Technology (SAICON-IC-LTASMHIT-2023) on 19th June (pp. 301-306).
  23. Gon, A., Hazra, S., Chatterjee, S., & Ghosh, A. K. (2023). Application of machine learning algorithms for automatic detection of risk in heart disease. In Cognitive cardiac rehabilitation using IoT and AI tools (pp. 166-188). IGI Global.
  24. Das, S., Chatterjee, S., Sarkar, D., & Dutta, S. (2022). Comparison Based Analysis and Prediction for Earlier Detection of Breast Cancer Using Different Supervised ML Approach. In Emerging Technologies in Data Mining and Information Security: Proceedings of IEMIS 2022, Volume 3 (pp. 255-267). Singapore: Springer Nature Singapore.
  25. Das, S., Chatterjee, S., Karani, A. I., & Ghosh, A. K. (2023, November). Stress Detection While Doing Exam Using EEG with Machine Learning Techniques. In International Conference on Innovations in Data Analytics (pp. 177-187). Singapore: Springer Nature Singapore.
  26. Hazra, S. (2024). Pervasive nature of AI in the health care industry: high-performance medicine.
  27. Sima Das, Siddhartha Chatterjee, Sutapa Bhattacharya, Solanki Mitra, Arpan Adhikary and Nimai Chandra Giri “Movie’s-Emotracker: Movie Induced Emotion Detection by using EEG and AI Tools”, In the proceedings of the 4th International conference on Communication, Devices and Computing (ICCDC 2023), Springer-LNEE SCOPUS Indexed, DOI: 10.1007/978-981-99-2710-4_46, pp.583-595, vol. 1046 on 28th July, 2023.
  28. Chatterjee, R., Chatterjee, S., Samanta, S., & Biswas, S. (2024, December). AI Approaches to Investigate EEG Signal Classification for Cognitive Performance Assessment. In 2024 6th International Conference on Computational Intelligence and Networks (CINE) (pp. 1-7). IEEE.
  29. Adhikary, A., Das, S., Mondal, R., & Chatterjee, S. (2024, February). Identification of Parkinson’s Disease Based on Machine Learning Classifiers. In International Conference on Emerging Trends in Mathematical Sciences & Computing (pp. 490-503). Cham: Springer Nature Switzerland.
  30. Ghosh, P., Dutta, R., Agarwal, N., Chatterjee, S., & Mitra, S. (2023). Social media sentiment analysis on third booster dosage for COVID-19 vaccination: a holistic machine learning approach. Intelligent Systems and Human Machine Collaboration: Select Proceedings of ICISHMC 2022, 179-190.
  31. Rupa Debnath; Rituparna Mondal; Arpita Chakraborty; Siddhartha Chatterjee (2025) Advances in Artificial Intelligence for Lung Cancer Detection and Diagnostic Accuracy: A Comprehensive Review. International Journal of Innovative Science and Research Technology, 10(5), 1579-1586. https://doi.org/10.38124/IJISRT/25may1339
  32. Nitu Saha; Rituparna Mondal; Arunima Banerjee; Rupa Debnath; Siddhartha Chatterjee; (2025) Advanced Deep Lung Care Net: A Next Generation Framework for Lung Cancer Prediction. International Journal of Innovative Science and Research Technology, 10(6), 2312-2320. https://doi.org/10.38124/ijisrt/25jun1801
  33. Poushali Das; Washim Akram; Arijita Ghosh; Suman Biswas; Siddhartha Chatterjee (2025) Enhancing Diagnostic Accuracy: Leveraging Continuous pH Surveillance for Immediate Health Evaluation. International Journal of Innovative Science and Research Technology, 10(7), 7-12. https://doi.org/10.38124/ijisrt/25jul123
  34. Manali Sarkar; Aparajita Das; Sraddha Roy Choudhury; Siddhartha Chatterjee (2025). A* Based Optimized Travel Recommendation System for Smart Mobility. International Journal of Innovative Science and Research Technology, 10(5), 3185-3193. https://doi.org/10.38124/ijisrt/25may2352
  35. Hazra, S., Chatterjee, S., Mandal, A., Sarkar, M., Mandal, B.K. (2023). An Analysis of Duckworth-Lewis-Stern Method in the Context of Interrupted Limited over Cricket Matches. In: Chaki, N., Roy, N.D., Debnath, P., Saeed, K. (eds) Proceedings of International Conference on Data Analytics and Insights, ICDAI 2023. ICDAI 2023. Lecture Notes in Networks and Systems, vol 727. Springer, Singapore. https://doi.org/10.1007/978-981-99-3878-0_46
  36. Babli Kumari, Renu Dhir, Siddhartha Chatterjee, and Suchi Jain. 2025. Automated Identification of Traffic Accidents in Images and Videos Employing Advanced Deep Learning Methods. In 26th International Conference on Distributed Computing and Networking (ICDCN 2025), Janu ary 04–07, 2025, Hyderabad, India. ACM, New York, NY, USA,6 pages. https://doi.org/10.1145/3700838.370368
  37. Madhuparna Das Hait; Priya Das; Washim Akram; Siddhartha Chatterjee (2025): A Comparative Analysis of Linear Regression Techniques: Evaluating Predictive Accuracy and Model Effectiveness. International Journal of Innovative Science and Research Technology, 10(7), 127-139. https://doi.org/10.38124/ijisrt/25jul34.

In current years, Air pollution has emerged as a significant environmental concern. Accuracy modeling the complex relationships between air quality variables using advanced machine learning techniques is a promising area of research. The study aims to evaluate and compare the performance of supervised machine learning methods including Support Vector Regressor (SVR), Random Forest (RF), XGBoost, LightGBM for the prediction of air quality index. For the research, we collect a dataset from Kaggle. To assess the model performance, metrices such as root-mean-square-error (RMSE), Mean Absolute Error (MAE) and coefficient of determination (R2 ) were used. Experimental result showed how LightGBM model outperformed the others in AQI prediction (RMSE = 1.4704, R2 = 0.9987 and MAE = 0.1824). Furthermore, all models were evaluated using these metrices, offering a clear comparison that highlighted the factors contributing to the improved accuracy.

Keywords : Air Quality; Air Pollutant; Support Vector Regressor; Random Forest Regressor; XGBoost; LightGBM; Root-Mean- Squared-Error; Mean Absolute Error; Coefficient of Determination; Supervised Methos.

CALL FOR PAPERS


Paper Submission Last Date
31 - December - 2025

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.
Subscribe
OR

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.
Subscribe