Abstract
Water is a vital resource essential for all life, and its quality has been compromised by pollution and contamination in recent decades. The Water Quality Index (WQI) is crucial for evaluating water validity for several purposes, including drinking. Accurate WQI prediction allows for proactive strategies to combat water contamination, preserve public well-being, and guarantee access to safe water sources. This study introduces a novel approach utilizing advanced Machine Learning (ML) techniques for WQI prediction, demonstrating substantial improvements over traditional methods. The methods include the Ridge Model, Lasso Model, Random Forest (RF) Model, Extra Trees (ExT) Model, AdaBoost (AB) Model, XGBoost (XGB) Model, Gradient Boosting (GB) Model, LightGBM Model, Linear Regression (LR) Model, K-nearest neighbor (KNN) Model, Regressor (R) Model, Decision Tree (DT) Model, Multi-layer Perceptron (MLP) Model and Support Vector Regressor (SVR) Model, to determine the most effective models for predicting WQI. The proposed models are trained on a publicly available dataset from 145 groundwater well samples collected between January and April 2018 in Abu Dhabi, the United Arab Emirates (UAE). The models’ performance was assessed using various metrics, including Mean Square Error (MSE), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Adjusted R-squared, Mean Absolute Percentage Error (MAPE) and R-squared (R2). Experimental results indicate promising performance across all models. In particular, the LR Model proved to be exceptionally accurate, precisely predicting WQI values with 100% accuracy during testing. According to the experimental findings, this model surpassed others in regression tasks, achieving an R2 value of 100% in WQI prediction. The proposed research confirms the effectiveness of ML algorithms in the field of Water Resources and will serve as a reference for the researchers working in the field of WQI prediction.
Similar content being viewed by others
Data Availability
Data is available at https://doi.org/10.1016/j.gsd.2021.100611.
References
Adimalla N (2021) Application of the entropy weighted water quality index (EWQI) and the pollution index of groundwater (PIG) to assess groundwater quality for drinking purposes: a case study in a rural area of Telangana State, India. Arch Environ Contam Toxicol 80(1):31–40
Ahmad S, Kutty AA, Raji F, Saimy IS (2015) Water quality classification based on water quality index in Sungai Langat, Selangor, Malaysia. Jurnal Teknologi 77(30):139–144
Ahmad T, Muhammad S, Umar M, Azhar MU, Ahmed A, Ahmed A, Ullah R (2024) Spatial distribution of physicochemical parameters and drinking and irrigation water quality indices in the Jhelum River, Pakistan. Environ Geochem Health 46(8):263
Ahmed AKA, El-Rawy M, Ibraheem AM, Al-Arifi N, Abd-Ellah MK (2023) Forecasting of Groundwater Quality by using Deep Learning Time Series techniques in an Arid Region. Sustainability 15(8):6529
Akhtar N, Ishak MIS, Ahmad MI, Umar K, Md Yusuff MS, Anees MT, Qadir A, Almanasir A, Y. K (2021) Modification of the water quality index (WQI) process for simple calculation using the multi-criteria decision-making (MCDM) method: a review. Water 13(7):905
Al-Kindi KM, Janizadeh S (2022) Machine learning and Hyperparameters algorithms for identifying Groundwater Aflaj potential mapping in Semi-arid ecosystems using LiDAR, Sentinel-2, GIS data, and analysis. Remote Sens 14(21):5425
Al-Ruzouq R, Shanableh A, Merabtene T, Siddique M, Khalil MA, Idris A, Almulla E (2019) Potential groundwater zone mapping based on geo-hydrological considerations and multi-criteria spatial analysis: North UAE. CATENA 173:511–524
Al-Ruzouq R, Shanableh A, Mukherjee S, Khalil MA, Gibril MB, Jena R, Yilmaz AG, Hammouri NA (2023) Comparative analysis of machine learning and analytical hierarchy analysis for artificial groundwater recharge map development. Environ Earth Sci 82(23):580
Al-Ruzouq R, Shanableh A, Jena R, Mukherjee S, Khalil MA, Gibril MBA, Pradhan B, Hammouri NA (2024) Hybrid deep learning and remote sensing for the delineation of artificial groundwater recharge zones. Egypt J Remote Sens Space Sci 27(2):178–191
Aldhyani THH, Al-Yaari M, Alkahtani H, Maashi M (2020) Water quality prediction using artificial intelligence algorithms. Applied Bionics and Biomechanics, 2020
Aldris B, Farhoud N (2020) Wastewater treatment efficiency of an experimental MBBR system under different influent concentrations. DYSONA - Appl Sci 1(1):20–28. https://doi.org/10.30493/das.2020.103717
Alhamd ADS, Ibrahim MA (2024) Unveiling soil and groundwater salinity dynamics and its impact on date palm yield in Southern Basrah, Iraq. DYSONA-Applied Science, 25–32
Alshehri F, Rahman A (2023) Coupling machine and deep learning with explainable artificial intelligence for improving prediction of groundwater quality and decision-making in Arid Region, Saudi Arabia. Water 15(12):2298
Baig F, Sherif M, Sefelnasr A, Faiz MA (2023) Groundwater vulnerability to contamination in the gulf cooperation council region: A review. In Groundwater for Sustainable Development (Vol. 23). Elsevier B.V. https://doi.org/10.1016/j.gsd.2023.101023
Baig F, Ali L, Faiz MA, Chen H, Sherif M (2024) How accurate are the machine learning models in improving monthly rainfall prediction in hyper arid environment? J Hydrol 633:131040
Banda TD, Kumarasamy MV (2020) Development of Water Quality Indices (WQIs): a review. Pol J Environ Stud, 29(3)
Batarseh M, Imreizeeq E, Tilev S, Al Alaween M, Suleiman W, Al Remeithi AM, Tamimi A, M. K., Alawneh A, M (2021) Assessment of groundwater quality for irrigation in the arid regions using irrigation water quality index (IWQI) and GIS-Zoning maps: Case study from Abu Dhabi Emirate, UAE. Groundw Sustainable Dev 14:100611
Brown RM, McClelland NI, Deininger RA, Tozer RG (1970) A water quality index-do we dare. Water Sew Works, 117(10)
Derdour A, Abdo HG, Almohamad H, Alodah A, Al Dughairi AA, Ghoneim SSM, Ali E (2023) Prediction of Groundwater Quality Index using classification techniques in arid environments. Sustainability 15(12):9687
Dharma VL, Nurtanio NK, Nugroho FS, Anggreainy MS, Kurniawan A (2023) A Review on Machine Learning Methods for Water Quality Prediction. 2023 4th International Conference on Artificial Intelligence and Data Sciences (AiDAS), 131–138
Dimple D, Rajput J, Al-Ansari N, Elbeltagi A (2022) Predicting Irrigation Water Quality Indices based on Data-Driven algorithms: Case Study in Semiarid Environment. J Chem 2022(1):4488446
Dornier Consult GTZ, ADNOC (2005) &. Groundwater Assessment Project in Abu Dhabi: Status Report on Phases 1Xa, 1Xb, and 1Xc
Drogkoula M, Kokkinos K, Samaras N (2023) A comprehensive survey of machine learning methodologies with emphasis in water resources management. Appl Sci 13(22):12147
ElHaj K, Issa S, Alshamsi D, Cherkose BA (2023) Modeling and prediction of Groundwater Level fluctuations using geoinformatics and Artificial neural networks in Al Ain City, UAE. Water resources Management and sustainability: solutions for arid regions. Springer, 205–217
Elmahdy S, Ali T, Mohamed M (2020) Flash Flood susceptibility modeling and magnitude index using machine learning and geohydrological models: a modified hybrid approach. Remote Sens 12(17):2695
Elmahdy S, Ali T, Mohamed M (2021) Regional mapping of groundwater potential in ar rub Al Khali, arabian peninsula using the classification and regression trees model. Remote Sens 13(12):2300
Ewaid SH, Abed SA, Al-Ansari N, Salih RM (2020) Development and evaluation of a water quality index for the Iraqi rivers. Hydrology 7(3):67
Fang J, Yang Y, Yi P, Xiong L, Shen J, Ahmed A, ElHaj K, Alshamsi D, Murad A, Hussein S (2024) Geospatial stable isotopes signatures of groundwater in United Arab Emirates using machine learning. J Hydrology: Reg Stud 55:101938
Farzana SZ, Paudyal DR, Chadalavada S, Alam MJ (2024) Decision Support Framework for Water Quality Management in Reservoirs Integrating Artificial Intelligence and statistical approaches. Water 16(20):2944
Gao Y, Qian H, Ren W, Wang H, Liu F, Yang F (2020) Hydrogeochemical characterization and quality assessment of groundwater based on integrated-weight water quality index in a concentrated urban area. J Clean Prod 260:121006
Goodarzi MR, Niknam ARR, Barzkar A, Niazkar M, Zare Mehrjerdi Y, Abedi MJ, Pour H, M (2023) Water quality index estimations using machine learning algorithms: a case study of Yazd-Ardakan Plain, Iran. Water 15(10):1876
Hanoon SK, Abdullah AF, Shafri HZM, Wayayok A (2022) A novel approach based on machine learning and public engagement to predict water-scarcity risk in urban areas. ISPRS Int J Geo-Information 11(12):606
Hmoud Al-Adhaileh M, Waselallah Alsaade F (2021) Modelling and prediction of water quality by using artificial intelligence. Sustainability 13(8):4259
Horton RK (1965) An index number system for rating water quality. J Water Pollut Control Fed 37(3):300–306
Hussein EE, Jat Baloch MY, Nigar A, Abualkhair HF, Aldawood FK, Tageldin E (2023) Machine learning algorithms for predicting the water quality index. Water 15(20):3540
Ibrahim H, Yaseen ZM, Scholz M, Ali M, Gad M, Elsayed S, Khadr M, Hussein H, Ibrahim HH, Eid MH (2023) Evaluation and prediction of groundwater quality for irrigation using an integrated water quality indices, machine learning models and GIS approaches: a representative case study. Water 15(4):694
Jain D, Shah S, Mehta H, Lodaria A, Kurup L (2021) A Machine Learning Approach to Analyze Marine Life Sustainability. Proceedings of International Conference on Intelligent Computing, Information and Control Systems: ICICCS 2020, 619–632
Jibrin AM, Al-Suwaiyan M, Aldrees A, Dan’azumi S, Usman J, Abba SI, Yassin MA, Scholz M, Sammen SS (2024) Machine learning predictive insight of water pollution and groundwater quality in the Eastern Province of Saudi Arabia. Sci Rep 14(1):20031
Khan MSI, Islam N, Uddin J, Islam S, Nasir MK (2022) Water quality prediction and classification based on principal component regression and gradient boosting classifier approach. J King Saud University-Computer Inform Sci 34(8):4773–4781
Khoi DN, Quan NT, Linh DQ, Nhi PTT, Thuy NTD (2022) Using machine learning models for predicting the water quality index in the La Buong River, Vietnam. Water 14(10):1552
khouri L, Sallom A (2023) The impact of spatial and temporal shifts on Orontes River water quality parameters. DYSONA - Appl Sci 4(2):35–41. https://doi.org/10.30493/das.2023.364393
Kumar P, Ahmed AN, Sherif M, Sefelnasr A, Elshafie A (2023) Development of Long Short-Term Memory Model for Predictionof Water Table Depth in United Arab Emirates. Water resources Management and sustainability: solutions for arid regions.Springer, 141–152
Kumari M, Rai SC (2020) Hydrogeochemical evaluation of groundwater quality for drinking and irrigation purposes using water quality index in semi arid region of India. J Geol Soc India 95:159–168
Li P, Wu J (2019) Drinking water quality and public health. Exposure Health 11(2):73–79
Li Z, Ma C, Sun Y, Lu X, Fan Y (2022) Ecological health evaluation of rivers based on phytoplankton biological integrity index and water quality index on the impact of anthropogenic pollution: a case of Ashi River Basin. Front Microbiol 13:942205
Lukhabi DK, Mensah PK, Asare NK, Pulumuka-Kamanga T, Ouma KO (2023) Adapted water quality indices: limitations and potential for water quality monitoring in Africa. Water 15(9):1736
Mahmoud MT, Hamouda MA, Kendi A, R. R., Mohamed MM (2018) Health risk assessment of household drinking water in a district in the UAE. Water 10(12):1726
Malek NHA, Yaacob WFW, Nasir SAM, Shaadan N (2022) Prediction of Water Quality classification of the Kelantan River Basin, Malaysia, using machine learning techniques. Water (Switzerland) 14(7). https://doi.org/10.3390/w14071067
Miller T, Cembrowska-Lech D, Kisiel A, Kołodziejczak M, Krzemińska A, Jawor M, Lewita K, Kozlovska P, Mosiundz S (2023) Advancing water quality monitoring through artificial neural networks: present insights and future opportunities in scientific exploration. Sci Collect «InterConf+» 32(151):399–409
Mishra RK (2023) Fresh water availability and its global challenge. Br J Multidisciplinary Adv Stud 4(3):1–78
Mogane LK, Masebe T, Msagati TAM, Ncube E (2023) A comprehensive review of water quality indices for lotic and lentic ecosystems. Environ Monit Assess 195(8):926
Mohammed MAA, Khleel NAA, Szabó NP, Szűcs P (2023) Modeling of groundwater quality index by using artificial intelligence algorithms in northern Khartoum State, Sudan. Model Earth Syst Environ 9(2):2501–2516
Murad AA, Nuaimi A, H., Al Hammadi M (2007) Comprehensive assessment of water resources in the United Arab Emirates (UAE). Water Resour Manage 21:1449–1463
Nair J, P, Vijaya, S M (2022) River water quality prediction and index classification using machine learning. J Phys: Conf Ser 2325(1):012011
Nong X, Shao D, Zhong H, Liang J (2020) Evaluation of water quality in the South-to-North Water Diversion Project of China using the water quality index (WQI) method. Water Res 178:115781
Osman AIA, Ahmed AN, Huang YF, Kumar P, Birima AH, Sherif M, Sefelnasr A, Ebraheemand AA, El-Shafie A (2022) Past, present and perspective methodology for groundwater modeling-based machine learning approaches. Arch Comput Methods Eng 29(6):3843–3859
Radhakrishnan N, Pillai AS (2020) Comparison of water quality classification models using machine learning. 2020 5th International Conference on Communication and Electronics Systems (ICCES), 1183–1188
Ruidas D, Pal SC, Chowdhuri I, Saha A, Biswas T, Islam AR, M. T., Shit M (2023) Hydrogeochemical evaluation for human health risk assessment from contamination of coastal groundwater aquifers of Indo-Bangladesh Ramsar site. J Clean Prod 399:136647
Saeedi M, Abessi O, Sharifi F, Meraji H (2010) Development of groundwater quality index. Environ Monit Assess 163:327–335
Sahour H, Sultan M, Abdellatif B, Emil M, Abotalib AZ, Abdelmohsen K, Vazifedan M, Mohammad AT, Hassan SM, Metwalli MR (2022) Identification of shallow groundwater in arid lands using multi-sensor remote sensing data and machine learning algorithms. J Hydrol 614:128509
Sefelnasr A, Ebraheem AA, Sherif M, Al Mulla M (2023) Water resources, uses, and their Integrated Management in theUnited Arab Emirates. Integr Drought Manage 2:673–704
Shams MY, Elshewey AM, El-kenawy ESM, Ibrahim A, Talaat FM, Tarek Z (2023) Water quality prediction using machine learning models based on grid search method. Multimedia Tools Appl. https://doi.org/10.1007/s11042-023-16737-4
Sherif M, Liaqat MU, Baig F, Al-Rashed M (2023) Water resources availability, sustainability and challenges in the GCC countries: an overview. Heliyon, vol 9. Issue 10). Elsevier Ltd. https://doi.org/10.1016/j.heliyon.2023.e20543
Suwadi NA, Derbali M, Sani NS, Lam MC, Arshad H, Khan I, Kim K-I (2022) An optimized approach for predicting water quality features based on machine learning. Wirel Commun Mob Comput 2022(1):3397972
Tabassum S, Kotnala CB, Masih RK, Shuaib M, Alam S, Alar TM (2023) Performance analysis of machine learning techniques for predicting water quality index using physiochemical parameters. 2023 International Conference on Sustainable Computing and Smart Systems (ICSCSS), 372–377
Uddin MG, Nash S, Rahman A, Olbert AI (2023) Performance analysis of the water quality index model for predicting water state using machine learning techniques. Process Saf Environ Prot 169:808–828
Vilupuru JR, Amuluru DC, Begum G (2022) Water quality analysis using artificial intelligence algorithms. 2022 4th International Conference on Inventive Research in Computing Applications (ICIRCA), 1193–1199
Wang X, Li Y, Qiao Q, Tavares A, Liang Y (2023) Water quality prediction based on machine learning and comprehensive weighting methods. Entropy 25(8):1186
Waraga OA, Abdeljaber A, Talib MA, Abdallah M (2021) Investigating Water Consumption Patterns Through Time Series Clustering. 2021 14th International Conference on Developments in ESystems Engineering (DeSE), 44–49
Wee WJ, Zaini NB, Ahmed AN, El-Shafie A (2021) A review of models for water level forecasting based on machine learning. Earth Sci Inf 14:1707–1728
William P, Oyebode OJ, Ramu G, Gupta M, Bordoloi D, Shrivastava A (2023) Artificial Intelligence based Models to Support Water Quality Prediction using Machine Learning Approach. 2023 International Conference on Circuit Power and Computing Technologies (ICCPCT), 1496–1501
Wu Z, Lai X, Li K (2021) Water quality assessment of rivers in Lake Chaohu Basin (China) using water quality index. Ecol Ind 121:107021
Xia L, Han Q, Shang L, Wang Y, Li X, Zhang J, Yang T, Liu J, Liu L (2022) Quality assessment and prediction of municipal drinking water using water quality index and artificial neural network: a case study of Wuhan, central China, from 2013 to 2019. Sci Total Environ 844:157096
Yagoub MM, Tesfaldet YT, Elmubarak MG, Al Hosani N (2022) Extraction of urban quality of life indicators using remote sensing and machine learning: the case of Al Ain city, United Arab Emirates (UAE). ISPRS Int J Geo-Information 11(9):458
Yang Y, Li P, Elumalai V, Ning J, Xu F, Mu D (2023) Groundwater quality assessment using EWQI with updated water quality classification criteria: a case study in and around Zhouzhi County, Guanzhong Basin (China). Exposure Health 15(4):825–840
Younis HI, Kizhisseri MI, Mohamed MM (2023) Forecasting Future Water demands for Sustainable Development in Al-Ain City, United Arab Emirates. Water (Switzerland) 15(21). https://doi.org/10.3390/w15213800
Funding
This study was funded by the United Arab Emirates University (funds no. 12S139 and 12S158).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests. There are no financial, personal, or professional relationships that could influence the research presented in this article.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ahmad, T., Ali, L., Alshamsi, D. et al. AI-Powered Water Quality Index Prediction: Unveiling Machine Learning Precision in Hyper-Arid Regions. Earth Syst Environ 9, 677–694 (2025). https://doi.org/10.1007/s41748-024-00524-8
Received:
Revised:
Accepted:
Published:
Issue date:
DOI: https://doi.org/10.1007/s41748-024-00524-8