Most of the freshwater bodies around world are being contaminated, reducing the suitability of the water. Urbanization has led to an increase in water pollution, posing a severe concern for human life (Agrawal et al., 2021; Chen et al., 2022). Due to the continuous growth in urbanization, only 0.3% of the world's water resources are usable (Kılıç, 2020). Around 785 million people globally do not have access to a safe and reliable water source (Shadabi & Ward, 2022), and around 2.5 billion people do not have proper sanitation. This lack of access to clean and safe water undermines efforts to end extreme poverty and disease in the world's poorest countries (Schweitzer et al., 2020). Water quality assessment plays a crucial role in informing water management decisions. It provides valuable information about the status of water resources, enabling authorities to identify potential risks and take appropriate measures to protect and improve water quality. Assessment methods, such as the Canadian Council of Ministers of the Environment Water Quality Index (CCME-WQI), the National Sanitation Foundation Water Quality Index (NSF-WQI), the Irrigation Water Quality Index (IWQI) and the Weighted Arithmetic Water Quality Index Method (WAWQI) are commonly used to evaluate water quality (Islam, 2024; Khan et al., 2022). These methods analyse various physicochemical parameters to determine the overall quality of water, including its suitability for drinking, irrigation, and other purposes. By assessing water quality, authorities can identify pollution sources, such as industrial effluents, sewage, and agricultural runoff, and implement measures to mitigate their impact (Lamrini et al., 2022; To, 2020). Water quality assessment also helps in monitoring trends and identify areas where water quality is deteriorating, allowing for targeted interventions and the development of effective water management strategies (Lee, 2021).
In this article, the authors aim to review existing literature on water quality assessment methods, including traditional indices and the application of machine learning techniques. Additionally, they seek to develop machine learning models for predicting WQI values based on physicochemical parameters, evaluate the performance of these machine learning models against traditional methods in terms of accuracy and efficiency, and investigate the potential of machine learning models to provide dynamic and adaptable solutions for water quality assessment in the face of changing environmental conditions.
In recent years, there has been a growing interest in utilizing machine learning techniques to enhance water quality assessment by predicting WQI values. Machine learning algorithms have shown promise in their ability to analyse vast datasets and generate predictive models (Oreški et al., 2023). These models try to offer a more efficient and accurate way of calculating WQI than traditional methods, which involve manual measurement and analysis of various parameters, by harnessing the power of data-driven modelling (Tabassum et al., 2023). However, multiple studies have revealed that the traditional WQI model produced significantly higher uncertainty in its modelling process (Juwana et al., 2016; Rezaie-Balf et al., 2020; Sutadian et al., 2015; Uddin et al., 2021). As a result, the WQI model needs to reflect accurate water quality status by overestimating and underestimating of WQI values. Some researchers have opted for a non-physical approach to overcome these issues, successfully predicting WQI using artificial intelligence (AI).
However, while the narrative above outlines the current challenges and the potential of machine learning in water quality assessment, it's imperative to ground this discussion in existing scientific literature. Therefore, this study aims to bridge this gap by conducting a comprehensive evaluation of water quality assessment through machine learning, focusing on predicting WQI values.
The application of machine learning techniques in water quality assessment, specifically for the prediction of WQI, is not merely a replacement for deterministic expressions derived from water quality parameters. While it's true that traditional methods offer a straightforward equation for calculating WQI, the complexity and variability inherent in water quality data often necessitate a more nuanced approach. Machine learning models excel in identifying patterns and relationships within large datasets that might not be immediately apparent through deterministic calculations. Furthermore, these models can adapt to changes in water quality parameters over time, offering a dynamic and robust tool for water quality assessment that deterministic methods may not provide. This adaptability is crucial in the face of varying environmental factors and pollution sources, ensuring accurate and timely assessments that support effective water management strategies.
The remainder of this article is as follows. Section 2 explores the related works of the study and a comprehensive review of various machine learning algorithms applications in water quality assessments, including a tabular representation of their recent advancements. Section 3 discussed the step-by-step procedure of the study. Section 4 outlines the result and discussion part. This section also suggested the most reliable model for predictions in our dataset analyses. The conclusion part is presented in Section 5.