Soil erosion remains a critical environmental challenge that impacts ecosystems, agriculture, and infrastructure (Olii et al., 2023). The degradation of fertile topsoil, sedimentation of waterways, and loss of vegetation cover are just some of the detrimental effects caused by erosion, which can lead to long-term ecological damage (Arabameri et al., 2019) and economic losses (Almouctar et al., 2021). Predicting and managing soil erosion susceptibility (SES) is therefore essential for sustainable land use and environmental conservation (Kucuker & Cedano Giraldo, 2022). To achieve this, accurate and reliable models that can predict the susceptibility of different areas to erosion are needed. Traditionally, modeling SES has relied on empirical methods that use historical data and simple statistical relationships to predict future erosion patterns (Saini et al., 2015). However, these methods often fail to capture the complexity of the interactions between the many environmental variables that influence erosion processes, such as rainfall intensity, soil type, land use, and topography (Olii, Olii, et al., 2024). As a result, there is a growing need for more sophisticated modeling approaches that can better account for these complexities and provide more accurate predictions (Golijanin et al., 2022; Kucuker & Cedano Giraldo, 2022).
Geospatial modeling, when combined with machine learning techniques, significantly enhances the capability to predict and analyze SES. Machine learning (ML) models such as Random Forest (RF), Decision Tree (DT), Artificial Neural Networks (ANN), Support Vector Machines (SVM), Generalized Linear Models (GLM), etc can handle complex, non-linear relationships between environmental factors and SES, which traditional geospatial models might not fully capture (Al-Bawi et al., 2021; Gayen et al., 2019). By integrating these models with GIS and remote sensing data, the spatial patterns of SES can be more accurately mapped and predicted (Olii, Olii, et al., 2024). This combination allows for a more data-driven approach, where the models can learn from large datasets, adjust to various geographical contexts, and improve prediction accuracy by using classified and weighted factors tailored to local environmental conditions. The synergy between geospatial modeling and ML offers powerful tools for more effective land management and soil erosion prevention strategies.
Most studies utilizing ML for SES modeling rely on raw or normalized continuous data, without prior classification into discrete classes or the assignment of weights based on expert judgment (Golkarian et al., 2023; Huang et al., 2023; Phinzi & Szabó, 2024). This approach can lead to less interpretable models, as the continuous nature of the data might obscure important distinctions between different categories of environmental factors. Additionally, the lack of expert-informed weights may result in the model underestimating or overestimating the significance of certain variables, potentially compromising the accuracy and robustness of predictions. This limitation highlights the need for more sophisticated methods that incorporate domain expertise into the ML modeling process to improve the reliability and practical applicability of SES assessments. This leaves a gap in understanding how the integration of this traditional approach with advanced ML models could enhance prediction accuracy and model interpretability. This study introduces a novel approach by integrating traditional classification and weighting of environmental factors with advanced ML models like SVM and GLM for SES mapping. Unlike previous studies that often apply these models individually, this research emphasizes the innovative combination of pre-classifying factors into discrete classes and assigning weights based on expert knowledge. This methodology not only enhances model interpretability, making the results more accessible to practitioners and decision-makers, but also addresses common machine learning challenges such as data complexity, overfitting, and multicollinearity. Furthermore, the adaptable classification system allows the model to be customized to various geographic settings, increasing its applicability and robustness. This study offers a significant advancement over traditional modeling approaches by improving prediction accuracy and model stability, particularly in regions with unique environmental conditions.
This study aims to address this research gap by exploring the potential for integrating GLM and SVM in the geospatial modeling of SES. The research will involve a systematic comparison of these models to evaluate their effectiveness in predicting SES across different spatial scales and environmental conditions based on prior classification into discrete classes. Additionally, the study will investigate how these models can be optimized to handle the complexities of spatial data, including the influence of diverse and non-linear environmental variables such as climate, topography, and land use. By integrating GLM and SVM, this research seeks to develop a more robust and comprehensive framework for predicting erosion susceptibility, which could enhance the accuracy and reliability of SES assessments. The findings of this study have the potential to contribute significantly to the field of environmental modeling, offering new insights into the strengths and limitations of GLM and SVM and providing a basis for future research on the integration of statistical and ML approaches in geospatial modeling.