Laser-induced breakdown spectroscopy (LIBS) is a robust technique employed in various fields for elemental detection and quantification purposes [1]. The technique has been used to study materials’ compositions to investigate the constituent elements for different applications. For example, food processing industries have employed LIBS to detect radioactive and heavy elements that may be hazardous to human health [2]. Similarly, building and construction industries have utilized the method to detect chloride concentrations in concrete materials, which are the major cause of corrosion of metallic constituents [3]. Furthermore, LIBS has found crucial applications in medicine, where patients’ specimens are investigated to identify elements present together with their respective concentrations [4]. The operation of LIBS requires the sample under consideration to be subjected to a concentrated, high-energy laser pulse. The laser-material interaction generates a high-temperature plasma that contains the emission signatures of the constituent atoms. The atomic emission intensities are then collected via an advanced optical device called a spectrometer and processed, after which the output is generated as spectra [5]. In principle, the spectrometer simply captures light of varying intensities and wavelengths. Noteworthy, as the LIBS operate based on matching elements with their wavelength, the elements may not have a unique wavelength since the electrons can be excited from the same or varying energy levels during the plasma formation phase [6, 7].
Soil science is a remarkable field that consists of soil biology, soil physics, and soil chemistry [8]. The soil unconfined compressive strength is a critical engineering quantity that is employed during the design and audit of fundamental geotechnical and environmental structures like pavements, buildings [9], tunnels, railways, road foundations, bridges, and dams [10]. The UCS is a direct measure of the soil’s compaction strength. The traditional means of obtaining such a quantity is via the unconfined compression test in the laboratory [11]. Nevertheless, the technique is time-consuming and costly, thus raising the overall cost of the construction work. In addition, the accuracy of the measurements is significantly affected by the equipment quality and the level of technical expertise of the device operator. Therefore, it will be imperative to devise a facile, low-cost, and more reliable way method of estimating the soil unconfined compressive strength.
Recently, the area of soil mechanics has recorded a significant advancement, particularly with respect to testing methodologies [12]. Multiple in-situ mechanisms and laboratory tests were improvised and strengthened yielding extraordinary progress in the geotechnical investigation of soil [13–15]. These techniques provide quantitative properties of soil materials that can be used for analysis and assessments. However, the majority of these methods are costly, time-consuming, and demand intensive physical efforts. Due to the limitations of the existing techniques used in measuring soil properties and considering the importance of such engineering parameters in construction industries, several empirical methods were proposed to estimate soil properties [16, 17]. It is worth highlighting the complexity involved in using empirical techniques to model the indirect relationship between the soil unconfined compressive strength, which is a physical property, and the soil’s chemical properties. Hence, such empirical methods are limited in the accuracy of their predictions. This calls for the deployment of more advanced tools like machine learning algorithms to estimate the soil UCS based on certain properties extracted from the samples such as laser-induced breakdown spectroscopic emission intensities of the constituent elements.
Machine learning is a powerful and intelligent tool that can be used to predict unknown variables from available input data –based on initially trained models [18]. The desired parameter is called the target variable while the rest of the input features are termed descriptors of the model. The descriptors are generally certain quantities that define the properties of the material under investigation and have some reasonable degree of correlation with the unknown quantity [19]. However, they do not have an explicit relationship with the target variable. The model is initially trained using an available dataset comprising both the descriptors and the target variable. It starts by mapping the descriptors unto the target variable using a high-dimensional feature space, after which it can predict the target based on input descriptors only. The potential of machine learning techniques to create a relationship between the input descriptors and target variable in a complex problem using a high-dimensional feature space, without leveraging on their explicit relationship, makes them suitable for advanced scientific research works [20]. Several research works have been reported where machine learning techniques were utilized to solve multiple problems in business, construction, sciences, engineering, medicine, and energy, to mention some [20–24]. Furthermore, many researchers have employed machine learning tools to conduct calibration-free LIBS investigations [25, 26]. This is possible owing to the ability of LIBS to generate enough data and the potential of machine learning algorithms to create the appropriate pattern between the descriptors and the target parameter. In general, LIBS can only provide information relating to the chemical properties of the material such as the elemental intensities and concentration. However, with sophisticated machine learning tools, the physical properties like soil UCS can be extrapolated based on the well-mapped pattern produced by the machine learning tools, as carried out in the present study.
Herein, multiple types of soil samples were collected across different regions of the Kingdom of Saudi Arabia and investigated under the laser-induced breakdown spectroscopy device. Thereafter, the samples were taken for physical properties measurements in the laboratory to obtain their bulk density, soil unconfined compressive strength, and moisture content. The LIBS-generated intensities of the constituent elements, soil’s bulk density, and moisture content were used as input features to train two different machine learning algorithms to predict the soil unconfined compressive strength. The developed models were further validated by investigating the UCS of soil samples stabilized with lime and cement. The two models exhibited excellent prediction accuracy based on the employed metric performance indicators.
The remaining sections of the article are organized as follows: section 2 provides a brief mathematical description of the proposed models. Section 3 highlights the statistics of the dataset utilized in this work as well as the optimization strategies adopted to achieve high prediction accuracy. The results of the predictions and model performance evaluation mechanisms are presented in section 4. Moreover, the applications of the model in studying cement and lime-treated samples are also presented in section 4.