In Press (Summer)                   Back to the articles list | Back to browse issues page


XML Persian Abstract Print


Department of Forest Science and Engineering, Faculty of Natural Resources, University of Guilan, Soumeh Sara, Iran , asalehi@guilan.ac.ir
Abstract:   (153 Views)
1- Introduction
In recent years, with the rise of challenges such as food security, land degradation, water resource scarcity, climate change, and biodiversity conservation, the importance of understanding the physical and chemical properties of soils has increased significantly. These properties have a direct impact on food production, carbon storage, water filtration, and climate regulation, making accurate information on these characteristics essential for environmental and agricultural management. Traditional methods of measuring soil properties have limited applicability over large areas due to their cost and time requirements. However, the emergence of machine learning algorithms and remote sensing data has facilitated more precise mapping. Digital Soil Mapping (DSM) has emerged as an innovative approach that combines advanced algorithms with environmental variables to provide high-quality estimates of soil properties. Recent studies indicate that utilizing models such as Random Forest (RF) and Cubist, paired with topographic and remote sensing data, is effective for predicting soil variables and can significantly enhance the management of natural resources, agriculture, and watersheds. In this study, sampling locations were determined using the Cubist Latin Hypercube Sampling (cLHS) method.
2- Material and methods
Auxiliary variables in the latin hypercube model included bands and indices from Sentinel-2 satellite imagery, encompassing visible spectrum bands, short-wave infrared, vegetation indices, and properties derived from a Digital Elevation Model (DEM), including elevation, slope, aspect, and valley flatness index with high resolution. The Boruta method was employed to select influential variables. The machine learning models used in this study included RF, Cubist, and RF-Cubist, with the data divided into two sections: training (70%) and testing (30%). Accordingly, all data were evaluated as testing data based on the aforementioned method.
3- Results
The results indicated that for the depth of 0-30 cm, the Random forest (RF) model demonstrated the best performance for sand, with RMSE, Bias, Correlation (Cor), and Relative Root Squared Error (RRSE) values of 10.19%, -0.30, 0.92, and 0.41, respectively. Additionally, the Cubist model provided the best estimates for organic carbon, with RMSE, Bias, Cor, and RRSE values of %0.23, 0.005, 0.71, and 0.70, respectively. The RF-Cubist model achieved satisfactory results in predicting silt, with RMSE, Bias, Cor, and RRSE values of 7.80, -0.87, 0.87, and 0.28, respectively. At the depth of 30-60 cm, the RF model again performed best for sand, yielding RMSE, Bias, Cor, and RRSE values of %11.42, -0.23, 0.88, and 0.47, respectively. The Cubist model was noted for its performance in estimating organic carbon, achieving RMSE, Bias, Cor, and RRSE values of %0.26, 0.05, 0.53, and 0.85, respectively. Furthermore, the RF-Cubist model yielded good results in predicting silt, with RMSE, Bias, Cor, and RRSE values of %9.28, -0.58, 0.83, and 0.30, respectively. Uncertainty maps for the various parameters indicated that the RF model exhibited lower uncertainty at many locations compared to other models.
4- Discussion & Conclusions
A specific model should provide the best results with the least error. For instance, at a depth of 0-30 cm, the Random Forest (RF) model exhibited higher accuracy in estimating soil physical properties, characterized by a high R² and lower RMSE, compared to the Cubist and RF-Cubist models. Conversely, the Cubist model demonstrated superior accuracy in estimating soil chemical properties. At a depth of 30-60 cm, the RF-Cubist model outperformed the RF and Cubist models, achieving the best performance with a high R² and lower RMSE. According to results from other studies, the variability of soil properties is attributed to environmental factors, diversity of parent materials, and eomorphological processes, leading to the formation of different landforms and influencing soil particles, especially clay. The low average soil organic carbon at both surface and subsurface depths (0.36 and 0.33) is affected by factors such as poor vegetation cover, limited use of organic fertilizers, and reduced soil fertility. The use of the Boruta model for selecting environmental variables has reduced prediction errors compared to other methods, and variables derived from the Digital Elevation Model (DEM) have been recognized as effective in class modeling and soil property estimation. In comparing predictive models, the RF model demonstrated superior performance due to its lower error values (RMSE and Bias) and higher correlation coefficients compared to the Cubist and RF-Cubist models. Nevertheless, the Cubist model also displayed high accuracy in modeling due to its ability to capture the complexities between soil characteristics. Multiple studies have further confirmed that the RF model provides reliable results with low uncertainty when estimating properties such as organic carbon percentage and clay content. Ultimately, to improve results in future studies, it is recommended that model selection be tailored to the sampling depth and type of soil characteristics, data quality be enhanced, and the use of hybrid models be further developed.
 
     

Received: 2024/12/22

Add your comments about this article : Your username or Email:
CAPTCHA

Send email to the article author


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

© 2025 CC BY-NC 4.0 | Environmental Erosion Research Journal

Designed & Developed by : Yektaweb