Big Geospatial Data Analysis

High resolution middle eastern soil attributes mapping via open data and cloud computing

Table of Contents

Video Abstract

Motivation

At the moment, there are no detailed maps covering a large extension of the Middle East region using bare topsoil reflectance or traditional environmental controllers—e.g. geological maps, climate data, and relief attributes. Indeed, there is no work that mapped calcium carbonates from middle eastern soils, and the available maps for clay, silt, sand, organic matter, and pH present coarser resolution than the proposed in this study. Thus, we aimed to map soil attributes at 0–20 cm depth with 30 m spatial resolution in a part of the Middle East region using remote sensing data and RF algorithm and provide spatial uncertainties.

Methods

The DSM framework was implemented in this study.

Importance of covariates

Although several covariates had a low linear correlation with soil observations, they were all relatively important for DSM of topsoil attributes, ranging from 4% to 98%. Annual precipitation, the annual range of temperature, and elevation were the most important covariates (greater than31%) driving the spatial variability of topsoil attributes in the Middle East region

Performance of spatial models

In overall, the R2 values of the predictions of topsoil attributes from the testing set ranged from 0.40 to 0.66, indicating that the models explained around 40–66% of the variation present in topsoil attributes.

Mapping of soil properties

Despite considerable local spatial variation on soil attributes patterns, we observed that from northwest to southeast clay, silt and OC content gradually decreased while the amount of sand, CaCO3, and pH increased in the topsoil. This occurred across a precipitation gradient, as suggested by the correlation between soil observations and annual precipitation.

Conclusion

The DSM framework implemented in this study—e.g. the use of cloud computing freely available in GEE for covariates obtaining, bootstrapping and results assessment— overcame some limitations related to computing infrastructure for large-extent applications. In fact, the excellent quality of the soil dataset—measurements, spatial location and distribution—, the strength of covariates—high explanatory power—that accurately represented soil spatial patterns, and the RF’s consistency, were crucial to the success of this work.

Ruhollah Taghizadeh
Ruhollah Taghizadeh
Postdoc Researcher

My research interests include soil mapping, data science, and machine learning