Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps

Sahour, Hossein; Sultan, Mohamed; Vazifedan, Mehdi; Abdelmohsen, Karem; Karki, Sita; Yellich, John A.; Gebremichael, Esayas; Alshehri, Fahad; Elbayoumi, Tamer M.

doi:10.3390/rs12030533

Open AccessArticle

Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps

¹

Department of Geological and Environmental Sciences, Western Michigan University, Kalamazoo, MI 49008, USA

²

Department of Statistics, Western Michigan University, Kalamazoo, MI 49008, USA

³

Geodynamics Department, National Research Institute of Astronomy and Geophysics (NRIAG), Helwan, Cairo 11421, Egypt

⁴

Michigan Geological Survey, Western Michigan University, Kalamazoo, MI 49008, USA

⁵

Department of Geological Sciences, Texas Christian University, Fort Worth, TX 76129, USA

⁶

Department of Mathematics, North Carolina A&T State University, Greensboro, NC 27411, USA

^*

Author to whom correspondence should be addressed.

Remote Sens. 2020, 12(3), 533; https://0-doi-org.brum.beds.ac.uk/10.3390/rs12030533

Submission received: 31 December 2019 / Revised: 24 January 2020 / Accepted: 3 February 2020 / Published: 6 February 2020

(This article belongs to the Section Remote Sensing in Geology, Geomorphology and Hydrology)

Download

Browse Figures

Versions Notes

Abstract

:

The Gravity Recovery and Climate Experiment (GRACE) has been successfully used to monitor variations in terrestrial water storage (GRACE_TWS) and groundwater storage (GRACE_GWS) across the globe, yet such applications are hindered on local scales by the limited spatial resolution of GRACE data. Using the Lower Peninsula of Michigan as a test site, we developed optimum procedures to downscale GRACE Release-06 monthly mascon solutions. A four-fold exercise was conducted. Cluster analysis was performed to identify the optimum number and distribution of clusters (areas) of contiguous pixels of similar geophysical signals (GRACE_TWS time series); three clusters were identified (cluster 1: 13,700 km²; cluster 2: 59,200 km²; cluster 3: 33,100 km²; Step I). Variables (total precipitation, normalized difference vegetation index (NDVI), snow cover, streamflow, Lake Michigan level, Lake Huron level, land surface temperature, soil moisture, air temperature, and evapotranspiration (ET)), which could potentially contribute to, or correlate with, GRACE_TWS over the test site were identified, and the dataset was randomly partitioned into training (80%) and testing (20%) datasets (Step II). Multivariate regression, artificial neural network, and extreme gradient boosting techniques were applied on the training dataset for each of the identified clusters to extract relationships between the identified hydro-climatic variables and GRACE_TWS solutions on a coarser scale (13,700–33,100 km²), and were used to estimate GRACE_TWS at a spatial resolution matching that of the fine-scale (0.125° × 0.125° or 120 km²) inputs. The statistical models were evaluated by comparing the observed and modeled GRACE_TWS values using the R-squared, the Nash–Sutcliffe model efficiency coefficient (NSE), and the normalized root-mean-square error (NRMSE; Step III). Lastly, temporal variations in GRACE_GWS were extracted using outputs of land surface models and those of the optimum downscaling methodology (downscaled GRACE_TWS) (Step IV). Findings demonstrate that (1) consideration should be given to the cluster-based extreme gradient boosting technique in downscaling GRACE_TWS for local applications given their apparent enhanced performance (average value: R-squared: 0.86; NRMSE 0.37; NSE 0.86) over the multivariate regression (R-squared: 0.74; NRMSE 0.56; NSE 0.64) and artificial neural network (R-squared: 0.76; NRMSE 0.5; NSE 0.37) methods; and (2) identifying local hydrologic variables and the optimum downscaling approach for individual clusters is critical to implementing this method. The adopted method could potentially be used for groundwater management purposes on local scales in the study area and in similar settings elsewhere.

Keywords:

downscaling; GRACE; multivariate regression; artificial neural network; XGBoost

Graphical Abstract

1. Introduction

The Gravity Recovery and Climate Experiment (GRACE) is a satellite mission that was jointly implemented by the National Aeronautics and Space Administration (NASA) in the United States and the Deutschen Zentrum für Luftund Raumfahrt (DLR) in Germany to map the temporal variations in the global gravity field [1,2]. The GRACE satellites were launched in March 2002, and the GRACE Follow-On (GRACE-FO) mission was launched in May 2018; since then, their applications have resulted in advances in hydrologic sciences (e.g., [3,4,5,6,7]) in the assessment and monitoring of spatial and temporal variations in groundwater storage (GWS) in many parts of the world, including Africa [3,4,5], the Middle East [6,7], China [8], India [9], California [10], and Mexico [11]. However, such applications are hampered by the relatively low horizontal resolution of GRACE data and the fact that GRACE does not have vertical resolution [12,13]. In other words, GRACE cannot determine in which compartment (e.g., surface water, groundwater, or soil moisture) the observed mass variations are occurring.

Many studies utilizing GRACE data for hydrological research and applications (e.g., [14,15]) target large aquifers and watersheds (areas of 450 × 10³ to 6 × 10⁶ km²). However, the majority of the world’s aquifers and watersheds are much smaller; even for the larger ones, one often needs to understand the partitioning of water on the sub-basin level. A finer resolution of GRACE solutions would be a useful tool for tracking the changes in GWS (GRACE_GWS) on local scales, especially for regions that do not have sufficient in-situ monitoring sites.

Downscaling techniques allow predictions to be made at a finer spatial resolution than that of the original dataset [16]. Downscaling approaches, especially those developed for climate models and later applied to remotely acquired data, can be classified into two main groups: Dynamic downscaling and statistical downscaling [16]. The former approach has been successfully applied to downscale global climate models (GCMs) over regions of interest by integrating GCM outputs with the physical characteristics of Earth’s surface in the area of interest. Monthly GRACE terrestrial water storage (GRACE_TWS) solutions from Center for Space Research (CSR), Jet Propulsion Laboratory (JPL), and Deutsches GeoForschungsZentrum (GFZ) (spatial resolution: 150,000 km²) were assimilated into a land surface model to generate high-resolution water storage changes within the major watersheds of the Mississippi River [17]. GRACE_TWS derived from spherical harmonic (SH) solutions were assimilated into the Catchment land surface model to extract GRACE-based drought indicators (spatial resolution: 1° × 1.25°) for North America [18]. Gridded (25 km²) Advanced Microwave Scanning Radiometer–Earth Observing System (AMSR-E) data were assimilated into a fine-scale (1 km²) NOAH land surface model using three-dimensional and one-dimensional Kalman filters [19]. The JPL GRACE_TWS mascon solutions (3.0° × 3.0°) were assimilated into the fine-resolution (0.05° × 0.05°) hydrologic models [20,21]. The scale factor was used to minimize the leakage errors and to improve the spatial resolution of the JPL spherical harmonics solutions [22]. Such applications often require extensive computing time and resources that are not available for many researchers [23]. Also, many of the above-mentioned procedures depend on the selected hydrological model, some of which are lacking surface or groundwater components.

Statistical downscaling, on the other hand, does not require these resources. Statistical downscaling evaluates observed spatial and temporal relationships between inputs (independent variables) and outputs (dependent variables) using coarse-scale datasets (inputs and outputs) and applies the extracted relationships to produce the dependent variables at a spatial resolution matching that of the fine-scale inputs [24]. A variety of statistical methods have been applied to downscale remote sensing or ground-based data, including Markov chains and support vector machines [25], regression kriging [26], neural networks [27], and stochastic models [28]. Stepwise regression was successfully applied to downscale satellite-based precipitation data (TRMM3B43 products) and average daily precipitation and air temperature data from weather stations [29,30]. Artificial neural networks (ANNs) were used to downscale GCM outputs [31] and rainfall [32]. The major limitation of the statistical approaches comes from the assumption of stationarity between the coarse- and fine-scale dynamics and from the uncertainty and probability associated with this assumption [23].

Statistical approaches were successfully used to downscale GRACE data to a high-resolution (~16 km²) dataset of groundwater storage changes over a portion of California’s Central Valley using ANN techniques [27]. In this study, temporal GRACE_TWS and a series of widely available hydrologic variables were used as model inputs and target data were extracted from groundwater storage changes that were estimated from an extensive well network dataset (2189 wells). Similarly, variations in GWS were extracted and tested against temporal variations in groundwater levels in Texas, Nebraska, and Illinois [33] by using coarse-resolution (3° × 3°) GRACE_TWS JPL Release-05 monthly mass concentration (mascons; JPL RL-05M) and high-resolution hydrologic variables, and applying ANN techniques. Statistical downscaling was also used to successfully downscale GWS anomalies from 110 to 2 km in the North China Plain on both interannual and monthly scales in areas where a strong relationship between GWS and ET was detected and where the relationship was established under different spatial resolutions [34]. The successful applications in the first two examples (in California’s Central Valley, Texas, Nebraska, and Illinois) rely heavily on the availability of temporal head data from dense networks of wells, and for the latter (North China Plain) on the presence of strong relationship between GWS and ET—conditions that are not necessarily available in many of the basins worldwide. In a fourth study, temporal GRACE_TWS solutions and land surface and hydro-climatic variables were used to predict groundwater level anomaly (GWLA). A network of 32 wells (21 wells for training and 11 wells for testing) was used to establish and test the relationship between GRACE_TWS and hydro-climatic variables as input and GWLA as the response variable using a downscaling algorithm based on machine learning (ML) [35]. In many of the applied statistical downscaling approaches, including the latter study, a dense network of wells is required to establish a relationship between hydrological variables and groundwater anomalies extracted from the well data. Those methods cannot be applied in many parts of the world with limited monitoring well sites.

In this study, we applied statistical techniques to extract relationships between coarse-resolution GRACE solutions (target data) and hydrologic variables (total precipitation, normalized difference vegetation index (NDVI), snow cover, streamflow, Lake Michigan level, Lake Huron level, land surface temperature, soil moisture, air temperature, and evapotranspiration (ET). These variables could potentially correlate with, or contribute to, the temporal variations of GRACE_TWS. We used those relationships and high-resolution hydrologic variables to generate high-resolution modeled monthly GRACE solutions. The Lower Peninsula (LP) of Michigan was used as a test site. We applied and compared the findings from three statistical methods—stepwise multivariate regression (MR) models, ANN, and extreme gradient boosting (XGBoost)—to downscale GRACE data and to fill the temporal gaps in the time series data over the LP throughout the investigated period (2002–2016).

2. Overview of the Study Area

The LP (Figure 1) depends heavily on its groundwater resources to support a population of 10 million citizens, its agricultural sector (35% of its area is agricultural land [36]), and its industry [37]. The area in general, and the local communities in particular, can benefit from reliable datasets that show spatial and temporal variations in GWS at the local scale. This is especially true for the southern counties of the LP, where intensive agricultural activities are established and groundwater withdrawal rates (30 to 90 million gallons/day [38]) are the highest in the state.

Groundwater availability differs from one place to another in the LP; it is plentiful in some regions (e.g., the southwest) and less so in other areas (e.g., the southeast) [39]. The major aquifers in Michigan are largely found in (1) glacial deposits, where yields are mostly from outwash and glacio-fluvial deposits; and (2) sedimentary bedrock units, where yields come largely from the Mississippian and Pennsylvanian rocks [39]. The surface and groundwater in the area drain into the Great Lakes (Lake Michigan, Lake Huron, and Lake Erie) [40].

Nine hydrologic provinces with varying sedimentary deposits and aquifer thicknesses (Figure 2) were identified in the LP [39]. Province 1 has a relatively thin (1 to 60 m) glacial-lacustrine sand unit that overlies Silurian and Devonian limestone and dolomite. Aquifers in Silurian and Devonian rocks are the main source of groundwater in the area. Province 2 is largely covered by thick (>300 m in some areas), coarse-grained, and sandy glacial deposits, whereas province 3 is characterized by variable thicknesses (15 to 75 m) low-yield lacustrine deposits that overlie the Mississippian bedrocks. Province 4 is located in the most southern section of the LP and is characterized by thick (30 to 180 m) coarse-grained glacial deposits that overlie the low-yield Mississippian rocks; it is the main source for groundwater in the area. Province 5 glacial deposits vary in thickness (7 to 150 m) and overlie the high-yield Pennsylvanian aquifers. In this province, the thickness of the glacial aquifer thins to the south and groundwater is largely extracted from the Pennsylvanian sandstone. Province 6 is characterized by thin to moderate glacial drift that overlies high-yield Mississippian bedrock aquifers. The latter is largely composed of the Marshall Sandstone, whereas the glacial deposits are absent in some areas but thicken (up to 120 m) in others. Province 7 is characterized by thin glacial deposits (<10 m in most areas) that overlie moderate-yield Silurian and Devonian limestone and dolomite intercalated with sand and shale layers. Province 8 is characterized by low-yield, moderate to thick (15 to 120 m), and lacustrine clay deposits that overlie low-yield Devonian and Mississippian sandstone. Lastly, Province 9 consists of featureless lacustrine and low-yield glacial sand and clay deposits of variable thickness (7 to 90 m) that overlie Pennsylvanian and Mississippian sandstone aquifers [39]. In general, the vertical hydraulic conductivity of the glacial aquifer is high (9.64 × 10⁻⁷ to 3.8 × 10⁻⁵ m/day) in the southern sections compared to the northern sections (3.8 × 10⁻⁵ to 0.45 m/day; [41]).

3. Methodology

A four-fold exercise was conducted to accomplish the statistical downscaling of the GRACE-derived terrestrial water storage (GRACE_TWS) (Figure 3). First, clustering analysis was conducted on GRACE_TWS data over the LP to identify clusters of pixels, where pixels within each cluster have similar GRACE_TWS values yet are statistically different from those of neighboring clusters (Step I). Variables that correlate with and/or control GRACE_TWS were then identified (Step II). In Step III, for each cluster, relationships between coarse-scale inputs (independent variables) and outputs (GRACE_TWS; dependent variables or target) were extracted using three statistical approaches—MR, ANN, and XGBoost—to produce the dependent variables at a spatial resolution matching that of the fine-scale inputs (0.125° × 0.125° or 120 km²). In doing so, we assumed that the extracted statistical relationship between the input variables and the target applies at the finer scales as well. In this step, the adopted statistical models were compared and evaluated to select the optimum methodology. Finally, GWS variations were extracted using the outputs of the applied land surface model and outputs of the optimum downscaling method (downscaled GRACE_TWS) (Step IV).

Initially, we selected 11 variables that could correlate with, or contribute to, the temporal variations of GRACE_TWS. All input variables listed below (Section 3.2) are available at a spatial resolution ranging from 0.05° × 0.05° to 0.125° × 0.125°, and the output target values for each of the identified clusters were calculated from the gridded values within each of the clusters. Input variables were resampled to the size of their corresponding clusters for Step I and resampled to 0.125° × 0.125° (120 km²) for Step III.

We adopted three statistical approaches to model the relationships between the 11 variables and GRACE_TWS, namely, the MR, ANN, and XGboost approaches for each cluster. For each of the three approaches, the data were randomly partitioned into two subsets, training and testing. The training subset comprised 80% of the data points (percentage of the months in the time series data) and was used to construct the model, whereas the remaining 20% were used to evaluate the performance of the model. This approach was applied to each of the identified clusters.

3.1. Cluster Analysis

Working with small areas (individual pixels) introduces significant leakages from neighboring pixels. We adopted cluster analysis to identify larger areas (contiguous pixels) with similar geophysical signatures (GRACE_TWS time series) and hence reduce leakage errors. Clustering is the partitioning of the set of objects into groups in such a way that the objects within a group are more similar to each other than those in other groups. We applied K-means, one of the most popular methods for clustering analysis, to the monthly GRACE Release-06 (CSR RL06) solutions over the study area. In this method (K-means), the dataset is partitioned into K clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster [42]. The optimum number of clusters was estimated, and the area was partitioned into the identified number of clusters. The monthly values of a variable for each cluster represent the average values of all pixels within that cluster in the investigated month.

The optimal number of clusters was determined using the gap statistic implementation [43], where three steps are undertaken:

Step 1: Estimate the gap statistic using Equation (1):

Gap (k) = \frac{1}{B} \sum_{b = 1}^{B} (\log W_{k b} - \log W_{k}),

(1)

in which k is the number of clusters, B is the number of reference datasets generated using a uniform prescription, W_kb is the within-dispersion measures, and W_k is the within-cluster dispersion.

Step 2: Compute the standard deviation:

s d_{k} = {[(\frac{1}{B}) \sum_{b} {\log (W_{k b}) - \bar{l}}^{2}]}^{\frac{1}{2}},

where:

\bar{l} = (\frac{1}{B}) \sum_{b} \log (W_{k b}) .

Step 3: Define

s_{k} = s d_{k} \sqrt{1 + \frac{1}{B}}

and choose the number of clusters via:

\hat{k} = smallest k such that Gap (k) \geq Gap (k + 1) - s_{k + 1} .

3.2. Identification of Variables that Correlate with and/or Control GRACE_TWS

In this section, we briefly describe the variables and target data for the MR, ANN, and XGboost models. We selected the input variables (total precipitation, normalized difference vegetation index (NDVI), snow cover, streamflow, Lake Michigan level, Lake Huron level, land surface temperature, soil moisture, air temperature, and evapotranspiration (ET)) based on their probable correlation and/or contribution to the target (GRACE_TWS). The spatial resolution, format, and sources of these datasets are given in Table 1. A number of these variables (e.g., soil moisture, NDVI, and evapotranspiration) may be related to, or affected by, the soil characteristics and the underlying glacial aquifer parameters.

3.2.1. GRACE-Derived TWS

Three communal GRACE mascon solutions (from April 2002 through June 2016) were applied in this study and reported relative to a 2004–2009 mean baseline. The first is the GRACE CSR-RL06M solutions provided by the University of Texas Center for Space Research (UT-CSR); they were derived using Tikhonov regularization [44], and resolved on a geodesic grid (grid size: 12,000 km²) [44,45]. The second is the mascon solutions from the Jet Propulsion Laboratory (JPL-RL06M) [46] and the third is the mascon solution from NASA Goddard Space Flight Center [47].

The CSR-RL06M solutions were selected as the initial mascon solutions for extracting trends and time series over the investigated areas. The uncertainty associated with the calculated trend values were calculated from the differences in trend values extracted from the three solutions (CSR-RL06M, JPL-RL06M, and GSFC-M) [48,49] (Table 2). No post-processing and/or filtering or application of empirical scaling factors were applied [44,46,50]. SH solutions of GRACE data have been successfully applied in many studies to monitor variations in TWS on large scales [3,51]; however, its application on local scales was hindered by its coarse spatial resolution (>125,000 km²), leakage problems from adjacent pixels, and the required complex post-processing steps [52]. Compared to SH, the GRACE RL06 mascon solutions have a higher signal-to-noise ratio, higher spatial resolution, and reduced leakage from neighboring mascons that are in separate constraint regions [52,53]. The extracted trends and associated uncertainties for each of the investigated clusters are given in Table 2.

3.2.2. NDVI

We used NDVI products derived from the Moderate-resolution Imaging Spectroradiometer (MODIS) as one of the variables. NDVI uses the red and near-infrared bands, which are sensitive to healthy vegetation. The data consists of global monthly NDVI values reported at a 0.05° × 0.05° spatial resolution downloaded from Land Processes Distributed Active Archive Center’s website [54]. The uncertainties associated with MODIS NDVI products were estimated by comparing the NDVI products with that extracted from the Advanced Very-High-Resolution Radiometer (AVHRR) and from the Visible Infrared Imaging Radiometer Suite (VIRS) [55]. The reported statistical coefficients between the NDVI products of MODIS and each of the AVHRR (R-square 0.99) and VIRS (R-square 0.99) indicate high consistency of the NDVI values extracted from different sensors [55].

3.2.3. Snow Cover (SC)

The monthly average snow cover (spatial resolution: 0.05° × 0.05°) values were computed from daily snow cover observations extracted from the MODIS/Terra Snow Cover Daily L3 Global 0.05Deg Climate Modeling Grid dataset. The normalized difference snow index—an index that is sensitive to the high reflectance over snow-covered lands in the visible wavelength region and low reflectance in the shortwave infrared regions—was used to identify snow-covered land. The monthly averages are calculated from the corresponding 28 to 31 days of observations in the daily maximum snow cover extent data. The MODIS monthly snow cover data were downloaded from NSIDC’s website [56].

3.2.4. Stream Flow (SF)

The stream flow data were obtained from the United States Geological Survey (USGS). A good correspondence between GRACE_TWS and streamflow was noted in previous studies [57,58,59]. We used monthly mean values, which are average monthly values of average daily streamflow for each of the nine gauge stations. In the selection of the gauge stations, preference was made to the gauges on the main river, as observations from such locations are more likely to represent the overall fluctuations of runoff within the investigated cluster. The selected gauge stations are located in the following rivers: Clinton River in Sterling, Sable River near Red Oak, Pine River near Midland, Grand River in Lansing, St. Joseph River in Niles, Boardman River near Mayfield, Muskegon River near Croton, Kalamazoo River near Battle Creek, and St. Joseph River in Elkhart (Figure 1). The monthly streamflow data were downloaded from the USGS National Water Information System’s web interface [60].

3.2.5. Lake Levels (LL)

Average monthly water level data for Lake Michigan and Lake Huron were obtained from the NOAA’s Center for Operational Oceanographic Products and Services. The Lake Michigan water levels were obtained from the Holland station and the Lake Huron levels from the Harbor Beach station. The former station is located along Lake Michigan’s eastern shoreline and the latter along Lake Huron’s southwestern shoreline (Figure 1). At both stations, water levels are measured every six minutes. The average monthly water level data were downloaded from NOAA’s website [61].

3.2.6. Land Surface Temperature (LST)

The monthly MODIS/Terra land surface temperature (LST) is derived by averaging the daily values of MOD11C3 products. These products have been validated through a series of field studies by the MODIS land team [62]. The spatial resolution of the monthly LST is 0.05° × 0.05° and the data were obtained from the NASA’s Earth Science Data Systems available at [62]. The performance of the MODIS LST product was evaluated in several locations in the USA; results showed a good correspondence (absolute biases < 0.8 °C and RMSEs < than 1.7 °C) between the in-situ measurements and MODIS LST products [63].

3.2.7. Rainfall, Snow Water Equivalent, Soil Moisture, Air Temperature, and Evapotranspiration

The NASA’s North American Land Data Assimilation System (NLDAS) NOAH model outputs (total monthly rainfall, snow water equivalent (SWE), soil moisture (SMS), air temperature (AT), and evapotranspiration (ET); spatial resolution: 0.125° × 0.125°) were used as inputs to our models. These data are produced from daily ground-based precipitation analysis, bias-corrected shortwave radiation, and surface meteorology re-analyses to drive land surface models [64]. The average monthly data used in this research were downloaded from NASA Goddard Earth Sciences Data and Information Services Center’s website at [65].

The monthly total precipitation was computed from the sum of the monthly SWE and rainfall. The outputs of NLDAS products have been validated and enhanced through several research works (e.g., [66,67,68]). The accuracy of the NLDAS products varies from one product to another and from one location to another. For instance, the uncertainty for ET products was reported to be 48-mm per month over the contiguous United States [66]. Comparing the NLDAS products with in-situ measurements showed that the performance of the soil moisture product over the contiguous United States was very high (RMSE = 0.02 to 0.11) [69]. The uncertainty of the NLDAS snow water equivalent is less than 20% based on comparisons with IMS (MultiSensor Snow and Ice Mapping System) observations [70]. The uncertainties in air temperature over Michigan were estimated by comparing the monthly mean values between NLDAS and the National Centers for Environmental Prediction (NCEP) products. The reported difference was <0.4 °C [71]. The NLDAS precipitation data were compared with five other available datasets over the western United States and the mean relative difference between them ranged from 11% to 18% [72].

3.3. Construction, Evaluation, and Selection of an Optimum Model for Downscaling

For each cluster, all datasets (variables and targets) were randomly partitioned into two groups: Training (80% of the time series for each cluster) and testing (20% of the time series for each cluster). We constructed and applied three statistical models (MR, ANN, and XGBoost) to establish the relationships between the variables (predictors) and GRACE (the target) for each of the investigated clusters. The performance of each model was compared and evaluated. The evaluation of the models was carried out by comparison between the observed values (testing subset) and predicted values using the coefficient of determination (R-squared), the normalized root-mean-square error (NRMSE), and the Nash–Sutcliffe model efficiency coefficient (NSE) (Table 3) [73]. The R-squared values range from 0 to 1; those for the NRMSE and NSE indices range from 0 to 1. The predictive power of the models increases with increasing R-squared and NSE values and with decreasing NRMSE values. The rate of the performance for each approach was based on classifications adopted by [74]. Following the identification of the statistical model that yielded the highest performance, we used the selected model to downscale the GRACE solutions for each of the clusters to 0.125° × 0.125° throughout the investigated period.

3.3.1. MR Models

The MR, or multiple linear regression method, derives patterns in the data and establishes the best fitting multivariate linear relationships between two or more dependent variables and the target (GRACE_TWS). As described earlier, we applied a stepwise MR method in which the selection of variables is carried out by addition to, or subtraction from, a set of dependent variables using some pre-specified coefficients, such as the F-test, the t-test, and the coefficient of partial determination. In an MR model, every value of the independent variable, X, is associated with a value of the target variable, Y. The regression line for n independent variables X₁, X₂,…, X_n can be explained as follows:

Y = B₀ + B₁X₁ + B₂X₂ +…+ B_nX_n

(2)

where Y is the predicted value of the target variable, B₀ is the value of Y when all of the independent variables are equal to zero, X₁ through to X_n are independent variables, and B₁ through to B_n are the estimated regression coefficients. In multivariate linear regression, the response variable, Y (GRACE_TWS in our work), is assumed to be linearly related to a set of n explanatory independent variables, X₁, X₂,…, X_n, and the independent variables are not highly correlated with each other. Observations are selected independently and randomly from the population. Also, residuals are assumed to be normally distributed with a mean value of 0.

The parameters are trained in such a way to achieve the highest similarity between the modeled and observed values in the training data set. One optimization model is employed to minimize the sum of the squares of the vertical deviations from each data point to the regression equation. The ideal case is a model in which a data point lies completely on the fitted line (i.e., vertical deviation = zero).

We applied a stepwise multivariate regression approach. Stepwise regression fits the multivariate regression several times, each time removing the least correlated variable until the statistically significant variables are left. For a full description of variable selection in the stepwise method, see [75]. MR models were first developed for each of the identified clusters to establish linear relationships between coarse-resolution inputs (variables) and target (GRACE_TWS) values for each of the identified clusters.

All input variables were available in both coarse and fine resolutions, whereas GRACE_TWS values were available in coarse resolution only. The variables were resampled to the size of each cluster using bilinear resampling techniques.

3.3.2. ANN

The ANN method establishes empirical, possibly non-linear, relationships between a set of “input” variables and corresponding “target” variables. An ANN is based on a series of connected units or neurons, which are intended to replicate the functions of neurons in animal or human brains; they pass information between one another, a structure that enables ANNs to be trained and learn. The ANN method used in this study is known as multilayer perceptron (MLP). An MLP consists of units called perceptrons. Perceptrons have one or more inputs, an activation function, and an output. An MLP model is built up by combining perceptrons in structured layers. The perceptrons in a given layer are independent of each other, but each connects to all of the perceptrons in the following layer. Each layer is composed of a set of neurons and is trained with a back-propagation algorithm.

Backpropagation is one of the most extensively used algorithms for supervised training of multilayered neural networks [76,77,78]. Backpropagation works by approximating the non-linear relationship between the input and the target by altering the weight values internally. The processes of the backpropagation can be divided into two stages: Feedforward and backpropagation. In the feedforward step, a pattern is applied to the input layer, and its effect propagates, layer by layer, through the network until the output is generated. The network’s sample output value is then compared to the anticipated value for a given input, and an error signal is estimated for each of the output neurons. Since all neurons within the hidden layer contributed to the signal errors in the output layer, the output errors are transmitted backward from the output layer to each neuron within the hidden layer that contributed to the output layer. This process is then reiterated, layer by layer, until each neuron in the network has received an error signal that represents its relative contribution to the total error. Once the error signal for each neuron has been computed, the errors are then applied by the neuron to adjust the values for each connection weight. The goal is to minimize the value of the error function in weight space. The weights with minimum error functions are then considered to be a solution to the learning problem. In an ANN, the hyperparameter is a parameter whose value is set before the learning process begins, and it controls the model structure (eg., number of layers, number of hidden neurons, number of epochs). Additional information about the theory behind ANN applications can be found in [79].

In our study, we applied a trial and error technique to determine the optimal number of hyperparameters, where the numbers were added gradually until the predicted and observed values start to match by evaluating the model performance using the mean squared normalized error (MSE) performance function. In our study, individual ANNs were constructed for each cluster. In all three clusters, the ANNs consist of one input layer, one hidden layer, and one output layer. The number of hidden neurons in our study ranged from 12 to 18. The number of epochs is the number of times the entire training data are used to update the weights. In other words, it is the number of times that the backpropagation algorithm works through the entire training dataset. The number of epochs ranged from 350 to 500.

The final model evaluation was carried out by the comparison between observed and predicted values on testing data (out of sample data set) using the above-mentioned statistical coefficients (NRMSE and NSE).

3.3.3. XGBoost

Gradient boosting was used with decision trees. Decision tree learning is a predictive modeling approach in machine learning that uses a tree-like model to go from observations of predictors (branches of the tree) to the prediction of the target value (leaves of the tree). The goal of our study was to create a model to predict the GRACE_TWS values from sets of input variables for each month. Using trees has several advantages including, but not limited to, the ability to handle various types of target variables (e.g., numerical, categorical, and multivariate), modeling complex interactions, and managing missing values with minimal loss of information [80]. However, there are two main limitations with trees: Weakness of the prediction and difficulty in the interpretation of large trees [81]. To overcome these limitations, the gradient boosting algorithm was introduced by [81] and developed by many others (e.g., [82,83]).

In gradient boosting, the goal is to use a set of predictors (X₁,…, X_n) to predict a set of target data (Y₁,…,Y_n) by fitting a model

F (X) \to Y

and minimizing the sum of the loss function

J = \sum_{i = 1}^{n} L (Y_{i}, F (X_{i}))

by improving the model F(X) (in our work, the loss function,

L (x, y) = {(x - y)}^{2}

). Then, the following iteration is performed:

Calculate the negative gradients of J with respect to F(X_i), which is $- \frac{\partial J}{\partial F (X_{i})}$ .
Fit a regression tree, $h$ , to negative gradients $- \frac{\partial J}{\partial F (X_{i})}$ .
Let our new F(X_i) be F(X_i) + $γ h$ , where $γ$ is the step size in our algorithm to reach the estimated minimum of $J$ .

As a significant improvement over gradient boosting, in XGBoost, we start with a loss function

L (Y_{i}, F (X_{i}) + h)

and minimize

J = \sum_{i = 1}^{n} L (Y_{i}, F (X_{i}) + h) + Ω (h)

, where

Ω (h) = γ T + \frac{1}{2} λ {| | w | |}^{2}

. Here,

T

is the number of leaves in the tree and

w

is the leaf weights. Figure 4 shows a schematic diagram of the gradient boosting method.

3.3.4. Selection of Optimum Statistical Model and Gap Filling

The performance of each of the three models was evaluated by comparing the predicted values with the observed values on the testing subset using R-squared, normalized root mean squared error (NRMSE) (Equations (3) and (4)), and Nash–Sutcliffe efficiency (NSE) (Equation (5)) as follows:

R M S E = \sqrt{\sum_{i = 1}^{n} \frac{{(Y_{o} - Ŷ_{p})}^{2}}{n}},

(3)

N R M S E = \frac{R M S E}{{\bar{Y}}_{o i}},

(4)

N S E = 1 - \frac{\sum_{i = 1}^{n} {(Ŷ_{p} - Y_{o})}^{2}}{\sum_{i = 1}^{n} {(Y_{o} - {\bar{Y}}_{o i})}^{2}},

(5)

where

Y_{o}

is the observed value,

Ŷ_{p}

is the predicted value, n is the number of observations, and

{\bar{Y}}_{o i}

is the mean of the observed data.

The model that produced the highest R-squared and NSE and the lowest NRMSE value was selected. Using the optimum model, the relationships were established between input variables (total precipitation, NDVI, snow cover, streamflow, Lake Michigan level, Lake Huron level, soil moisture, air temperature, LST, and ET) and GRACE_TWS as the target variable. These relationships were used to estimate the missing GRACE_TWS months.

3.4. Extraction of Temporal Groundwater Storage Using Outputs of Land Surface Models

We used the downscaled GRACE data to extract for each of the 0.125° × 0.125° pixels the changes in GRACE_TWS (∆GRACE_TWS) and the secular trend for each of these pixels (Figure 5). Then, changes in groundwater storage (∆GWS) were calculated using the downscaled ∆GRACE_TWS and outputs of land surface models (NLDAS NOAH) and applying the following equation (Equation (6)):

∆GWS = ∆GRACE_TWS − (∆SMS_NLDAS + ∆CWS_NLDAS+ ∆SWE_NLDAS),

(6)

where ∆SMS_NLDAS, ∆CWS_NLDAS, and ∆SWE_NLDAS are the changes in soil moisture, canopy water storage, and snow water equivalent, respectively, as extracted from the NLDAS model. All these data are provided in a spatial resolution of 0.125° × 0.125° (~120 km²).

Sources and Propagation of Errors

The uncertainties in the GRACE_TWS trends reflect the variations between trend values that were extracted from three GRACE solutions (CSR-RL06M, JPL-RL06M, and GSFC-M) (Table 2; [48,49]).

The uncertainties in the downscaled GRACE_TWS are related to: (1) Uncertainties in the variables (remote sensing-based and land surface model-based) that were used as inputs to the statistical models, and (2) errors introduced by the applied statistical models. The statistical coefficients (R-square, NSE, and NRMSE) describe the accuracy of the extracted model, namely the degree to which the extracted statistical model can or cannot predict the target (GRACE_TWS in our case). Statistical models that have R-square, NSE, and NRMSE values of 0.9, 0.9, and 0.1, respectively, can predict the target with an accuracy of 90%. Since the reported accuracy of the models was estimated by comparing the modeled and observed GRACE_TWS values in the test dataset, the reported model errors incorporate the errors associated with the individual variables as well. In this respect, the coefficients could be used to estimate the errors introduced by both the variables and the statistical models [35]. The combined errors were estimated by averaging the three coefficients and are presented in Table 4 as the percent uncertainty of the output (GRACE_TWS).

We assumed that the model-based accuracy of GRACE_TWS in area A applies to all downscaled pixels within this area. Similarly, the GRACE_TWS of the downscaled pixels within areas B and C will inherit the estimated accuracy for areas B and C, respectively. These are reasonable assumptions given that all of the CSR06M pixels within each of areas A, B, and C have similar geophysical signals.

The errors (uncertainties) associated with the estimated downscaled GRACE_GWS values were propagated from the estimated errors in the GRACE_TWS, and in the land surface model outputs (SMS_NLDAS, CWS_NLDAS, and SWE_NLDAS) which were used in calculating changes in groundwater storage (Equation (7)). The errors in each of these land surface model outputs were calculated as the standard deviation of the values extracted from the three NLDAS simulations (NOAH, VIC, and mosaic [84,85]. The errors in the estimated GWS (σGWS) were calculated by adding, in quadrature, the uncertainties related to GRACE_TWS, SMS_NLDAS, CWS_NLDAS, and SWE_NLDAS values (Equation (7)). The estimated total error rate for fine-scale GWS in three arbitrary pixels (location given in Figure 1) is presented in Figure 6:

σ_{G W S} = \sqrt{{(σ_{T W S})}^{2} + {(σ_{S M S})}^{2} + {(σ_{S W E})}^{2} + {(σ_{C W S})}^{2}} .

(7)

4. Results

4.1. Cluster Analysis

The optimum number of clusters was estimated at 3. Three clusters were identified (cluster 1 area: 13,700 km², cluster 2 area: 59,200 km², cluster 3 area: 33,100 km²; Figure 1). The correlation coefficients of the GRACE time series between clusters were evaluated through the construction of a correlation matrix (Table 5). The correlation coefficients of the GRACE time series between clusters varies from 0.41 to 0.66, and those between clusters and the Michigan lake level varies from 0.43 to 0.74. In the generation of the correlation matrix, the secular trends and seasonal cycles were removed from the time series. Although we cannot rule out leakage from the adjacent water bodies and areas, we suggest that there are significant geophysical signals in each of the investigated areas, as evidenced by the following observations. First, higher correlation coefficients than those observed (0.41–0.66) are to be expected between GRACE_TWS over areas 1, 2, and 3 if the leakage was significant. Second, lake levels lag behind GRACE_TWS by 1 to 2 months for areas 1, 2, and 3 (Figure 7).

4.2. Evaluation and Comparison of the Models

Comparison of the performance of the three models revealed that, in general, the XGBoost models perform better than the other two models (Table 4) as indicated by their lower NRMSE values and their higher NSE and R-squared values and ranking. For example, the R-squared, NRMSE, and NSE values for the XGBoost models ranged from 0.84 to 0.88 (average: 0.86), 0.35 to 0.40 (average: 0.37), and 0.84 to 0.87 (average: 0.86), respectively; those for the ANN models ranged from 0.6 to 0.86 (average: 0.76), 0.40 to 0.85 (average: 0.56), and from 0.25 to 0.82 (average: 0.64), respectively; and those for the MR models from 0.72 to 0.85 (average: 0.77), 0.4 to 0.62 (average: 0.5), and from 0.6 to 0.83 (average: 0.73), respectively. The performance of the XGBoost is high (clusters 1, 2, and 3: Very good), compared to that for MR models (clusters 2 and 3: Very good; cluster 1: Good) and for the unified ANN model (clusters 2 and 3: Very good; cluster 1: Unsatisfactory). One plausible explanation for the enhanced performance of the XGBoost models over the ANN and MR models is that it can better account for the specific characteristics or significant variables that control, or relate to, the observed temporal GRACE_TWS solutions in each cluster. It is flexible and performs well with categorical and numerical values [84], as is the case with our datasets (Figure 8).

4.3. Factors Controlling the TWS and GWS Variations over the Study Area

Seven out of 11 variables showed statistical significance with the GRACE_TW values in the XGboost models (Table 6). They were used for the downscaling process based on cluster-based XGBoost models. Those variables are ET, air temperature, NDVI, total precipitation, soil moisture, Lake Michigan water level, and streamflow. The significance of the variables is determined by their p-values. The insignificant variables were omitted due to their high p-value (probability value). The p-value represents the probability of the occurrence of a given event and helps determine the significance of the results. The higher p-values (typically > 0.05) indicate weak evidence against the null hypothesis (i.e., there is no significant relationship between the independent variable and the target, and therefore the variable is insignificant [86]). The smaller p-values (typically ≤ 0.05) indicate the opposite: There is strong evidence in favor of the alternative hypothesis, and there is a significant relationship between the independent variable and the target.

Multicollinearity, a condition in which two or more predictors are highly correlated with one another in linear regression models, was addressed using the variance inflation factor (VIF). Multicollinearity makes it difficult to determine the effect of the individual predictors on the response and to identify the variables to be included in the model. VIF is one of the most widely used diagnostic indices for multicollinearity [87]. It estimates how much the variance of a coefficient is “inflated” because of linear dependences with other predictors [87]. Using a VIF value of 11 in this study, one of two variables (Lake Michigan and Lake Huron water levels) that show multicollinearity was omitted. Lake Huron lake level was automatically removed from the set of individual variables in the stepwise procedure and was not used in our models. Five lag times (months 1 through 5) were assigned to each of the investigated variables to identify the optimum lag time for the individual variables. Four of the examined variables (total precipitation, temperature, ET, and NDVI) were found to have optimum lag times ranging from 1 to 3 months; none exceeded 3 months, and the remaining variables had no lag times (Table 6). The optimal lag time was found to vary from one cluster to another; for example, the lag in total precipitation varied from 1 month in cluster 1 to 3 months in cluster 2. Again, less significant lag times for an individual variable were automatically removed throughout the application of the stepwise regression.

The significant variables are the ones that correlate well with, respond fairly quickly to, and either drive or are driven by the variations in GRACE_TWS. An increase in soil moisture and stream flow over a cluster will increase its GRACE_TWS values, whereas an increase in land surface temperature or ET will probably decrease its TWS values. Interestingly, lake levels correlated well with GRACE_TWS, which is to be expected given that both the land (clusters 1–3) and surrounding water bodies (Lakes Michigan and Huron) receive added water contributions (precipitation and SWE), which will increase the water levels in the lakes and increase the GRACE_TWS over the land. However, the lake water levels lagged behind GRACE_TWS by 1 to 2 months. We suspect that this lag time is related to the time period it takes for runoff to reach the lake. Starting in 2013, there has been an increase in the water level in both Lake Huron and Lake Michigan. A thorough investigation revealed that the recent rise in the water level in Lake Michigan-Huron is due to above-average spring runoff, which drains into the lakes, and excess precipitation over the lake as well [88].

5. Discussion

The original size of the pixels over the LP (irregular grid, pixel size ~12,000 km²) is coarse for monitoring TWS and GWS on the county scales (size range: 500 (e.g., St. Joseph County) to 850 km² (e.g., Kent County)). The adopted downscaling technique addresses this issue through the generation of downscaled GRACE_TWS and GRACE_GWS (spatial resolution: 0.125° × 0.125°; 10 × 14 km = 140 km²).

Inspection of the secular trends in GRACE_TWS and GRACE_GWS revealed two general patterns, a near-steady state in GRACE_TWS and GRACE_GWS (−1 to +1 mm/year) for an earlier period (2002 to 2012), hereafter referred to as period I, followed by an increase in GRACE_TWS (28 to 120 mm/year) and GRACE_GWS (10 to 130 mm/year) for a later period (2013 to 2016), hereafter referred to as period II (Figure 9).

The breakpoints were identified using the regime shift detection method with a 95% confidence [89]. The two above-mentioned general patterns were observed throughout the entire investigated area. For all of the downscaled pixels, no major differences in GRACE_TWS and GRACE_GWS trends were observed during period I, all of which show near-steady trends. However, distinct variations in the GRACE_TWS and GRACE_GWS trend values are observed in period II between the three clusters (Figure 9). Cluster 1 (represented by point 1; Figure 9) has the highest GRACE_TWS (103 to 122 mm/year) and GRACE_GWS (100 to 130 mm/year) trends, followed by cluster 2 (represented by point 2), with a TWS trend of 50 to 57 mm/year and GWS trend of 45 to 70 mm/year. Clusters 1 and 2 are located within areas characterized by the highest average SWE (60 to 190 mm/year) and the highest average annual rainfall (800 to 1043 mm/year), respectively. Cluster 3 is located in the southern and southwestern parts of the LP, areas that are characterized by high groundwater extraction. This cluster (represented by point 3) shows a TWS trend of 28 to 55 mm/year and a GWS trend of 10 to 55 mm/year.

The glacial aquifers are widely distributed, they overlie all other aquifers, and crop out across large sectors in the state, and thus one would expect that the observed variations in GRACE_GWS are largely controlled by variations in glacial aquifer storage. Clusters 1 and 2 are located in the northern and central sections of province 2 ( Figure 1 and Figure 2) where the glacial deposit is relatively thick, whereas cluster 3 is located in the southwestern section of the LP, an area characterized by high groundwater withdrawal for agricultural activities (Figure 1). The eastern part of cluster 3 is located in an area characterized by low yield (Figure 2) and thin to moderate glacial deposits (refer to Section 2). Also, in general, the northern, but not the southern sections, of the LP have high vertical conductivity and low groundwater extraction rates [41]. The average annual rainfall over the entire LP increased from 774 mm/year (period I) to 783 mm/year (period II) and the average annual SWE increased from 50 (period I) to 55 mm/year (period II) (Figure 10). For cluster 1, the average annual rainfall increased from 689 (period I) to 723 mm/year (period II) and the average annual SWE increased from 44 to 75 mm/year. Similarly, for cluster 2, the average annual rainfall increased from 761 (period I) to 785 mm/year (period II) and the average annual SWE increased from 52 to 56 mm/year in periods I and II, respectively. For cluster 3, the average annual SWE increased from 36 (period I) to 44 mm/year (period II), but the average annual rainfall decreased from 834 (period I) to 823 mm/year (period II). These collective observations suggest that the observed steep GRACE_TWS and GRACE_GWS trends over the northern sections of the LP during period II could be related to one or more of these factors: (1) Thickened glacial deposit, (2) high precipitation and/or snow fall rates, (3) high vertical conductivity [41], and (4) low extraction rates.

One would expect the above-mentioned temporal variations in precipitation and SWE to be reflected in the downscaled GRACE_GWS and groundwater levels. Figure 11 shows an overall correspondence between the downscaled GRACE_GWS data and groundwater levels from three monitoring wells (well A: site name 02S 11W 22CDBB 01, location Kalamazoo; well B: site name 04N 02W 26BBDB 01, location Lansing; and well C: site name 04N 02W 16DAAA 01, location Lansing; Figure 1) within each of the downscaled GRACE_GWS pixels. One should not expect a one to one correspondence between the two datasets. The groundwater levels, but not the GWS, are affected by groundwater withdrawal and by the lag time (between precipitation and recharge). Unfortunately, only a few of the monitoring wells, all located in Kalamazoo and Lansing, have continuous measurements throughout the investigated period and across the study area, none of which are located in the central or northern LP (Figure 1). The correlation coefficient between the downscaled GWS (0.125° × 0.125°) and the observed groundwater level in wells A, B, and C were calculated at 0.4, 0.55, and 0.32, respectively, higher than those between the original GRACE_GWS and the wells (A: 0.14; B: 0.36; and C: 0.05).

We also compared the time series for surface water levels from two inland lakes (Otsego lake in northern LP and Austin lake in southern LP; Figure 1) to downscaled GRACE_TWS time series in areas (pixels) proximal to these lakes (Figure 12). The surface water lake levels approximate the groundwater table in the surrounding areas and thus, the changes in lake level should be indicative of the temporal variations in GRACE_GWS [90,91]. Figure 12 shows a good correspondence between the downscaled GRACE_GWS and Otsego lake level (correlation coefficient: 0.73) and Austin lake level (correlation coefficient: 0.75), an observation that further validates the adopted downscaling procedures.

6. Conclusions

The GRACE data has been widely used to monitor the temporal and spatial variations in TWS and GWS on large scales. However, such applications remained limited on the local scales due to the poor spatial resolution (irregular grid of 12,000 km²) of the GRACE data. The objective of this study was to address this shortcoming by downscaling the CSR mascon solutions to a finer resolution (0.125° × 0.125°) to enable monitoring of GRACE_TWS and GRACE_GWS on county scales and fill the gaps for missing months in the GRACE time series over the study area. Using cluster analysis, areas of similar GRACE_TWS patterns within the study area were first identified. For each of the identified clusters, variables (total precipitation, NDVI, snow cover, streamflow, Lake Michigan water level, Lake Huron water level, soil moisture, land surface temperature, and ET) that presumably contributed to, or were correlated with, the GRACE data were identified and collected on a monthly basis over the investigated period (2002 to 2016). The data sets were randomly partitioned into two groups: Training data (80%) and testing data (20%). XGBoost, MR, and ANN methods were applied to extract statistical relationships between the independent variables and the GRACE_TWS (dependent variable).

The comparisons of the observed GRACE_TWS (training dataset) versus the modeled GRACE_TWS values showed that the XGBoost method outperformed the other two methods as indicated by their lower NRMSE and higher NSE values compared to those obtained from the MR and ANN models. The unified approaches have the advantage of providing adequate overall downscaling results over large areas, yet one would expect the performance of the model to vary from one area to another given that the selected variables and/or their significance is likely to vary across the investigated area. We suggest that if statistical downscaling methods were selected for downscaling GRACE_TWS values on local scales, preference should be given to cluster-based approaches over the commonly used unified approaches.

The XGBoost model was used to downscale (12,000 to 120 km²) GRACE_TWS given the high performance of this model over all other models (ANN and MR) and its ability to estimate the contributions of the independent variables towards the response variable and to forecast missing months within the GRACE’s time series data. Although the XGBoost model outperformed the ANN method in all three clusters in our study area, that might not necessarily be the case over other locations. We suggest that one should explore the use of multiple statistical approaches and select the one that performs the best over each of the investigated areas (clusters).

Since the individual variables and the degree to which they correlate with GRACE_TWS vary from one cluster to another, it is recommended to identify the local hydrologic components that are specific to the investigated area and to select the optimum cluster-based model to improve the accuracy of the downscaled GRACE_TWS values. The accuracy of the derived downscaled GRACE_GWS will largely depend on the accuracy of the land surface model outputs that were used in calculating GRACE_GWS, namely the SMS_NLDAS and SWE_NLDAS in our case. Unfortunately, the State of Michigan lacks a comprehensive groundwater monitoring program to validate the downscaled GRACE_GWS data adequately.

As discussed earlier, we cannot rule out leakage from the adjacent water bodies and/or areas, but we suggest that there are significant geophysical signals from each of the investigated areas (clusters) as evidenced by the modest correlation coefficients between the time series of areas 1, 2, and 3 and by the lag of lake levels by 2– months behind the GRACE_TWS over the land areas. Currently, efforts are underway to generate GRACE_TWS of higher spatial resolution (1° × 1°) by NASA JPL, through combining satellite gravimetry and in-situ GNSS measurements [92]. If and when such data become available, we can apply the proposed methodologies on the individual pixels without worrying about the leakage from their surroundings.

We developed a straightforward methodology that could be used to monitor temporal variations in GRACE_TWS and GRACE_GWS on local scales (county levels). The methodology takes advantage of readily available remote sensing datasets and outputs of land surface models that are of global nature, both of which come at no cost to users. These methodologies could be used by local communities and decision makers for water management purposes in the State of Michigan. They can also provide a replicable model for local applications across the continental USA and possibly in similar settings worldwide as well. The performance of the statistical models can be enhanced by identifying and including local variables that control, or correlate with, the GRACE_TWS solutions over the investigated areas.

Author Contributions

Conceptualization, H.S. and M.S.; Data curation, H.S., M.V., K.A., S.K. and F.A.; Formal analysis, H.S., M.V.; Funding acquisition, M.S. and J.A.Y.; Methodology, H.S., M.S., M.V., and T.M.E.; Resources, M.S., and J.A.Y.; Software, H.S., M.V., K.A., S.K., E.G. and F.A.; Supervision, M.S.; Writing—original draft, H.S., M.S., J.A.Y. and E.G.; Writing—review & editing, H.S. and M.S.. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Aeronautics and Space Administration (NASA) grants NNX12AJ94G and 80NSSC18K1681 to Western Michigan University and by the Michigan Geological Survey.

Conflicts of Interest

The authors declare no conflict of interest.

References

Tapley, B.D.; Bettadpur, S.; Ries, J.C.; Thompson, P.F.; Watkins, M.M. GRACE measurements of mass variability in the earth system. Science 2004, 305, 503–505. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Tapley, B.D.; Bettadpur, S.; Watkins, M.; Reigber, C. The Gravity Recovery and Climate Experiment: Mission Overview and Early Results. Geophys. Res. Lett. 2004, 31, L09607. [Google Scholar] [CrossRef] [Green Version]
Ahmed, M.; Sultan, M.; Wahr, J.; Yan, E. The Use of GRACE Data to Monitor Natural and Anthropogenic Induced Variations in Water Availability across Africa. Earth-Sci. Rev. 2014, 136, 289–300. [Google Scholar] [CrossRef]
Abdelmohsen, K.; Sultan, M.; Ahmed, M.; Save, H.; Elkaliouby, B.; Emil, M.; Yan, E.; Abotalib, A.Z.; Krishnamurthy, R.V.; Abdelmalik, K. Response of Deep Aquifers to Climate Variability. Sci. Total Environ. 2019, 677, 530–544. [Google Scholar] [CrossRef]
Abdelmalik, K.W.; Abdelmohsen, K. GRACE and TRMM Mission: The Role of Remote Sensing Techniques for Monitoring Spatio-Temporal Change in Total Water Mass, Nile Basin. J. Afr. Earth Sci. 2019, 160. [Google Scholar] [CrossRef]
Othman, A.; Sultan, M.; Becker, R.; Alsefry, S.; Alharbi, T.; Gebremichael, E.; Alharbi, H.; Abdelmohsen, K. Use of Geophysical and Remote Sensing Data for Assessment of Aquifer Depletion and Related Land Deformation. Surv. Geophys. 2018, 39, 543–566. [Google Scholar] [CrossRef] [Green Version]
Sultan, M.; Sturchio, N.C.; Alsefry, S.; Emil, M.K.; Ahmed, M.; Abdelmohsen, K.; AbuAbdullah, M.M.; Yan, E.; Save, H.; Alharbi, T.; et al. Assessment of Age, Origin, and Sustainability of Fossil Aquifers: A Geochemical and Remote Sensing-Based Approach. J. Hydrol. 2019, 576, 325–341. [Google Scholar] [CrossRef]
Feng, W.; Zhong, M.; Lemoine, J.-M.; Biancale, R.; Hsu, H.-T.; Xia, J. Evaluation of Groundwater Depletion in North China Using the Gravity Recovery and Climate Experiment (GRACE) Data and Ground-Based Measurements. Water Resour. Res. 2013, 49, 2110–2118. [Google Scholar] [CrossRef]
Rodell, M.; Velicogna, I.; Famiglietti, J.S. Satellite-Based Estimates of Groundwater Depletion in India. Nature 2009, 460, 999–1002. [Google Scholar] [CrossRef] [Green Version]
Scanlon, B.R.; Longuevergne, L.; Long, D. Ground Referencing GRACE Satellite Estimates of Groundwater Storage Changes in the California Central Valley, USA. Water Resour. Res. 2012, 48, W04520. [Google Scholar] [CrossRef] [Green Version]
Castellazzi, P.; Longuevergne, L.; Martel, R.; Rivera, A.; Brouard, C.; Chaussard, E. Quantitative Mapping of Groundwater Depletion at the Water Management Scale Using a Combined GRACE/InSAR Approach. Remote Sens. Environ. 2018, 205, 408–418. [Google Scholar] [CrossRef]
Wahr, J.; Swenson, S.; Zlotnicki, V.; Velicogna, I. Time-Variable Gravity from GRACE: First Results. Geophys. Res. Lett. 2004, 31, L11501. [Google Scholar] [CrossRef] [Green Version]
Wahr, J.; Swenson, S.; Velicogna, I. Accuracy of GRACE Mass Estimates. Geophys. Res. Lett. 2006, 33, L06401. [Google Scholar] [CrossRef] [Green Version]
Rodell, M.; Houser, P.R.; Jambor, U.; Gottschalck, J.; Mitchell, K.; Meng, C.-J.; Arsenault, K.; Cosgrove, B.; Radakovich, J.; Bosilovich, M.; et al. The Global Land Data Assimilation System. Bull. Am. Meteorol. Soc. 2004, 85, 381–394. [Google Scholar] [CrossRef] [Green Version]
Chen, J.L.; Rodell, M.; Wilson, C.R.; Famiglietti, J.S. Low Degree Spherical Harmonic Influences on Gravity Recovery and Climate Experiment (GRACE) Water Storage Estimates. Geophys. Res. Lett. 2005, 32, L14405. [Google Scholar] [CrossRef] [Green Version]
Atkinson, P.M. Downscaling in Remote Sensing. Int. J. Appl. Earth Obs. Geoinf. 2013, 22, 106–114. [Google Scholar] [CrossRef]
Zaitchik, B.F.; Rodell, M.; Reichle, R.H.; Zaitchik, B.F.; Rodell, M.; Reichle, R.H. Assimilation of GRACE Terrestrial Water Storage Data into a Land Surface Model: Results for the Mississippi River Basin. J. Hydrometeorol. 2008, 9, 535–548. [Google Scholar] [CrossRef]
Houborg, R.; Rodell, M.; Li, B.; Reichle, R.; Zaitchik, B.F. Drought Indicators Based on Model-Assimilated Gravity Recovery and Climate Experiment (GRACE) Terrestrial Water Storage Observations. Water Resour. Res. 2012, 48, W07525. [Google Scholar] [CrossRef] [Green Version]
Sahoo, A.K.; De Lannoy, G.J.M.; Reichle, R.H.; Houser, P.R. Assimilation and Downscaling of Satellite Observed Soil Moisture over the Little River Experimental Watershed in Georgia, USA. Adv. Water Resour. 2013, 52, 19–33. [Google Scholar] [CrossRef]
Shokri, A.; Walker, J.P.; Dijk, A.I.J.M.; Pauwels, V.R.N. On the Use of Adaptive Ensemble Kalman Filtering to Mitigate Error Misspecifications in GRACE Data Assimilation. Water Resour. Res. 2019, 55, 7622–7637. [Google Scholar] [CrossRef] [Green Version]
Shokri, A.; Walker, J.P.; Dijk, A.I.J.M.; Pauwels, V.R.N. Performance of Different Ensemble Kalman Filter Structures to Assimilate GRACE Terrestrial Water Storage Estimates Into a High-Resolution Hydrological Model: A Synthetic Study. Water Resour. Res. 2018, 54, 8931–8951. [Google Scholar] [CrossRef]
Landerer, F.W.; Swenson, S.C. Accuracy of Scaled GRACE Terrestrial Water Storage Estimates. Water Resour. Res. 2012, 48, W04531. [Google Scholar] [CrossRef]
Schoof, J.T. Statistical Downscaling in Climatology. Geogr. Compass 2013, 7, 249–265. [Google Scholar] [CrossRef] [Green Version]
Le Roux, R.; Katurji, M.; Zawar-Reza, P.; Quénol, H.; Sturman, A. Comparison of Statistical and Dynamical Downscaling Results from the WRF Model. Environ. Model. Softw. 2018, 100, 67–73. [Google Scholar] [CrossRef]
Hou, Y.-K.; Chen, H.; Xu, C.-Y.; Chen, J.; Guo, S.-L.; Hou, Y.-K.; Chen, H.; Xu, C.-Y.; Chen, J.; Guo, S.-L. Coupling a Markov Chain and Support Vector Machine for At-Site Downscaling of Daily Precipitation. J. Hydrometeorol. 2017, 18, 2385–2406. [Google Scholar] [CrossRef] [Green Version]
Jin, Y.; Ge, Y.; Wang, J.; Heuvelink, G.; Wang, L. Geographically Weighted Area-to-Point Regression Kriging for Spatial Downscaling in Remote Sensing. Remote Sens. 2018, 10, 579. [Google Scholar] [CrossRef] [Green Version]
Miro, M.; Famiglietti, J. Downscaling GRACE Remote Sensing Datasets to High-Resolution Groundwater Storage Change Maps of California’s Central Valley. Remote Sens. 2018, 10, 143. [Google Scholar] [CrossRef] [Green Version]
So, B.-J.; Kim, J.-Y.; Kwon, H.-H.; Lima, C.H.R. Stochastic Extreme Downscaling Model for an Assessment of Changes in Rainfall Intensity-Duration-Frequency Curves over South Korea Using Multiple Regional Climate Models. J. Hydrol. 2017, 553, 321–337. [Google Scholar] [CrossRef]
Joshi, D.; St-Hilaire, A.; Ouarda, T.; Daigle, A. Statistical Downscaling of Precipitation and Temperature Using Sparse Bayesian Learning, Multiple Linear Regression and Genetic Programming Frameworks. Can. Water Resour. J./Rev. Can. Des Ressour. Hydr. 2015, 40, 392–408. [Google Scholar] [CrossRef]
Ezzine, H.; Bouziane, A.; Ouazar, D.; Hasnaoui, M.D. Downscaling of TRMM3B43 Product Through Spatial and Statistical Analysis Based on Normalized Difference Water Index, Elevation, and Distance From Sea. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1449–1453. [Google Scholar] [CrossRef]
Chadwick, R.; Coppola, E.; Giorgi, F. An Artificial Neural Network Technique for Downscaling GCM Outputs to RCM Spatial Scale. Nonlinear Process. Geophys. 2011, 18, 1013–1028. [Google Scholar] [CrossRef]
Vu, M.T.; Aribarg, T.; Supratid, S.; Raghavan, S.V.; Liong, S.-Y. Statistical Downscaling Rainfall Using Artificial Neural Network: Significantly Wetter Bangkok? Theor. Appl. Climatol. 2016, 126, 453–467. [Google Scholar] [CrossRef]
Sun, A.Y. Predicting Groundwater Level Changes Using GRACE Data. Water Resour. Res. 2013, 49, 5900–5912. [Google Scholar] [CrossRef]
Yin, W.; Hu, L.; Zhang, M.; Wang, J.; Han, S.-C. Statistical Downscaling of GRACE-Derived Groundwater Storage Using ET Data in the North China Plain. J. Geophys. Res. Atmos. 2018, 123, 5973–5987. [Google Scholar] [CrossRef]
Seyoum, W.; Kwon, D.; Milewski, A. Downscaling GRACE TWSA Data into High-Resolution Groundwater Level Anomaly Using Machine Learning-Based Models in a Glacial Aquifer System. Remote Sens. 2019, 11, 824. [Google Scholar] [CrossRef] [Green Version]
Tayyebi, A.; Smidt, S.; Pijanowski, B. Long-Term Land Cover Data for the Lower Peninsula of Michigan, 2010–2050. Data 2017, 2, 16. [Google Scholar] [CrossRef] [Green Version]
Census Bureau, U. Income, Poverty, and Health Insurance Coverage in the United States. 2009. Available online: https://www.census.gov/prod/2010pubs/p60-238.pdf (accessed on 15 November 2019).
Grannemann, N.G.; Hunt, R.J.; Nicholas, J.R.; Reilly, T.E.; Winter, T.C. The Importance of Ground Water in the Great Lakes Region; Water-Resources Investigations Report 00–4008; US Geological Survey: Reston, VA, USA, 2000.
Rheaume, S.J. Hydrologic Provinces of Michigan; Water-Resources Investigations Report 91–4120; US Geological Survey: Lansing, MI, USA, 1991.
Vugrinovich, R. Patterns of Regional Subsurface Fluid Movement in the Michigan Basin. Available online: https://www.michigan.gov/documents/deq/GIMDL-OFR866_302614_7.pdf (accessed on 15 November 2019).
Groundwater Inventory and Mapping Project Summary and Status—September. 2004. Available online: http://mrwa.org/wp-content/uploads/repository/Exec_Summ_Final_081805.pdf (accessed on 15 November 2019).
Dhanachandra, N.; Manglem, K.; Chanu, Y.J. Image Segmentation Using K-Means Clustering Algorithm and Subtractive Clustering Algorithm. Procedia Comput. Sci. 2015, 54, 764–771. [Google Scholar] [CrossRef] [Green Version]
Tibshirani, R.; Walther, G.; Hastie, T. Estimating the Number of Clusters in a Data Set via the Gap Statistic. J. R. Stat. Soc. Ser. B (Stat. Methodol.) 2001, 63, 411–423. [Google Scholar] [CrossRef]
Save, H.; Bettadpur, S.; Tapley, B.D. High-Resolution CSR GRACE RL05 Mascons. J. Geophys. Res. Solid Earth 2016, 121, 7547–7569. [Google Scholar] [CrossRef]
Save, H.; Bettadpur, S.; Tapley, B.D. Reducing Errors in the GRACE Gravity Solutions Using Regularization. J. Geod. 2012, 86, 695–711. [Google Scholar] [CrossRef]
Watkins, M.M.; Wiese, D.N.; Yuan, D.N.; Boening, C.; Landerer, F.W. Improved Methods for Observing Earth’s Time Variable Mass Distribution with GRACE Using Spherical Cap Mascons. J. Geophys. Res. B Solid Earth 2015, 120, 2648–2671. [Google Scholar] [CrossRef]
Luthcke, S.B.; Sabaka, T.J.; Loomis, B.D.; Arendt, A.A.; McCarthy, J.J.; Camp, J. Antarctica, Greenland and Gulf of Alaska Land-Ice Evolution from an Iterated GRACE Global Mascon Solution. J. Glaciol. 2013, 59, 613–631. [Google Scholar] [CrossRef]
Rodell, M.; Famiglietti, J.S.; Wiese, D.N.; Reager, J.T.; Beaudoing, H.K.; Landerer, F.W.; Lo, M.H. Emerging Trends in Global Freshwater Availability. Nature 2018, 557, 651–659. [Google Scholar] [CrossRef] [PubMed]
Scanlon, B.R.; Zhang, Z.; Save, H.; Sun, A.Y.; Schmied, H.M.; Van Beek, L.P.H.; Wiese, D.N.; Wada, Y.; Long, D.; Reedy, R.C.; et al. Global Models Underestimate Large Decadal Declining and Rising Water Storage Trends Relative to GRACE Satellite Data. Proc. Natl. Acad. Sci. USA 2018, 115, E1080–E1089. [Google Scholar] [CrossRef] [Green Version]
Luthcke, S.B.; Rowlands, D.D.; Lemoine, F.G.; Klosko, S.M.; Chinn, D.; McCarthy, J.J. Monthly Spherical Harmonic Gravity Field Solutions Determined from GRACE Inter-Satellite Range-Rate Data Alone. Geophys. Res. Lett. 2006, 33, L02402. [Google Scholar] [CrossRef]
Ahmed, M.; Sultan, M.; Wahr, J.; Yan, E.; Milewski, A.; Sauck, W.; Becker, R.; Welton, B. Integration of GRACE (Gravity Recovery and Climate Experiment) Data with Traditional Data Sets for a Better Understanding of the Time-Dependent Water Partitioning in African Watersheds. Geology 2011, 39, 479–482. [Google Scholar] [CrossRef] [Green Version]
Scanlon, B.R.; Zhang, Z.; Save, H.; Wiese, D.N.; Landerer, F.W.; Long, D.; Longuevergne, L.; Chen, J. Global Evaluation of New GRACE Mascon Products for Hydrologic Applications. Water Resour. Res. 2016, 52, 9412–9429. [Google Scholar] [CrossRef]
Ahmed, M.; Abdelmohsen, K. Quantifying Modern Recharge and Depletion Rates of the Nubian Aquifer in Egypt. Surv. Geophys. 2018, 39, 729–751. [Google Scholar] [CrossRef]
The Land Processes Distributed Active Archive Center. Available online: https://lpdaac.usgs.gov/data/ (accessed on 15 November 2019).
Van Leeuwen, W.J.D.; Orr, B.J.; Marsh, S.E.; Herrmann, S.M. Multi-Sensor NDVI Data Continuity: Uncertainties and Implications for Vegetation Monitoring Applications. Remote Sens. Environ. 2006, 100, 67–81. [Google Scholar] [CrossRef]
National Snow and Ice Data Center. Available online: https://nsidc.org/ (accessed on 15 November 2019).
Frappart, F.; Ramillien, G.; Ronchail, J. Changes in Terrestrial Water Storage versus Rainfall and Discharges in the Amazon Basin. Int. J. Climatol. 2013, 33, 3029–3046. [Google Scholar] [CrossRef] [Green Version]
Prakash, S.; Gairola, R.M.; Papa, F.; Mitra, A.K. An Assessment of Terrestrial Water Storage, Rainfall and River Discharge over Northern India from Satellite Data. Curr. Sci. 2014, 107, 1582–1586. [Google Scholar] [CrossRef]
Nikzad Tehrani, E.; Sahour, H.; Booij, M.J. Trend Analysis of Hydro-Climatic Variables in the North of Iran. Theor. Appl. Climatol. 2019, 136, 85–97. [Google Scholar] [CrossRef]
USGS Current Conditions for Michigan_Streamflow. Available online: https://waterdata.usgs.gov/mi/nwis/current/?type=flow (accessed on 15 November 2019).
NOAA Tides and Currents. Available online: https://tidesandcurrents.noaa.gov/ (accessed on 15 November 2019).
MODIS/Aqua Land-Surface Temperature/Emissivity Monthly Global 0.05Deg CMG-LAADS DAAC. Available online: https://ladsweb.modaps.eosdis.nasa.gov/missions-and-measurements/products/MYD11C3/ (accessed on 15 November 2019).
Wang, W.; Liang, S.; Meyers, T. Validating MODIS Land Surface Temperature Products Using Long-Term Nighttime Ground Measurements. Remote Sens. Environ. 2008, 112, 623–635. [Google Scholar] [CrossRef]
Mitchell, K.E.; Lohmann, D.; Houser, P.R.; Wood, E.F.; Schaake, J.C.; Robock, A.; Cosgrove, B.A.; Sheffield, J.; Duan, Q.; Luo, L.; et al. The Multi-institution North American Land Data Assimilation System (NLDAS): Utilizing Multiple GCIP Products and Partners in a Continental Distributed Hydrological Modeling System. J. Geophys. Res. Atmos. 2004, 109, D07S90. [Google Scholar] [CrossRef] [Green Version]
NASA- GES DISC. Available online: https://disc.gsfc.nasa.gov/datasets?keywords=NLDAS (accessed on 15 November 2019).
Xu, T.; Guo, Z.; Xia, Y.; Ferreira, V.G.; Liu, S.; Wang, K.; Yao, Y.; Zhang, X.; Zhao, C. Evaluation of Twelve Evapotranspiration Products from Machine Learning, Remote Sensing and Land Surface Models over Conterminous United States. J. Hydrol. 2019, 578, 124105. [Google Scholar] [CrossRef]
Xia, Y.; Mitchell, K.; Ek, M.; Sheffield, J.; Cosgrove, B.; Wood, E.; Luo, L.; Alonge, C.; Wei, H.; Meng, J.; et al. Continental-Scale Water and Energy Flux Analysis and Validation for the North American Land Data Assimilation System Project Phase 2 (NLDAS-2): 1. Intercomparison and Application of Model Products. J. Geophys. Res. Atmos. 2012, 117, D03109. [Google Scholar] [CrossRef]
Xia, Y.; Mitchell, K.; Ek, M.; Cosgrove, B.; Sheffield, J.; Luo, L.; Alonge, C.; Wei, H.; Meng, J.; Livneh, B.; et al. Continental-Scale Water and Energy Flux Analysis and Validation for North American Land Data Assimilation System Project Phase 2 (NLDAS-2): 2. Validation of Model-Simulated Streamflow. J. Geophys. Res. Atmos. 2012, 117, D03110. [Google Scholar] [CrossRef]
Xia, Y.; Sheffield, J.; Ek, M.B.; Dong, J.; Chaney, N.; Wei, H.; Meng, J.; Wood, E.F. Evaluation of Multi-Model Simulated Soil Moisture in NLDAS-2. J. Hydrol. 2014, 512, 107–125. [Google Scholar] [CrossRef]
Sheffield, J.; Pan, M.; Wood, E.F.; Mitchell, K.E.; Houser, P.R.; Schaake, J.C.; Robock, A.; Lohmann, D.; Cosgrove, B.; Duan, Q.; et al. Snow Process Modeling in the North American Land Data Assimilation System (NLDAS): 1. Evaluation of Model-simulated Snow Cover Extent. J. Geophys. Res. Atmos. 2003, 108. [Google Scholar] [CrossRef] [Green Version]
Mo, K.C.; Chen, L.-C.; Shukla, S.; Bohn, T.J.; Lettenmaier, D.P. Uncertainties in North American Land Data Assimilation Systems over the Contiguous United States. J. Hydrometeorol. 2012, 13, 996–1009. [Google Scholar] [CrossRef]
Henn, B.; Newman, A.J.; Livneh, B.; Daly, C.; Lundquist, J.D. An Assessment of Differences in Gridded Precipitation Datasets in Complex Terrain. J. Hydrol. 2018, 556, 1205–1219. [Google Scholar] [CrossRef]
Nash, J.E.; Sutcliffe, J.V. River Flow Forecasting through Conceptual Models Part I—A Discussion of Principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
Hocking, R.R. A Biometrics Invited Paper. The Analysis and Selection of Variables in Linear Regression. Biometrics 1976, 32, 1. [Google Scholar] [CrossRef]
Zolfaghari, A.; Izadi, M. Burst Pressure Prediction of Cylindrical Vessels Using Artificial Neural Network. J. Press. Vessel Technol. 2019, PVT-19-1142. [Google Scholar] [CrossRef]
Gholami, V.; Booij, M.J.; Nikzad Tehrani, E.; Hadian, M.A. Spatial Soil Erosion Estimation Using an Artificial Neural Network (ANN) and Field Plot Data. Catena 2018, 163, 210–218. [Google Scholar] [CrossRef]
Mohaghegi, S.; Del Valle, Y.; Venayagamoorthy, G.K.; Harley, R.G. A Comparison of PSO and Backpropagation for Training RBF Neural Networks for Identification of a Power System with Statcom. In Proceedings of the 2005 IEEE Swarm Intelligence Symposium, Pasadena, CA, USA, 8–10 June 2005; pp. 391–394. [Google Scholar]
Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Representations by Back-Propagating Errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
De’ath, G. Boosted Trees for Ecological Modeling and Prediction. Ecology 2007, 88, 243–251. [Google Scholar] [CrossRef]
Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
Mason, L.; Bartlett, P.; Baxter, J.; Frean, M. Boosting Algorithms as Gradient Descent. In Advances in Neural Information Processing Systems 12; MIT Press: Cambridge, MA, USA, 2000; p. 1098. [Google Scholar]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
Castle, S.L.; Thomas, B.F.; Reager, J.T.; Rodell, M.; Swenson, S.C.; Famiglietti, J.S. Groundwater Depletion during Drought Threatens Future Water Security of the Colorado River Basin. Geophys. Res. Lett. 2014, 41, 5904–5911. [Google Scholar] [CrossRef] [Green Version]
Voss, K.A.; Famiglietti, J.S.; Lo, M.; de Linage, C.; Rodell, M.; Swenson, S.C. Groundwater Depletion in the Middle East from GRACE with Implications for Transboundary Water Management in the Tigris-Euphrates-Western Iran Region. Water Resour. Res. 2013, 49, 904–914. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Fisher, R.A. Statistical Methods for Research Workers; Springer: New York, NY, USA, 1992; pp. 66–70. [Google Scholar] [CrossRef]
Alin, A. Multicollinearity. Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 370–374. [Google Scholar] [CrossRef]
Gronewold, A.D.; Bruxer, J.; Durnford, D.; Smith, J.P.; Clites, A.H.; Seglenieks, F.; Qian, S.S.; Hunter, T.S.; Fortin, V. Hydrological Drivers of Record-Setting Water Level Rise on Earth’s Largest Lake System. Water Resour. Res. 2016, 52, 4026–4042. [Google Scholar] [CrossRef] [Green Version]
Andersen, T.; Carstensen, J.; Hernández-García, E.; Duarte, C.M. Ecological Thresholds and Regime Shifts: Approaches to Identification. Trends Ecol. Evol. 2009, 24, 49–57. [Google Scholar] [CrossRef] [Green Version]
Krabbenhoft, D.P.; Bowser, C.J.; Anderson, M.P.; Valley, J.W. Estimating Groundwater Exchange with Lakes: 1. The Stable Isotope Mass Balance Method. Water Resour. Res. 1990, 26, 2445–2453. [Google Scholar] [CrossRef]
Krabbenhoft, D.P.; Webster, K.E. Transient Hydrogeological Controls on the Chemistry of a Seepage Lake. Water Resour. Res. 1995, 31, 2295–2305. [Google Scholar] [CrossRef]
Wiese, D.; Argus, D.; Yuan, D.; Landerer, F. Combining satellite gravimetry and in-situ GNSS measurements to improve spatial resolution of mass flux estimates. In Proceedings of the GRACE Science Team Meeting (GSTM), Pasadena, CA, USA, 8–10 October 2019. [Google Scholar]

Figure 1. Location map showing the distribution of the three clusters (1, 2, and 3), the Holland and Harbor Beach lake-level measuring stations, stream gauges, lakes Austin and Ostego, three arbitrary pixels (Point 1, Point 2, and Point 3) in clusters 1, 2, and 3, respectively where downscaled GRACE_TWS and GRACE_GWS trends, time series, and uncertainties were estimated, and monitoring wells in Kalamazoo (well A: site name 02S 11W 22CDBB 01) and in Lansing (well B: site name 04N 02W 26BBDB 01 and well C: site name: 04N 02W 16DAAA 01). Inset shows the location of the study area in the USA.

Figure 2. Estimated yield in glacial deposits in gallons/minute (gpm) and hydrological provinces (1 through to 9) of the Lower Peninsula modified from [39] and [41].

Figure 3. Flow chart showing the four main steps that were used to downscale GRACE-derived terrestrial water storage (GRACE_TWS) from 12,000 to 120 km² and to extract fine-resolution (120 km²) GRACE-derived groundwater storage GRACE_GWS.

Figure 4. Schematic diagram of a tree-based gradient boosting method.

Figure 5. Secular GRACE_TWS and GRACE_GWS trend images (mm/year) over the LP for the period 2002 to 2016. The trend from the GRACE_TWS data prior to gap filling and associated uncertainty for each cluster is presented in Table 2. (A) Secular GRACE_TWS trend extracted from GRACE mascon data after filling the gaps for missing months. (B) Secular GRACE_TWS trend image extracted from the downscaled GRACE solutions (120 km²) after filling the gaps for the missing months. (C) Secular GRACE_GWS trend image extracted from downscaled GRACE_TWS and NLDAS NOAH land surface model outputs.

Figure 6. The estimated total error rate for three arbitrary pixels (Point 1, Point 2, and Point 3). The location of the pixels is shown in Figure 1.

Figure 7. Comparison of the time series of GRACE_TWS over land (cluster 3) and Michigan lake level. The time series for the lake levels lagged by two months.

Figure 8. Scatter plots of the GRACE_TWS predicted values (using XGBoost models) versus observed (from testing subsets) GRACE_TWS values for the three clusters.

Figure 9. GRACE_TWS and GRACE_GWS trend images and time series for downscaled data. (A) GRACE_TWS trend image for period II (2013 to 2016). (B) GRACE_TWS time series (2012–2016) over three locations (points 1, 2, and 3; Figure 1). (C) GRACE_GWS trend image for period II. (D) GRACE_GWS time series (2012–2016) over three pixels (points 1, 2, and 3; Figure 1).

Figure 10. Average annual rainfall and snow water equivalent for periods I (2002–2012) and II (2013–2016). (A) Average annual snow water equivalent for period I. (B) Average annual snow water equivalent for period II. (C) Average annual rainfall (mm/year) for period I. (D) Average annual rainfall for period II.

Figure 11. Comparison between the downscaled GRACE_GWS data for three pixels and groundwater levels from monitoring wells within each of the three GRACE pixels in Kalamazoo (well (A)) and Lansing (wells (B,C)) (see locations in Figure 1). Groundwater-level elevations are given in elevation above mean sea level (cm).

Figure 12. Comparison between the downscaled GRACE_GWS data for two pixels and surface water levels from two inland lakes, namely Otsego lake and Austin lake (see locations in Figure 1). The discontinuities in the lake levels is due to temporal gaps in the collected data.

Table 1. Initial input variables for the statistical models.

Variable Name	Format	Resolution	Source
NDVI	raster	(0.05° × 0.05°)	MODIS
Snow cover	raster	(0.05° × 0.05°)	MODIS
Land surface temperature	raster	(0.05° × 0.05°)	MODIS
Total precipitation	raster	(0.125° × 0.125°)	NLDAS
Air temperature	raster	(0.125° × 0.125°)	NLDAS
Soil moisture	raster	(0.125° × 0.125°)	NLDAS
Lakes Level	numerical	N/A	NOAA
Streamflow	numerical	N/A	USGS
Evapotranspiration	raster	(0.125° × 0.125°)	NLDAS

Table 2. Secular trends for GRACE_TWS and GRACE_GWS from 2002 to 2016.

Cluster	ΔTWS (mm/year)	ΔSMS (mm/year)	ΔSWE (mm/year)	ΔGWS (mm/year)
1	16.2 ± 5	0.3 ± 0.0	0.1 ± 0.0	15.8 ± 5
2	14.4 ± 5.2	0.0 ± 0.0	0.7 ± 0.0	13.7 ± 5.2
3	8.8 ± 3.4	−0.7 ± 0.0	0.1 ± 0.0	9.5 ± 3.4

Notes: Locations for clusters are shown in Figure 1. ΔTWS: Change in terrestrial water storage. The values are based on originial GRACE_TWS before gap filling for missing months. ΔSMS: Change in soil moisture storage. ΔSWE: Change in snow water equivalent. ΔGWS: Change in groundwater storage. ΔCWS: Change in canopy water storage for each of the three clusters was found to be negligible. (0.0) and was ignored in estimating ΔGWS.

Table 3. Performance of the applied models modified from [74].

Performance Rating	NSE	NRMSE
Very Good	NSE ≥ 0.75	NRMSE ≤ 0.5
Good	NSE ≥ 0.65 and < 0.75	NRMSE > 0.50 and ≤ 0.60
Satisfactory	NSE ≥ 0.50 and < 0.65	NRMSE > 0.60 and ≤ 0.70
Unsatisfactory	NSE < 0.5	NRMSE > 0.70

Table 4. The coefficient of determination (R-squared), the normalized root-mean-square error (NRMSE), and the Nash-Sutcliffe model efficiency coefficient (NSE) for each of the examined models (extreme gradient boosting, multivariate regression, and artificial neural network) over clusters 1, 2, and 3 and calculated uncertainties.

Method		Cluster 1		Cluster 2		Cluster 3
Method		Coefficients	Uncertainty (%)	Coefficients	Uncertainty (%)	Coefficients	Uncertainty (%)
Extreme Gradient Boosting	R-squared	0.84	16	0.88	12	0.86	14
	NSE	0.84	16	0.87	13	0.85	15
	NRMSE	0.4	40	0.35	35	0.38	38
	Average Uncertainty (%)	24		20		22.3
	Ranking *	VG		VG		VG
Artificial Neural Networks	R-squared	0.6	40	0.84	16	0.86	16
	NSE	0.25	75	0.84	16	0.82	18
	NRMSE	0.85	85	0.4	40	0.42	42
	Average Uncertainty (%)	66.7		24		25.3
	Ranking	US		VG		VG
Multivariate Regression	R-squared	0.72	28	0.76	24	0.85	15
	NSE	0.6	4	0.75	25	0.83	17
	NRMSE	0.62	62	0.48	48	0.4	40
	Average Uncertainty (%)	31.3		32.3		24
	Ranking *	S		VG		VG

Notes: * VG: Very Good; G: Good; S: Satisfactory; US: Unsatisfactory.

Table 5. Correlation matrix for GRACE_TWS values over land (three clusters) and Lake Michigan water levels. The values are presented after the removal of the seasonal cycle and secular trends.

	Cluster1	Cluster2	Cluster3	Lake Level
Cluster1	1
Cluster2	0.41	1
Cluster3	0.66	0.56	1
Lake Level	0.74	0.43	0.58	1

Table 6. Percent contribution of each variable in the outputs of the XGBoost models and their optimum lag times.

Variables
Clusters	Total Precipitation	Temperature	NDVI	Soil Moisture	Lake Michigan Level	Streamflow	Evapotranspiration
1	5.1 (1) *	1.4	2.2 (1)	13.3 (2)	69.6 (1)	6.1 (1)	1.3 (2)
2	3.1 (3)	1.4 (3)	0.00	1.6 (1)	68.6 (2)	23.0 (1)	1.6 (2)
3	3.7 (1)	0.00	3.6 (2)	38.7 (1)	48.1 (2)	5.9	0.0

Note: * The number in parentheses shows the optimum lag time (in months) for each variable.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sahour, H.; Sultan, M.; Vazifedan, M.; Abdelmohsen, K.; Karki, S.; Yellich, J.A.; Gebremichael, E.; Alshehri, F.; Elbayoumi, T.M. Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps. Remote Sens. 2020, 12, 533. https://0-doi-org.brum.beds.ac.uk/10.3390/rs12030533

AMA Style

Sahour H, Sultan M, Vazifedan M, Abdelmohsen K, Karki S, Yellich JA, Gebremichael E, Alshehri F, Elbayoumi TM. Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps. Remote Sensing. 2020; 12(3):533. https://0-doi-org.brum.beds.ac.uk/10.3390/rs12030533

Chicago/Turabian Style

Sahour, Hossein, Mohamed Sultan, Mehdi Vazifedan, Karem Abdelmohsen, Sita Karki, John A. Yellich, Esayas Gebremichael, Fahad Alshehri, and Tamer M. Elbayoumi. 2020. "Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps" Remote Sensing 12, no. 3: 533. https://0-doi-org.brum.beds.ac.uk/10.3390/rs12030533

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Statistical Applications to Downscale GRACE-Derived Terrestrial Water Storage Data and to Fill Temporal Gaps

Abstract

1. Introduction

2. Overview of the Study Area