Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Mapping Atmospheric Moisture Climatologies across the Conterminous United States

  • Christopher Daly ,

    Chris.Daly@oregonstate.edu

    Affiliation PRISM Climate Group, Northwest Alliance for Computational Science and Engineering, Oregon State University, Corvallis, Oregon, United States of America

  • Joseph I. Smith,

    Affiliation PRISM Climate Group, Northwest Alliance for Computational Science and Engineering, Oregon State University, Corvallis, Oregon, United States of America

  • Keith V. Olson

    Affiliation PRISM Climate Group, Northwest Alliance for Computational Science and Engineering, Oregon State University, Corvallis, Oregon, United States of America

Abstract

Spatial climate datasets of 1981–2010 long-term mean monthly average dew point and minimum and maximum vapor pressure deficit were developed for the conterminous United States at 30-arcsec (~800m) resolution. Interpolation of long-term averages (twelve monthly values per variable) was performed using PRISM (Parameter-elevation Relationships on Independent Slopes Model). Surface stations available for analysis numbered only 4,000 for dew point and 3,500 for vapor pressure deficit, compared to 16,000 for previously-developed grids of 1981–2010 long-term mean monthly minimum and maximum temperature. Therefore, a form of Climatologically-Aided Interpolation (CAI) was used, in which the 1981–2010 temperature grids were used as predictor grids. For each grid cell, PRISM calculated a local regression function between the interpolated climate variable and the predictor grid. Nearby stations entering the regression were assigned weights based on the physiographic similarity of the station to the grid cell that included the effects of distance, elevation, coastal proximity, vertical atmospheric layer, and topographic position. Interpolation uncertainties were estimated using cross-validation exercises. Given that CAI interpolation was used, a new method was developed to allow uncertainties in predictor grids to be accounted for in estimating the total interpolation error. Local land use/land cover properties had noticeable effects on the spatial patterns of atmospheric moisture content and deficit. An example of this was relatively high dew points and low vapor pressure deficits at stations located in or near irrigated fields. The new grids, in combination with existing temperature grids, enable the user to derive a full suite of atmospheric moisture variables, such as minimum and maximum relative humidity, vapor pressure, and dew point depression, with accompanying assumptions. All of these grids are available online at http://prism.oregonstate.edu, and include 800-m and 4-km resolution data, images, metadata, pedigree information, and station inventory files.

Introduction

The demand for spatial climate data sets in digital form continues to increase, as more and more climate-driven modeling and analysis activities are performed within spatially-explicit computing environments. Key inputs to these analyses are grids of thirty-year decadal climate averages (e.g., 1971–2000, 1981–2010, etc.), termed “normals,” that describe the values and spatial patterns that can be expected in an average year or month. Through various forms of a technique called Climatologically-Aided Interpolation (CAI; see Mapping Methods section), climate normal grids also provide the foundation for a number of spatially distributed climate products that cover individual years, months and days [14]. Normals produced by the PRISM (Parameter-elevation Relationships on Independent Slopes Model) Climate Group at Oregon State University are often used for this purpose [59].

Most spatial climate normals are limited to temperature and precipitation, but measures of atmospheric moisture content and deficit are also important in a variety of processes, including plant function [1014], drought severity and wildfire behavior [15], [16], and climate change impacts [17], [18]. Relatively little work has been to done to produce long-term normal grids of such variables, partly due to a lack of observations. Empirical relationships have been developed to derive humidity measures from available temperature and precipitation data (e.g., [1921]), but such methods incorporate assumptions that are violated in certain regions (e.g., [22]). Actual observations of mean dew point (Tdmean) have been included in monthly time series datasets ([9]), which in turn have been incorporated into other datasets (e.g., [23]), but there has been little focus on long-term normals of Tdmean. Further, long-term normals of atmospheric moisture deficit have received even less attention. A useful variable in this regard is vapor pressure deficit (VPD), which is the difference between the saturation vapor pressure (dictated by temperature alone) and the actual vapor pressure. VPD is a direct measure of the deficit, unlike relative humidity (RH), for which one must know the temperature at the time of measurement [24]. Ideally, one would want to know the average minimum VPD (VPDmin) and maximum VPD (VPDmax), which bracket the range of moisture deficit conditions throughout an average day. This is similar to the advantage of knowing the average maximum and minimum temperature as opposed to mean temperature only. An added advantage is that Tdmean, VPDmin and VPDmax, in combination with existing grids of minimum and maximum temperature (Tmin and Tmax), allow many other atmospheric moisture variables to be derived, such as minimum and maximum RH, vapor pressure, and dew point depression, with accompanying assumptions (see Relative Humidity Derivation section, for example).

This paper describes the development of spatial climate normals of 1981–2010 mean monthly Tdmean, VPDmin and VPDmax across the conterminous United States, using methods that strive to account for the major physiographic factors influencing climate patterns. The Tdmean normals are updates of datasets used in the development of a century-long monthly time series [9]. The VPDmin and VPDmax datasets are, to our knowledge, the first of their kind, although datasets of mean RH have been developed [25]. Section 2 of this paper describes the study area, preparation of station data, and mapping methods. Section 3 presents the resulting gridded data sets, and discusses model performance. Concluding remarks are given in Section 4.

Study Area and Work Flow

Climate data sets were developed for the conterminous US at 30 arc-second resolution in geographic (latitude/longitude) coordinates (Fig 1). A 30-arc-second grid cell is approximately 900 x 700 m at 40N latitude, and is referred to here as “800 m.” The boundaries of the grid are 22°N and 50°N and 65°W and 125°W, which are exactly coincident with other 800-m grids from the PRISM Climate Group [8]. The interpolation was performed separately in three overlapping regions: western, central and eastern US, and the resulting grids merged to form a complete conterminous US grid. The western region extends from the Pacific Coast to eastern Colorado, central from central Colorado to Lake Michigan, and eastern from eastern Minnesota to the eastern seaboard.

thumbnail
Fig 1. Map of study area and station locations.

Conterminous US and border area monthly average (a) mean dew point (Tdmean) stations and (b) minimum and maximum vapor pressure deficit (VPDmin and VPDmax) stations.

https://doi.org/10.1371/journal.pone.0141140.g001

Care was taken to include as many islands offshore the US mainland as possible, but undoubtedly some very small islands were missed. To accommodate GIS shoreline data sets of varying quality and resolution, the modeling region was extended offshore several km and generalized to include bays and inlets. However, the gridded climate estimates are valid over land areas only.

Overviews of the data processing and mapping work flows for Tdmean and for VPDmin and VPDmax are diagrammed in Figs 2 and 3, respectively. The process began with hourly and daily data for Tdmean, and hourly data for VPDmin and VPDmax, which were averaged to daily, and then monthly, time steps, with quality screening done at each stage (see Station Data and Processing section). At the monthly time step, all three variables were expressed as dew point depression (DPD) to take advantage of a spatial consistency quality control (QC) method using a version of PRISM called ASSAY (see Quality Control and Calculation of Monthly Values section). Tdmean was kept in the form of DPD for interpolation by PRISM, and converted back to Tdmean as a grid post-processing step. VPDmin and VPDmax were converted from DPD back to their original forms before interpolation by PRISM (see Mapping Methods section). A performance evaluation was conducted on the PRISM interpolation process (see Uncertainty Analysis section). As a final step, output grids from PRISM were checked for consistency with previously-created grids of 1981–2010 Tmax and Tmin.

thumbnail
Fig 2. Work flow diagram for data processing and mapping of Tdmean.

Cross-hatched boxes represent gridded data. See text for abbreviation definitions.

https://doi.org/10.1371/journal.pone.0141140.g002

thumbnail
Fig 3. Work flow diagram for data processing and mapping of VPDmin and VPDmax.

VPD refers to either VPDmin or VPDmax. Cross-hatched boxes represent gridded data. See text for abbreviation definitions.

https://doi.org/10.1371/journal.pone.0141140.g003

Station Data and Processing

Data Sources

Data from surface stations, numbering about 4,000 for Tdmean (44 km average station spacing) and 3,500 for VPDmax and VPDmin (47 km average station spacing) were obtained from a variety of sources (Table 1; Fig 1). Data from the National Weather Service (NWS) Automated Surface Observing System (ASOS) came via Unidata's Internet Data Distribution system and the National Climatic Data Center (NCDC) Integrated Surface Hourly/Integrated Surface Data archives (ftp://ftp.ncdc.noaa.gov/pub/data/noaa/), supplemented by the Solar And Meteorological Observation Network (SAMSON) and Integrated Surface Weather Observations CD-ROMs. NWS Cooperative Observer Program (COOP) and Weather Bureau Army Navy (WBAN) data were obtained from (http://www.ncdc.noaa.gov/cdo-web/). USDA Forest Service and Bureau of Land Management Remote Automatic Weather Station (RAWS) data came from archives at the Western Regional Climate Center (http://www.raws.dri.edu), the Real-time Observation Monitor and Analysis Network http://raws.wrh.noaa.gov), and from the MesoWest Local Data Manager (LDM) feed and website (http://mesowest.utah.edu/data). USDA Natural Resources Conservation Service (NRCS) Soil Climate Analysis Network (SCAN) observations were provided through the National Water and Climate Center web service (http://www.wcc.nrcs.usda.gov). Bureau of Reclamation AgriMet (AGRIMET) data were obtained from the Pacific Northwest Region (http://www.usbr.gov/pn/agrimet/), Great Plains Region (http://www.usbr.gov/gp/agrimet/), Upper Colorado Region in cooperation with the Utah Climate Center (https://climate.usurf.usu.edu/agweather.php), and Mid Pacific Region in cooperation with the Desert Research Institute (http://nicenet.dri.edu/) and the National Oceanic and Atmospheric Administration NOAA Idaho National Laboratory Weather Center (http://niwc.noaa.inel.gov/). Data were also collected from the Oklahoma Climatological Survey Mesonet (OKMESONET; http://www.mesonet.org/data/public/mesonet/mts), Colorado State University’s Colorado Agricultural Meteorological Network (COAGMET; http://ccc.atmos.colostate.edu/~coagmet), and a small number of miscellaneous stations from state and local networks.

thumbnail
Table 1. Source, averaging interval, and number of stations used in the mapping process.

https://doi.org/10.1371/journal.pone.0141140.t001

To improve marine representation, data were obtained from coastal stations and offshore buoys operated by the NOAA National Data Buoy Center (NDBC; http://www.ndbc.noaa.gov/data/historical/stdmet). To better define humidity profiles at high elevations, mean monthly upper-air temperature, geopotential height, and relative humidity grid points for the western and eastern United States were obtained at 2.5-degree resolution for the period 1981–2010 from the National Center for Environmental Prediction (NCEP) Global Reanalysis (ftp://ftp.cdc.noaa.gov/Datasets/ncep.reanalysis.derived/pressure). The 5650-m level (~500 hPa) was chosen for the western US, and the 3050-m level (~700 hPa) for the eastern US. (Upper-air data were not needed in the central US, because of a lack of elevated terrain.) These levels were sufficiently far above the highest terrain features to minimize potential errors involved in estimating surface humidity statistics from free air values.

Quality Control and Calculation of Monthly Values

As shown in Table 1, station data were available at either an hourly or daily time step. Daily data were sufficient for Tdmean, but hourly data were required to calculate VPDmin and VPDmax. In some cases, hourly data were provided in the form of RH directly from the instrument. Range checks were performed to screen out values that were either impossible or could cause instabilities in the calculation of related statistics. If RH ≤ 0 or RH > 105, the value was set to missing. Otherwise, if RH < 0.5, it was set to 0.5, and if 100 < RH ≤ 105 (as sometimes occurs under saturated conditions), it was set to 100.

Dew point (Td) was derived from hourly RH (%) and ambient temperature (Ta,°C) by calculating the saturation vapor pressure at Ta (SATVPa) ([26], Eq 21): (1) and finding Td (°C) [27]: (2)

A consistency check was done to ensure that the calculated Td was less than Ta; if not, each was set to missing.

As a range check, Td values that fell below -68°C, or exceeded the statewide extreme maximum temperature record (http://www.ncdc.noaa.gov/extremes/scec/records) were set to missing. In addition, the hourly Td values were subjected to a step check in which each value could not differ from that of the previous hour by more than 10°C; otherwise the current hour’s Td was set to missing.

Hourly VPDs were calculated from a combination of Ta and Td or Ta and RH, depending on which was available. If RH was available, VPD was calculated by: (1) finding SATVPa from Eq 1; (2) finding the saturation vapor pressure at the previously calculated Td: (3) and; (3) calculating VPD as (4)

If Td was available, VPD was obtained by calculating SATVPd from Eq 3 and VPD from Eq 4. Each hourly VPD was subjected to a range check, where if VPD > 200 hPa or < 0 hPa, or if VPDSATVPa, the value was set to missing.

Three persistence tests were done on Ta and Td hourly values for each 24-hour period. If the maximum difference between any pair of hourly observations or the difference between the maximum and minimum value over the 24-hour period was less than 0.1°C, or the standard deviation of all values in the 24-hour period was less than 0.1°C, the day was considered a “flat day” and all variables, including the VPDs, were set to missing.

For networks with hourly data, Tdmean values were calculated by averaging available hourly observations, subject to the requirement that at least 18 of 24 observations be non-missing; fewer than 18 non-missing hourly observations resulted in a missing daily value. Daily VPDmin and VPDmax, as well as Tmin, and Tmax, were determined by finding the minimum and maximum hourly value, respectively, subject to the same data completeness requirement.

Daily maximum and minimum Td (Tdmax, Tdmin) and mean Ta (Tmean) values were calculated for diagnostic purposes. Consistency checks were performed on daily combinations of Tdmean and temperature as follows: If Tmax < Tdmax, Tmin < Tdmin, or Tmean < Tdmean, all variables, including VPDmin and VPDmax, were set to missing.

The daily values were averaged to create monthly mean Tdmean, VPDmin, and VPDmax for each year of record. A minimum of 85% of non-missing daily values were required for a monthly value to be non-missing [28], [29].

The monthly averages were tested for spatial consistency using the ASSAY QC (quality control) system. ASSAY is a version of PRISM that estimates station values in their absence and compares them to the observed values, a procedure termed cross-validation [8]. The interpolation procedures in ASSAY are exactly the same as in PRISM; the only difference is that ASSAY interpolates to point (station) locations, rather than grid cells. As a rule, a large discrepancy between an observed station value and the interpolated estimate from ASSAY suggests that the station value is unusual compared to nearby stations, and may therefore be erroneous. In previous work, an ASSAY QC analysis was performed for Tdmean, with Tdmean expressed in the form of dew point depression (DPD) with Tmin; that is, DPD = TminTdmean. (As will be discussed in the next section, the DPD form of Tdmean was the favored method of expressing Tdmean for interpolation in this study.) In the previous ASSAY analysis of DPD, QC was done manually by trained personnel, and compared with the ASSAY results. It was found that when the absolute difference between the observation and ASSAY estimate of DPD exceeded 4.5°C, the value was considered erroneous by the manual method. Based on these results, in this study ASSAY QC was applied to Tdmean expressed as DPD, and station values producing absolute differences between the observation and estimate of more than 4.5°C were flagged as bad and set to missing. Overall, 0.8 percent of the monthly Tdmean values were set to missing using this method.

To take advantage of the ASSAY QC screening method, VPDmin and VPDmax were also expressed as DPD with Tmin, termed DPD(VPDmin) and DPD(VPDmax), respectively. This involved: (1) calculating the saturation vapor pressure for the station’s Tmin value (SATVPa) using Eq 1; (2) subtracting the VPD from this value to get SATVPd; (3) obtaining Td from Eq 2, substituting SATVPd for SATVPa and using 100 for the RH value; and (4) subtracting the resulting Td from Tmin to obtain the DPD. Manual QC checks found that the same threshold of 4.5°C used in QC’ing DPD was suitable for DPD(VPDmin) and DPD(VPDmax) as well, and those exceeding this threshold were flagged as bad and set to missing. Overall, 0.7 percent of the monthly VPDmin values and 1.8 percent of the monthly VPDmax values were set to missing using this method.

Monthly values of Tdmean, VPDmin, and VPDmax passing the ASSAY QC screening were averaged over their period of record (POR), or 1981–2010, if available. A 1981–2010 monthly mean calculated using data from at least 23 of these 30 years (75% data coverage) was considered to be sufficiently characteristic of the 1981–2010 period, and was termed a “long-term” station. However, many stations had a POR of fewer than 23 years. Averages from stations with short PORs were subjected to adjustment to minimize temporal biases, as described in [8], Appendix A.

Mapping Methods

Mapping of 1981–2010 mean monthly Tdmean, VPDmin, and VPDmax was performed using PRISM [68], [30]. For each grid cell, PRISM calculated a local linear regression function between the atmospheric moisture variable and a predictor grid (see Climatologically-Aided Interpolation below). Nearby stations entering the regression were assigned weights based primarily on the physiographic similarity of the station to the grid cell. Physiographic factors relevant to this study were distance, elevation, coastal proximity, vertical atmospheric layer, and topographic position (relative to surrounding terrain). Detailed descriptions of the PRISM model algorithms, structure, input grids, and operation are given in [7], [8], and [30]. Details on the specific modeling approach for this study are given below.

Derive or Interpolate VPDmin and VPDmax?

Given that Tdmean is a basic atmospheric moisture variable from which other variables can be derived, early in this study the question was asked: do VPDmin and VPDmax need to be interpolated separately, or can they be derived from a combination of the Tdmean grid and previously-created grids of Tmax and Tmin? More specifically, can Tdmean be combined with Tmax to estimate VPDmax and Tdmean combined with Tmin to estimate VPDmin? To do this, we must assume that Td = Tdmean and Ta = Tmax at the time of VPDmax, and Td = Tdmean and Ta = Tmin at the time of VPDmin. To test these assumptions, monthly averages of VPDmin and VPDmax, as well as VPDmin and VPDmax estimated as described above, were calculated and averaged for each month over the ten-year period 2003–2012 at 100 randomly selected ASOS stations with hourly data. Results of this exercise are given in Table 2 for January and July, which bracket the range of monthly values observed during the year.

thumbnail
Table 2. Monthly averages of actual and estimated vapor pressure deficit.

2003–2012 January and July averages from 100 randomly-selected ASOS stations. Actual and estimated VPD values are expressed as 5th percentile / mean / 95th percentile.

https://doi.org/10.1371/journal.pone.0141140.t002

In Table 2, actual and estimated VPDmin and VPDmax are expressed as means and percentiles (5th percentile / mean / 95th percentile) so that the full distribution of values can be evaluated. Differences between actual and estimated VPDmax were generally within about three hPa or five percent across the distributions in both winter and summer. However, a proportion of the estimated VPDmin values fell below zero, which is not physically possible. This problem was most serious during winter; in January, a VPDmin of less than zero was estimated at 47 of the 100 stations. In these cases, Tdmean exceeded Tmin, resulting in a negative VPD. In reality, the opposite was true, resulting in a positive VPDmin. Given this issue, deriving VPDmin and VPDmax from Tdmean, Tmin, and Tmax was not considered a viable option, leading us to interpolate VPDmin and VPDmax directly from station data.

Climatologically-Aided Interpolation

In previous work, constructing the 1971–2000 and 1981–2010 monthly normals for Tmin and Tmax used a DEM as the predictor grid (see [8] for details on methods used for mapping Tmin and Tmax). There were nearly 10,000 stations used in the mapping of the 1971–2000 Tmin and Tmax normals, and over 16,000 stations used in the 1981–2010 normals. In contrast, there were only 3,500–4,000 stations available for this study, with poor representation at high elevations. Faced with limited station data, we opted for the CAI method of interpolation. CAI uses an existing climate grid to improve the interpolation of another climate element for which data may be sparse or intermittent in time [3136]. This method relies on the assumption that local spatial patterns of the climate element being interpolated closely resemble those of the existing climate grid (called the predictor grid). Uses of CAI fall into two main categories: (1) using a long-term mean grid of a climate element to aid the interpolation of the same element over different (usually shorter) averaging periods; and (2) using a grid of a climate variable to aid the interpolation of a different, but related, climate variable, such as interpolating annual extreme minimum temperature using January mean minimum temperature as the predictor grid [37]. A classic example of the first strategy involves mapping a long-term mean climatology carefully with sophisticated methods, then developing time series grids for shorter averaging periods (monthly or daily) using simpler and faster methods such as inverse-distance weighting to interpolate deviations from the mean climatology to a grid. These deviations can then be added to (e.g., temperature) or multiplied by (e.g., precipitation) the mean climatology to obtain the new grid.

Our use of CAI for this study falls into the second strategy, for which we use pre-existing grids of 1981–2010 mean monthly Tmin and Tmax as predictor grids in the interpolation process. A series of tests was conducted with ASSAY and PRISM to determine which of the 1981–2010 mean monthly Tmin and Tmax grids were the strongest predictors of the spatial patterns of Tdmean. Tmin was found to be a good predictor, which is not surprising given that temperatures often reach the dew point at the time of Tmin over much of the US. However, inspection of the interpolated grids revealed that in some areas subject to locally low temperatures, such as cold air pools in mountain valleys, the interpolated Tdmean exceeded the mean temperature, meaning that long-term mean RH exceeded 100 percent, which was not acceptable. In order to more closely tie the patterns of Tdmean to the patterns of the existing monthly Tmin grids and their relatively large supporting station data sets, each Tdmean station value was expressed as the deviation from Tmin, or DPD (Tdmean—Tmin). Thus, in the interpolation process, PRISM was run with the 1981–2010 mean monthly Tmin grid as the independent variable and DPD as the dependent variable. The local regression functions were generally not as strong as they were in the case of Tdmean vs. Tmin, because DPD was essentially the residual from the Tdmean vs. Tmin relationship. Once the mean monthly DPD values were mapped in this manner, the final Tdmean grid was obtained by adding the DPD grid to the 1981–2010 Tmin grid. The result was an interpolated Tdmean grid that was highly consistent in an absolute, as well as relative sense, with the associated Tmin grid. An example of the relationship between 1981–2010 January mean observed DPD and gridded Tmin for a location north of the Wind River Mountains of Wyoming (43.6N, 109.73W) is shown in Fig 4a. This area is characterized by persistent wintertime cold air pools in mountain valleys, where the humidity is high and Tmin is less than Tdmean (DPD<0). Above these cold pools, Tmin increases and rises above Tdmean (DPD>0).

thumbnail
Fig 4. Scatterplots of relationships between predictor grid and interpolated variables.

1981–2010 mean monthly (a) dew point depression (DPD) vs. minimum temperature (Tmin) for January in western Wyoming; (b) minimum vapor pressure deficit (VPDmin) vs. first-guess VPDmin for June near Las Vegas; and (c) maximum vapor pressure deficit (VPDmax) vs. first-guess VPDmax for October in San Francisco.

https://doi.org/10.1371/journal.pone.0141140.g004

An additional series of tests was conducted with PRISM and ASSAY to determine whether either of the pre-existing 1981–2010 mean monthly Tmin and Tmax grids could be used as predictors of the spatial patterns of VPDmin and VPDmax. Tmax was found to be a good predictor of VPDmax; and the same was true for Tmin and VPDmin. However, once Tdmean was mapped, a superior, second-generation CAI method using Tdmean became available. Termed CAI2, this method involves using the result of a CAI interpolation as the predictor grid in another CAI interpolation. Specifically, a first-guess VPDmin grid was created by calculating the VPD associated with the combination of the Tdmean and Tmin grids using Eqs 1, 3 and 4. Similarly, a first-guess VPDmax grid was found by calculating the VPD associated with the combination of the Tdmean and Tmax grids. These “first-guess” predictor grids represented what the VPDmin and VPDmax spatial patterns would be like if Td was held constant throughout the day at Tdmean, and VPDmin occurred at the time of Tmin and VPDmax occurred at the time of Tmax. While these assumptions are not always correct (see Table 2), they are sufficiently reasonable to produce predictor grids that closely match the relative spatial patterns of VPDmin and VPDmax.

An example of the relationship between 1981–2010 June mean observed VPDmin and first-guess predictor grid VPDmin for a location near Las Vegas, Nevada (35.79N, 115.26W) is shown in Fig 4b. In this desert environment, VPDmin is still relatively large (14–22 hPa), even at its minimum for an average day in June. The lowest VPDmin values are found at higher elevations, where temperatures are cooler. An example of the relationship between 1981–2010 October mean observed VPDmax and the first-guess predictor grid VPDmax for a location in San Francisco, along the California coastline (37.76N, 122.45W), is shown in Fig 4c. In this case, the lowest VPDmax values are found along the immediate coast where temperatures are cooler and moisture is greater, with higher values in warmer and drier inland areas.

PRISM Weighting Functions

During PRISM interpolation, upon entering the local linear regression function for a pixel, each station was assigned a weight based on several factors. The combined weight (W) of a station is given by the following: (5) where Wc, Wd, Wz, Wp, Wl, and Wt are cluster, distance, elevation, coastal proximity, vertical layer, and topographic position weights, respectively, and Fd and Fz are user-specified distance and elevation weighting importance scalars [8], [38]. All weights and importance factors, individually and combined, are normalized to sum to unity. PRISM weighting functions not enabled in this study were topographic facet weighting, which is used primarily to identify rain shadows in precipitation mapping, and effective terrain height weighting, which identifies orographic precipitation regimes based on terrain profiles [8].

Table 3 summarizes how the PRISM climate regression and station weighting functions accounted for physiographic climate forcing factors, and provides citations for further information. Cluster weighting was used to keep clusters of stations that represent similar local conditions from dominating the regression functions; both horizontal and vertical separations were considered [8]. Distance and elevation weighting were used to accommodate the spatial coherence of climatic regimes, both horizontally and vertically [8].

thumbnail
Table 3. PRISM weighting algorithms and associated physiographic climate forcing factors.

Inputs to the algorithms and references on their formulation and use are included. Methods used to prepare the gridded model inputs are summarized in [8], Tables 1 and 2.

https://doi.org/10.1371/journal.pone.0141140.t003

Coastal proximity weighting accounted for sharp gradients in temperature and atmospheric moisture from coastlines to interior regions [7], [8], [30]. Atmospheric layer weighting was useful where temperature inversions occurred, by delineating gradients in the relationship between temperature and atmospheric moisture along the transition from the relatively humid boundary layer near the earth’s surface to the drier free atmosphere above [7], [30]. Topographic position weighting differentiated topographically sheltered locations where cold, moist air may accumulate, from more exposed locations not susceptible to cold air pooling, such as hill slopes and ridge tops [8], [38].

Relevant PRISM control parameters are listed in Table 4. A minimum of 25 stations were required for each pixel’s regression function, and the radius of influence was expanded from a minimum of 20 km to as far as necessary to reach the 25-station threshold. Tdmean slope bounds were expressed as the change in DPD per unit Tmin from the predictor grid, and VPD slopes expressed as the change in VPD per unit first-guess VPD from the predictor grid. The maximum and minimum allowable regression slopes were derived from test runs of PRISM where distributions of slope values were created and outliers examined to determine validity, combined with performance assessments using ASSAY. In the final PRISM interpolation runs, slopes falling outside the designated bounds were set to values that fell halfway between the default slope and the bound that the slope violated (either the maximum or minimum). In the western region, allowable slopes in the relationship between VPDmax and the first-guess VPDmax were constrained to fall between 0.99 and 1.01, so as to avoid a rare circumstance of predicting a VPDmax that might approach the value of SATVPa at Tmax (hence producing a very low RH) in the warmest and driest areas. The exponential relationship between temperature and SATVPa steepens at higher temperatures, leaving less room for error in extremely low-humidity situations. Unconstrained, the average slope of the 5.5 million PRISM regression functions (one per pixel) for July (the warmest month) across the western region was 1.017, with the 10th percentile at 0.964 and the 90th at 1.15, so this constraint had a very slight effect.

thumbnail
Table 4. Values of relevant PRISM interpolation parameters.

Regression slope bounds are sometimes different among regions; values separated by forward slashes are for west, central, and east regions, respectively. See [7] and [8] for details on PRISM parameters and Table 3 for citations to details on weighting function formulations.

https://doi.org/10.1371/journal.pone.0141140.t004

Station weighting parameters were very similar among the three variables, and were generally set to the same values across the three regions. The exception was the elevation weighting exponent for VPDmax, which was set to 0.5, compared to 1.5 for the other variables. Tmax and VPDmax occur primarily during the day, when the atmosphere is more likely to be well-mixed, and vertical gradients in atmospheric moisture more slowly varying. As a rule, the weighting exponents were set to the lowest values possible to achieve the desired effects. Being parsimonious with the weighting exponents ensured that the station data entering the local regression functions were not down-weighted unnecessarily, which can weaken the statistical results by decreasing the effective number of stations in the regression.

A number of small but noticeable inconsistencies in DPD values between adjacent stations were observed during the summer months in some agricultural areas, most notably in eastern Colorado. Further investigation revealed that relatively low DPD values were coming from two networks: AGRIMET and COAGMET. Stations in these networks are typically sited in or near irrigated fields for use in water management calculations, resulting in more humid conditions than locations away from irrigated areas. Given that the data from these networks were of otherwise high quality, it was unreasonable to simply omit these two networks outright. This issue raised questions about whether the grids should, or could, represent conditions over irrigated land. Concerns over attempting to do so are summarized as follows:

  1. Siting requirements for these station networks state that the station must be located in, or immediately adjacent to, an irrigated field to be representative of an irrigated environment. This suggests that the effects of irrigation on humidity are highly local, and which may not extend beyond one 800-pixel. In practice, however, these stations influenced the interpolated estimates many km away, well beyond the likely limits of irrigation in many instances.
  2. A complete picture of the location and extent of irrigated lands across the country was unavailable; further, there was no interpolation mechanism in place to constrain the influence of stations on irrigated fields to just those lands. It was also unclear how to modify DPD values in the many irrigated areas not represented by station data.
  3. We, and likely others, will be using these atmospheric moisture climatologies as the predictor grids for CAI mapping of monthly and daily time series of the same variables. Given that many of these time series will extend back a century or more, when crop patterns and irrigation practices were very different than today, patterns of humidity caused by today’s irrigation patterns could be propagated to times when they are not applicable.

Given these concerns, a subjective middle ground was taken, where a few (<10) stations causing the most severe spatial discrepancies were omitted from the Tdmean dataset, and the rest retained. For consistency, the same stations were also omitted from the VPDmin and VPDmax datasets.

Grid Post-processing

Once monthly grids of Tdmean, VPDmin, and VPDmax were generated with PRISM, post-processing checks were made to ensure that the interpolated values did not exceed reasonable ranges, and that consistency was maintained among the three variables and with the pre-existing 1981–2010 mean monthly Tmin and Tmax grids. Given that the 1981–2010 monthly means are by definition made up of yearly values above and below these means, the acceptable ranges were defined to be more restrictive than what would be allowed on, say, a single day. If Tdmean > (Tmean− 0.5°C), Tdmean mean was set equal to Tmean− 0.5°C. If VPDmin < 0.01 hPa, VPDmin was set equal to 0.01 hPa. If VPDmin < 0.001 SATVPa at Tmin (i.e., RH > 99.9%), VPDmin was set equal to 0.001 SATVPa at Tmin. If VPDmax > 0.95 SATVPa at Tmax (i.e., RH < 5%), VPDmax was set equal to 0.95 SATVPa at Tmax. The number of grid cells affected by these restrictions was limited to a handful in remote mountainous regions of the western US.

Results and Discussion

Spatial Patterns

Dew Point.

Maps of 1981–2010 mean Tmin and Tdmean in January and July are shown in Figs 5 and 6, respectively. The general patterns of Tdmean are similar to those of Tmin, especially in winter. In January, values are lowest in the northern tier of states, and in cold pools along valley bottoms in the West (Fig 5). Tdmean is high in the eastern part of the country in July, exceeding 20°C in much of the Southeast, which is exposed to the transport of moist air from the Gulf of Mexico (Fig 6). Despite relatively warm temperatures, July Tdmean is low in the dry Intermountain West. A notable exception is the Southwest, where the summer monsoon produces locally elevated Tdmean values.

thumbnail
Fig 5. Maps of January minimum temperature and dew point.

Conterminous US 1981–2010 mean January (a) minimum temperature (Tmin) and (b) dew point temperature (Tdmean).

https://doi.org/10.1371/journal.pone.0141140.g005

thumbnail
Fig 6. Maps of July minimum temperature and dew point.

Conterminous US 1981–2010 mean July (a) minimum temperature (Tmin) and (b) dew point temperature (Tdmean).

https://doi.org/10.1371/journal.pone.0141140.g006

Interpolated DPDs between Tmin and Tdmean (TminTdmean) for January and July are shown in Fig 7. DPD is the original variable interpolated with PRISM before being added to the Tmin grid to obtain Tdmean. In January, DPDs are mostly near zero or negative, meaning that Tdmean is similar to, or greater than, Tmin over much of the country. In the northern US and in persistent cold pools in large inland valleys of the West, Tdmean is as much as 3–5°C greater than Tmin (Fig 7a). For example, the 1981–2010 mean January Tdmean at the International Falls, MN ASOS station is -18.1°C and the mean January Tmin is -21.3°C. In contrast, Tdmean averages 6–8°C lower than Tmin in the southwestern US, which is a testament to the region’s aridity, even in winter.

thumbnail
Fig 7. Maps of dew point depression (DPD).

Conterminous US difference between the 1981–2010 mean minimum temperature and the 1981–2010 mean dew point temperature (TminTdmean) for (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g007

In July, Tdmean and Tmin have similar values in the eastern half of the country. Interestingly, major metropolitan areas such as St. Louis, Kansas City, Minneapolis, and Detroit, and several eastern seaboard cities, are visible as small areas of relatively warm Tmin values, resulting in positive differences between Tmin and Tdmean of 1–3°C (dark green spots in Fig 7b). The West is dominated by larger positive differences, where Tmin is much higher than Tdmean. The changeover from near-zero to positive differences roughly follows the 100th meridian. Notable exceptions are the West Coast, where cool, marine air penetrates inland, and areas of the Southwest affected by the summer monsoon. The core area of maximum DPD is centered on southeastern California, Nevada, western Arizona, and lower elevations of Utah, where Tdmean averages 10–25°C lower than Tmin. A combination of low moisture content (low Tdmean) and limited time for nighttime cooling (high Tmin) during summer contribute to these high DPDs. Methods that estimate Tdmean by assuming it is equal to Tmin would experience the largest errors in this region [22].

Vapor Pressure Deficit.

Patterns of 1981–2010 mean VPDmin in January and July are shown in Fig 8. In winter, VPDmin is low (<1 hPa) over most of the country, due to a combination of low temperatures and relatively small differences between Ta and Td during the morning hours (Fig 8a). Vapor pressure varies exponentially with temperature; for example, a difference between Ta and Td of 1°C amounts to a VPD of 0.15 hPa at 20°C, but less than 0.05 hPa at 0°C. In July, VPDmin is also low in the eastern US, but exceeds 5 hPa over much of the West, reaching maxima of over 20 hPa in the desert southwest (Fig 8b).

thumbnail
Fig 8. Maps of mean minimum vapor pressure deficit (VPDmin).

1981–2010 mean monthly VPDmin for (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g008

Spatial patterns of 1981–2010 mean Tmax and VPDmax in January and July are shown in Figs 9 and 10, respectively. Patterns of VPDmax roughly follow those of Tmax in January, with the lowest values in the northern tier and western mountains, and the highest in the southern states (Fig 9). In July, the area of high VPDmax expands considerably, exceeding 25 hPa over much of the western US, and reaching a maximum of over 60 hPa in the desert southwest (Fig 10). Coastal regions of the West exhibit relatively low VPDmax values, as they did for DPD. Similarly low values are found at higher elevations, illustrating the strong spatial relationship between VPDmax and Tmax. In the east, VPDmax values in the Midwest and northeast are mostly less than 20 hPa, while those in the southeast can range above 25 hPa; a notable maximum occurs in the piedmont region of Georgia and the Carolinas.

thumbnail
Fig 9. Maps of January maximum temperature and vapor pressure deficit.

Conterminous US 1981–2010 mean January (a) maximum temperature (Tmax) and (b) maximum vapor pressure deficit (VPDmax).

https://doi.org/10.1371/journal.pone.0141140.g009

thumbnail
Fig 10. Maps of July maximum temperature and vapor pressure deficit.

Conterminous US 1981–2010 mean July (a) maximum temperature (Tmax) and (b) maximum vapor pressure deficit (VPDmax).

https://doi.org/10.1371/journal.pone.0141140.g010

Relative Humidity Derivation.

RH can be derived from VPD and Ta by calculating SATVPa at the desired temperature from Eq 1, and factoring in the appropriate VPD: (6)

On a gridded basis, if only Tmax and Tmin are available to estimate Ta, and only VPDmin and VPDmax available to estimate VPD, grids of minimum RH (RHmin) can be approximated by substituting VPDmax for VPD and Tmax for Ta, and maximum RH (RHmax) can be similarly derived using VPDmin and Tmin. In performing these derivations, we make the assumption that Ta = Tmin and VPD = VPDmin at the time of day when RHmax occurs, and Ta = Tmax and VPD = VPDmax at the time of day when RHmin occurs. To evaluate the error introduced by these assumptions, we calculated 2003–2012 mean monthly RHmin and RHmax for the same 100 ASOS stations used in Table 2 and compared them with the estimated RHmin that would result from substituting VPDmax for VPD and Tmax for Ta, and RHmax resulting from using VPDmin and Tmin. As expected, Tmax and VPDmax were slightly higher than Ta and VPD at the hour of RHmax, and Tmin and VPDmin were slightly lower than Ta and VPD at the hour of RHmin (Table 5). As a result, the substitutions resulted in an overestimation of RHmin and an underestimation of RHmax. The differences averaged less than five percent in all months, suggesting that the derived grids are reasonable, but not exact, measures of the true RH.

thumbnail
Table 5. Results of temperature and vapor pressure deficit substitution in the derivation of minimum and maximum relative humidity.

2003–2012 January and July averages from 100 randomly-selected ASOS stations. Actual and estimated RH values are expressed as a distribution: 5th percentile / mean / 95th percentile.

https://doi.org/10.1371/journal.pone.0141140.t005

Maps of estimated 1981–2010 mean January and July RHmin, estimated from grid values of VPDmax and Tmax, reveal some interesting features not easily seen in the maps of VPDmax (Fig 11). In January, RHmin values are as low as 10–15% on the lee side of the Rocky Mountains, associated with dry, Chinook (downslope) winds produced by a strong westerly jet stream. In contrast, RHmin values exceed 70% in the winter-wet Pacific Northwest. RHmin values are also above 70% in California’s Central Valley and Idaho’s Snake Plain, where extended periods of high pressure and calm winds promote temperature inversions that often result in persistent fog and low clouds (Fig 11a). Higher RHmin values in the upper Midwest reflect frequent cloudy weather during winter in this region.

thumbnail
Fig 11. Maps of minimum relative humidity.

Conterminous US 1981–2010 mean minimum relative humidity (RHmin) estimated from mean maximum temperature (Tmax) and mean maximum vapor pressure deficit (VPDmax) for (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g011

RHmin patterns in July show a divided country, with a dry west and moist east (Fig 11b). The exception is the West Coast, where cool, moist onshore flow maintains relatively high humidity values throughout the day. The lowest RHmin (<10%) values are centered in Nevada, and more generally in the Great Basin. In the east, the highest values are found in regions that receive substantial moisture from the Gulf of Mexico, such as the southern Appalachians and Mississippi Valley.

Uncertainty Analysis

Estimating the true errors associated with spatial climate data sets is difficult, and subject to its own set of errors [36]. This is because the true climate field is unknown, except at a relatively small number of observed points, and even these are subject to measurement and siting uncertainties (as has already been noted in the case of irrigated land, for example). Leave-one-out cross validation (C-V) approach is the most common evaluation method where each station is omitted from the dataset one at a time, the station value estimated in its absence, and the estimate and observation compared. The mean absolute error (MAE) and bias are typically calculated once the process is complete. While this approach is commonly used to assess error in interpolation studies, and is reported here, there are several disadvantages. An obvious disadvantage is that no error information is provided for locations where there are no stations. The single-deletion method favors interpolation models that heavily smooth the results, so that deletion of one station is relatively unimportant to the stability of the estimate. Randomly withholding a larger percentage of the station data at once can help to minimize this issue, as well as provide more robust error statistics. Withholding a stratified sample from the analysis is useful in detecting specific weaknesses or issues in the interpolation, as was done to investigate irrigated stations (also, see [36] for an example).

In this study, uncertanties in the mapped estimates were initially estimated by performing a C-V exercise with ASSAY, and the results compiled for each month for each of the three modeling regions (west, central, and east).

Even accounting for weaknesses in the C-V methodology, these errors are underestimates of the actual C-V uncertainty. One overlooked aspect of CAI is that it relies on predictor grids which have their own interpolation errors. These errors accumulate from one CAI generation to the next. For Tdmean, uncertainties in the interpolated Tmin predictor grids must be accounted for. In turn, VPDmin relies on interpolated Tmin and Tdmean, and VPDmax relies on Tmax, as well as Tmin and Tdmean.

To quantify the effects of error propagation on the CAI MAEs, the predictor grid interpolation error was introduced at each CAI step by using ASSAY interpolated estimates at station locations in their absence instead of the predictor grid values, which already have those station values built in. For Tdmean, Tmin values for all stations used in the mapping of the Tmin predictor grid were estimated in their absence using ASSAY, and estimates common to both the Tmin and Tdmean station datasets used as the values of Tmin in an ASSAY C-V exercise for Tdmean. Tmin values for stations used in the interpolation of Tdmean but not Tmin, such as those from the SCAN and COAGMET networks, could not be estimated by ASSAY, because they were not used to create the predictor grid. However, they could still be included in the Tdmean C-V error estimation because, by definition, the estimates at their location on the Tmin predictor grid were made in their absence.

For VPDmin, ASSAY estimates of station Tdmean values, predicted using ASSAY estimates of Tmin as described above, were used in combination with ASSAY estimates of Tmin to form station values of first-guess VPDmin, used as the predictor in the interpolation of VPDmin. The result was station values of first-guess VPDmin that accounted for errors in the interpolation of both Tmin and Tdmean. These were used as the values of first-guess VPDmin in a C-V assessment of VPDmin with ASSAY. VPDmax involved the same steps as VPDmin, except that first-guess VPDmax values were formed from a combination of ASSAY estimates of Tmax and Tdmean. Again, a C-V exercise for VPDmax was performed with ASSAY.

Table 6 reports monthly ASSAY cross-validation MAEs for each of the modeling regions. MAEs that account for CAI interpolation error propagation are denoted with a CAI or CAI2. Figs 12, 13 and 14 show the distribution of cross-validation absolute errors at stations across the country for Tdmean, VPDmin, and VPDmax, respectively.

thumbnail
Table 6. PRISM monthly cross-validation mean absolute errors (MAEs).

Both interpolation-only MAEs, and MAEs accounting for uncertainties in the predictor grids (denoted by CAI or CAI2), are given below. See text for details.

https://doi.org/10.1371/journal.pone.0141140.t006

thumbnail
Fig 12. Maps of Tdmean absolute cross-validation errors.

Errors for 1981–2010 mean monthly Tdmean in (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g012

thumbnail
Fig 13. Maps of VPDmin absolute cross-validation errors.

Errors for 1981–2010 mean monthly VPDmin in (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g013

thumbnail
Fig 14. Maps of VPDmax absolute cross-validation errors.

Errors for 1981–2010 mean monthly VPDmax in (a) January and (b) July.

https://doi.org/10.1371/journal.pone.0141140.g014

Dew Point.

In general, Tdmean interpolation errors in the west were greater than those in the central and eastern US, due primarily to terrain-induced complexities in the vertical distribution of moisture and temperature. This is evidenced by higher absolute errors in the Rocky Mountains, Cascades, and Sierra Nevada (Fig 12). Regionally, MAEs for Tdmean, interpolated as DPD (TminTdmean), were less than 1°C in the west and less than 0.5°C in the central and east (Table 6). CAI MAEs that accounted for error propagation were 0.1–0.2°C larger than those that did not in the west, but less than 0.05°C larger in the central and east, owing to larger Tmin interpolation errors in the west.

The Tdmean C-V analysis showed that the MAEs were inflated by systematic positive biases (lower observed DPD and higher Tdmean than predicted) in the COAGMET and AGRIMET networks. In fact, bias accounted for nearly all of the MAE. As discussed previously, stations in these networks were typically sited in or near irrigated fields for use in water management calculations. Despite subjective omission of stations producing the largest spatial inconsistencies, network-wide biases were still noticeable. Fig 15a and 15b show monthly Tdmean MAE and bias for each of the major station networks when each network is entirely eliminated from the dataset at one time. Plots are for the central region, to better control for the effects of complex physiographic features on interpolation bias; the exception is AGRIMET, which operates in the western region only, but stations are in flat agricultural areas, so interpolated predictions should be otherwise relatively unbiased. The peak MAE and bias in mid-summer correspond to the maximum difference in atmospheric moisture content one would be expect between irrigated and non-irrigated land. At their summer peaks, the COAGMET MAE and bias were 1.7 and 1.6°C, respectively. Peak AGRIMET MAE and bias were 1.5 and 1.4°C, respectively. These are in contrast to RAWS, for which the peak MAE was 1.4°C but the bias was only 0.1°C in the central region. The MAEs for these networks were much larger than the overall MAE for the central region of less than 0.5°C (Table 6).

thumbnail
Fig 15. Network cross-validation MAE and bias of mean dew point depression (DPD).

By month, station network, and region, when each network is entirely eliminated from the dataset at one time: (a) MAE for each network in the central region, except for AGRIMET, which is in the western region; (b), same as panel a, except showing bias; and (c) RAWS network bias for each region. The number of stations in each network is given in Table 1.

https://doi.org/10.1371/journal.pone.0141140.g015

A second source of systematic bias was found regionally in RAWS stations (Fig 15c). In the western region, biases were negative (higher observed DPD and lower Tdmean than predicted). In this region, the culprits may have been both local land cover and location. RAWS stations in the West are used primarily for fire weather applications, and are typically sited in open, ventilated areas away from transpiring vegetation. In addition, many RAWS stations are located on exposed terrain above locally humid cold air pools, and in foothill locations, which can be above larger-scale valley inversions, where a dry, free atmosphere aloft overlies relatively moist air below. Spring is the time of year when good vertical mixing causes a reduction in the incidence of inversions and cold air pools, which may explain the minimal bias at this time of year. The case for siting and location as causes for bias appears to be supported in the central region, where RAWS biases are near zero (Fig 15c). It is not clear why biases become somewhat positive (lower observed DPD and higher Tdmean than predicted) in the eastern region. One explanation may be that RAWS stations, often located in heavily forested areas in the east, are sampling more locally humid environments than other networks located at airports and in developed areas. The seasonal maximum bias in summer would correspond with the time of maximum transpiration from vegetation.

Vapor Pressure Deficit.

MAEs for VPDmin and VPDmax were calculated regionally as percentages and in absolute hPa units (Table 6) and absolute errors mapped across the country (Figs 13 and 14). MAEs in absolute units were typically larger in summer than in winter, in keeping with characteristically larger vapor pressure deficits in summer due to higher temperatures (Table 6). Regionally, VPDmin MAEs were less than 1 hPa in all months and regions, owing to relatively small absolute values. In January, the highest absolute VPDmin errors were concentrated in the desert southwest, where moisture deficits are greatest (Fig 13). These higher errors spread to most of the west in July, when deficits are seasonally high. Regional MAEs ranged from roughly 30–40 percent in the west, 20–30 percent in central, and 25–50 percent in the east. CAI2 MAEs that accounted for error propagation were 0.1–0.5 hPa and 15–20% larger in the west, 0.05–0.15 hPa and 5–15% larger in the central, and 0.05–0.15hPa and 10–25% larger in the east.

Regional VPDmax absolute MAEs were approximately double those for VPDmin in absolute units. Percentage MAEs were accordingly much lower, ranging from roughly 6–11, 3–5, and 4–6 percent in the west, central, and east, respectively. CAI2 MAEs that accounted for error propagation were 0.1–0.5 hPa and 0.5–1.5% larger in the west, and 0.05–0.25 hPa and 0.5–1.5% larger in the central and east. MAEs were again inflated by systematic positive biases (lower observed VPD observations than predicted) from COAGMET and AGRIMET. The spatial and temporal distributions of absolute errors were similar to that of VPDmin, with the highest January errors in the west, spreading to the east in July (Fig 14). July VPDmax errors were higher in the eastern US than those of VPDmin, because warm maximum temperatures in this region produce significant daytime vapor pressure deficits, despite relatively high RH values.

Summary and Conclusions

Long-term normal grids of 1981–2010 mean monthly average daily Tdmean, VPDmin and VPDmax were developed for the conterminous United States. The Tdmean grids update previous unpublished Tdmean normals used as CAI predictor grids in PRISM monthly time series datasets, and to our knowledge, the VPDmin and VPDmax normal grids are the first of their kind. Interpolation of the long-term monthly averages was performed using PRISM. Nearby stations entering the PRISM local regression functions (one per pixel) were assigned weights based on the physiographic similarity of the station to the grid cell that included the effects of distance, elevation, coastal proximity, vertical atmospheric layer, and topographic position. Relatively few stations were available for these variables, prompting us to use CAI to improve interpolation accuracy. In the CAI process, 1981–2010 monthly Tmin grids served as predictor grids for the interpolation of Tdmean, expressed as the dew point depression (DPD = Tmin—Tdmean). Second-generation CAI (CAI2) involving Tdmean and Tmin, and Tdmean and Tmax, predictor grids were used to interpolate VPDmin and VPDmax, respectively.

The general patterns of Tdmean were similar to those of Tmin in both winter and summer. However, the assumption that Tdmean is equal to Tmin did not hold over large parts of the country. In winter, Tdmean was several degrees higher than Tmin in the northern US and in cold valley locations of the West. In summer, Tdmean averaged 10–25°C lower than Tmin in the desert southwest. VPDmin was very low over most of the country, due to a combination of low temperatures and relatively small differences between ambient and dew point temperatures in the morning hours. Patterns of VPDmax roughly followed those of Tmax in winter, with the lowest values in the northern tier and western mountains, and the highest in the southern states. In summer, the area of high VPDmax expanded considerably, reaching a maximum of over 60 hPa in the desert southwest. Coastal regions of the West exhibited relatively low VPDmax values, where cool, marine air penetrates inland.

A PRISM interpolation uncertainty analysis was performed using C-V exercises. Since CAI relies on predictor grids which have their own interpolation errors, these errors were included in the overall error estimates. To quantify the effects of error propagation on the CAI MAEs, the predictor grid interpolation error was introduced at each step by using ASSAY predictions at station locations instead of the interpolated predictor grid values. When accounting for error propagation, MAEs for Tdmean were 0.8–1.1°C in the west modeling region and less than 0.5°C in the central and east, with the greatest errors typically occurring in summer. VPDmin MAEs were generally less than 1 hPa in all months and regions, and VPDmax MAEs about double those values.

Overall, accounting for CAI error propagation increased C-V errors, but not as much as expected, especially when compared to the initial MAEs associated with the Tmax and Tmin predictor values. One reason for this appears to be that the errors were not systematically biased in one direction; that is, overestimates and underestimates partly canceled each other, producing lower net error increases.

Some stations in the AGRIMET and COAGMET networks were found to have consistently high Tdmean and low VPD values during the summer months. Stations in these networks were typically sited in or near irrigated fields for use in water management calculations, resulting in more humid conditions than locations distant from irrigated areas. This raised questions about whether the grids should represent conditions over irrigated land. A middle ground was taken, where a few stations causing the most severe spatial discrepancies were omitted from the datasets. In addition, the siting of RAWS stations was posited as the reason for lower than expected humidity in the west and higher than expected in the east. These issues highlight the effect that local topographic position and land use/land cover properties can have on the spatial patterns of atmospheric moisture content and deficit.

In combination with existing PRISM grids of Tmin and Tmax, grids of Tdmean, VPDmin and VPDmax allow the user to derive many other atmospheric moisture variables, such as minimum and maximum RH, vapor pressure, and DPD. Accompanying assumptions may need to be made, however, such as those outlined in the derivation of RHmin and RHmax. These new grids will serve as the predictor grids in second-generation CAI to produce an updated version of the PRISM Tdmean monthly time series dataset that covers the years 1895-present [9], and initial versions of monthly VPDmin and VPDmax time series grids for the same period using third-generation CAI. Similar methods will be used to produce initial versions of daily Tdmean, VPDmin and VPDmax time series datasets, spanning 1981-present. All of the PRISM normal grids discussed here are available online at http://prism.oregonstate.edu, and include 800-m and 4-km resolution data, images, metadata, pedigree information, and station inventory files. Links to the papers cited in Table 3 and elsewhere are also available from this website.

Acknowledgments

We thank Mike Halbleib for creating the cartographic maps, and Eileen Kaspar for editing support. We also thank two anonymous reviewers whose comments resulted in a substantially improved manuscript. We are grateful to all of the station data providers, without whom this mapping effort would not have been possible.

Author Contributions

Conceived and designed the experiments: CD. Performed the experiments: CD JIS KVO. Analyzed the data: CD JIS KVO. Contributed reagents/materials/analysis tools: CD JIS KVO. Wrote the paper: CD JIS KVO.

References

  1. 1. Maurer EP, Wood AW, Adam JC, Lettenmaier DP, Nijssen B. A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States. J. Climate. 2002; 15: 3237–3251.
  2. 2. Daly C, Gibson WP, Doggett M, Smith J, Taylor G. Up-to-date monthly climate maps for the conterminous United States. Proc., 14th AMS Conf. on Applied Climatology, 84th AMS Annual Meeting Combined Preprints, Amer. Meteorological Soc., Seattle, WA, January 13–16, 2004, Paper P5.1. Available: http://ams.confex.com/ams/pdfpapers/71444.pdf
  3. 3. Hamlet AF, Lettenmaier DP. Production of temporally consistent gridded precipitation and temperature fields for the continental U.S. Journal of Hydrometeorology. 2005; 6: 330–336.
  4. 4. Livneh B, Rosenburg EA, Lin C, Nijssen B, Mishra V, Andreadis KM, et al. A Long-Term Hydrologically Based Dataset of Land Surface Fluxes and States for the Conterminous United States: Update and Extensions. J. Climate. 2013: 26, 9384–9392.
  5. 5. Lundquist JD, Hughes M, Henn B, Gutmann ED, Livneh B, Dozier J, et al. High-elevation precipitation patterns: using snow measurements to assess daily gridded datasets across the Sierra Nevada, California. Journal of Hydrometeorology. 2015.
  6. 6. Daly C, Neilson RP, Phillips DL. A statistical-topographic model for mapping climatological precipitation over mountainous terrain. J. Appl. Meteor. 1994; 33: 140–158.
  7. 7. Daly C, Gibson WP, Taylor GH, Johnson GL, Pasteris P. A knowledge-based approach to the statistical mapping of climate. Climate Research. 2002; 22: 99–113.
  8. 8. Daly C, Halbleib MD, Smith JI, Gibson WP, Doggett MK, Taylor GH, et al. Physiographically-sensitive mapping of temperature and precipitation across the conterminous United States. International Journal of Climatology. 2008; 28: 2031–2064.
  9. 9. PRISM Climate Group, Oregon State University [internet]. Descriptions of PRISM Spatial Climate Datasets for the Conterminous United States. Available: http://www.prism.oregonstate.edu/documents/PRISM_datasets.pdf. Accessed 25 September 2015.
  10. 10. Hinckley TM, Lassoie JP, Running SW. Temporal and spatial variations in the water status of trees. Forest Science Monographs. 1978; 20: 72 pp.
  11. 11. Running SW. Environmental and physiological control of water flux through Pinus contorta. Can. J. For. Res. 1980; 10: 82–91.
  12. 12. Johnson JD, Ferrell WK. Stomatal response to vapour pressure deficit and the effect of water stress. Plant, Cell and Environ. 1983; 6: 451–456.
  13. 13. Bunce JA. Does transpiration control stomatal responses to water vapour pressure deficit? Plant, Cell and Environ. 1997; 20:131–135.
  14. 14. Zeppel MJB, Lewis JD, Phillips NG, Tissue DT. Consequences of nocturnal water loss: a synthesis of regulating factors and implications for capacitance, embolism and use in models. Tree Physiol. 2014; 34: 1047–1055. pmid:25413023
  15. 15. Littell J.S., McKenzie D, Peterson DL, Westerling AL. Climate and wildfire area burned in Western US ecoprovinces, 1916–2003. Ecological Applications. 2009; 19:1003–1021.
  16. 16. Williams A, Seager R, Berkelhammer M, Macalady A, Crimmins M, Swetnam T, et al. Causes and implications of extreme atmospheric moisture demand during the record-breaking 2011 wildfire season in the southwest United States. J. Appl. Meteor. Climatol. 2009; , in press.
  17. 17. Littell JS, Oneil EE, McKenzie D, Hicke JA, Lutz JA, Norheim RA, et al. Forest ecosystems, disturbance, and climate changes in Washington state, USA. Climatic Change. 2010; 102: 120–158.
  18. 18. Ashcroft BA, Gollan JR. Moisture, thermal inertia, and the spatial distributions of near-surface soil and air temperatures: Understanding factors that promote microrefugia. Agricultural and Forest Meteorology. 2013; 176: 77–89.
  19. 19. Running SW, Nemani RR, Hungerford RD. Extrapolation of synoptic meteorological data in mountainous terrain and its use for simulating forest evapotranspiration and photosynthesis. Canadian Journal of Forest Research. 1987; 17: 472–483.
  20. 20. Kimball J, Running SW, Nemani RR. An improved method for estimating surface humidity from daily minimum temperature. Agricultural and Forest Meteorology. 1997; 85: 87–98.
  21. 21. Thornton PE, Running SW, White MA. Generating surfaces of daily meteorological variables over large regions of complex terrain. Journal of Hydrology. 1997; 190: 214–251.
  22. 22. McEvoy DJ, Meija JF, Huntington JL. Use of an observation network in the Great Basin to evaluate gridded climate data. J. Hydrometeorology. 2014; 15: 1913–1931.
  23. 23. Abatzoglou JT. Development of gridded surface meteorological data for ecological applications and modeling, International Journal of Climatology. 2013; 33: 121–131.
  24. 24. Anderson D B. Relative humidity or vapor pressure deficit. Ecology. 1936; 17: 277–282.
  25. 25. New M, Hulme M, Jones PD. Representing twentieth century space-time climate variability. Part II: Development of 1901–1996 monthly grids of terrestrial surface climate. J Climate. 2000; 13: 2217–2238.
  26. 26. Alduchov OA, Eskridge RE. Improved Magnus form approximation of saturation vapor pressure. J. Appl. Meteor. 1996; 35: 601–609.
  27. 27. National Oceanic and Atmospheric Administration [Internet]. Wet-bulb temperature and dewpoint temperature from air temperature and relative humidity. Available: http://www.srh.noaa.gov/images/epz/wxcalc/wetBulbTdFromRh.pdf. Cited 25 September 2015.
  28. 28. World Meteorological Organization. Calculation of monthly and annual 30-year standard normals. Prepared by a meeting of experts, Washington, DC, USA, March 1989. WCDP 10, WMO-TD 341.
  29. 29. NOAA-National Climatic Data Center [Internet]. Data documentation for data set 3220, summary of the month cooperative. 2003. Available: http://www1.ncdc.noaa.gov/pub/data/documentlibrary/tddoc/td3220.pdf. Cited 25 September 2015.
  30. 30. Daly C, Helmer EH, Quinones M. Mapping the climate of Puerto Rico, Vieques, and Culebra. International Journal of Climatology. 2003; 23: 1359–1381.
  31. 31. Willmott CJ, Robeson SM. Climatologically aided interpolation (CAI) of terrestrial air temperature. Int. J. Climatology. 1995; 15: 221–229.
  32. 32. Plantico MS, Goss LA, Daly C, Taylor G. A new U.S. climate atlas. In: Proc., 12th AMS Conf. on Applied Climatology, Amer. Meteorological Soc., Asheville, NC, May 8–11, 2000, 247–248. Available: http://www.ncdc.noaa.gov/oa/about/cdrom/climatls2/info/atlasad.html.
  33. 33. Funk C, Michaelsen J, Verdin J, Artan G, Husak G, Senay G, et al. The collaborative historical African rainfall model: description and evaluation. Int J Climatology. 2000; 1: 47–66.
  34. 34. Daly C, Gibson WP, Doggett M, Smith J, Taylor G. Up-to-date monthly climate maps for the conterminous United States. Proc., 14th AMS Conf. on Applied Climatology, 84th AMS Annual Meeting Combined Preprints, Amer. Meteorological Soc., Seattle, WA, January 13–16, 2004, Paper P5.1, Available: http://ams.confex.com/ams/pdfpapers/71444.pdf.
  35. 35. Hamlet AF, Lettenmaier DP. Production of temporally consistent gridded precipitation and temperature fields for the continental U.S. Journal of Hydrometeorology. 2005; 6: 330–336.
  36. 36. Daly C. Guidelines for assessing the suitability of spatial climate data sets. International Journal of Climatology. 2006; 26: 707–72.
  37. 37. Daly C, Widrlechner MP, Halbleib MD, Smith JI, Gibson WP. Development of a new USDA Plant Hardiness Zone Map for the United States. Journal of Applied Meteorology and Climatology. 2012; 51: 242–264.
  38. 38. Daly C, Smith JI, McKane R. High-resolution spatial modeling of daily weather elements for a catchment in the Oregon Cascade Mountains, United States. Journal of Applied Meteorology and Climatology. 2007; 46: 1565–1586.