<br />W05415
<br />
<br />WOODHOUSE ET AL.: UPDATED COLORADO RIVER RECONSTRUCTIONS
<br />
<br />W05415
<br />
<br />.,
<br />
<br />Table 1. Metadata and Descriptive Statistics of Annual Flows
<br />
<br /> Flow Statisticsb
<br />Gauge Location" Gauge Name USGS ID Basin Area, 106 ha Mean, 106 m' CV Skewc r[
<br />A Green R. at Green River, UT 9315000 11.6161 6704 0.30 0.38 0.26d
<br />B Colorado R. nr Cisco. UT 9180500 6.2419 8505 0.28 0.22 0.25d
<br />C San Juan R. nr Bluff, UT 9379500 5.9570 2711 0.40 0.32 0.12
<br />D Colorado R. at Lees Ferry, AZ 9380000 28.9562 18778 0.28 0.15 O.25d
<br />
<br />"
<br />
<br />"Gauge locations coded by letter are shown on map in Figure I.
<br />bMean, coefficient of variation, skewness coetJicient, and tirst-order autocorrelation computed from 1906-1995 annual (water year total) flows.
<br />cNone of the skewness coeflicients are signiticantly different trom zero at Q = 0.05.
<br />dSignificant of first-order autocorrelation based on one-tailed test, (X = 0.01.
<br />
<br />gauge period (I906-1995) and over both early (1906-
<br />1950) and late (1951- 1995) sets of years to ensure the
<br />stability ofthe correlation. A second approach, a "watershed-
<br />limited" approach, followed the same correlation rules, but
<br />the potential predictor set was restricted to chronologies
<br />within a I 00 kilometer buffer around the watershed up-
<br />stream from the gauge.
<br />[9] Reduction of the predictor pool by a watershed
<br />boundary constraint was not feasible for the Lees Ferry
<br />gauge, as the watershed essentially encompasses all chro-
<br />nologies. The approach taken for that gauge was to reduce
<br />the predictor pool by principal components analysis (PCA).
<br />After first removing chronologies uncorrelated with Lees
<br />Ferry streamflow, a PCA was run on the correlation matrix
<br />of the chronologies for their full common period of overlap.
<br />Mardia et ai. [1979, p. 244] suggest that in a regression
<br />context, the components having the largest correlations with
<br />the predictand, rather than the components with the largest
<br />variances, are best suited for retention. Accordingly, only
<br />those components significantly (p<0.05) correlated with
<br />streamflow were retained in the pool of potential predictors.
<br />The resulting pool has essentially been reduced to concisely
<br />express orthogonal modes of common variation in the tree
<br />ring data. Because each component is a linear combination
<br />of all tree ring chronologies correlated with streamflow, the
<br />PCA approach is relatively robust to nonclimatic influences
<br />(e.g., disturbance, insect outbreaks) at individual sites. For
<br />the Lees Ferry reconstruction, model sensitivity to the use of
<br />the standard versus the prewhitened chronologies was tested
<br />for both the non-PCA and PCA approaches described
<br />above. Validation statistics and features of the reconstructed
<br />time series were compared to assess sensitivity of results to
<br />the alternative model fonnulations.
<br />[10] The strength of the regression models was summa-
<br />rized by the adjusted R1 and F level of the regression
<br />equation [Weisberg, 1985]. Possible multicollinearity of
<br />predictors was assessed with the variance inflation factor
<br />(VIF) [Haan, 2002]. A forward stepwise approach was used
<br />to enter predictors from the predictor pools, with threshold
<br />F values for entry or removal of predictors. Variables were
<br />entered in order of their explained residual variance. As a
<br />guide, the F level for a predictor was allowed to have a
<br />maximum p value of 0.05 for entry and 0.10 forretention in
<br />the equation. Residuals for all regression models were
<br />inspected graphically for nonnonnality, trend, autocorrela-
<br />tion, and obvious dependence on values of the predictors or
<br />predicted flows. Any of these conditions could indicate a
<br />need for data transfonnation. Residuals were tested for
<br />nonnality with the Lilliefors test [Conover, 1980].
<br />
<br />[II] As a safeguard against model overfitting, the entry
<br />of predictors was tenninated when it resulted in decreased
<br />validation accuracy. The reduction of error (RE) [Fritts et
<br />ai., 1990] and root mean squared error (RMSE) [Weisberg,
<br />1985] were generated using two different calibration/
<br />validation schemes. In one scheme, a stepwise model
<br />was first fit to the full calibration period, recording the
<br />order of entry of predictors. The model was then fit to the
<br />first half of the data using the same predetennined order of
<br />entry for the predictors, and validated on the second half of
<br />the data. The calibration and validation halves were then
<br />exchanged and the process repeated. In the other validation
<br />scheme, leave-one-out cross validation [Michaelsen, 1987]
<br />was used to generate a single validation series. In both
<br />schemes, the RE and RMSE were calculated for each step
<br />and plotted to assess when the validation scores stopped
<br />improving. One last method of validation involved using
<br />the predictors selected by the stepwise regression process
<br />to run a linear neural network (LNN). LNN is an iterative
<br />model fitting process based on statistical bootstrapping
<br />techniques that was used here to assess bias in the
<br />explained variance. If the relationship between tree growth
<br />and climate is robust and stable, the results of LNN and
<br />stepwise regression should be equivalent [Goodman, 1996;
<br />Woodhouse, 1999].
<br />
<br />3. Reconstructions
<br />
<br />3.1. Full Pool Stepwise Regression Model Results
<br />
<br />[12] Statistics for the initial full pool stepwise regression
<br />results using residual chronologies as predictors are listed in
<br />Table 2 in the first three lines under full pool models
<br />(subbasins) and the first line under the Lees Ferry models.
<br />The regression models all have highly significant F levels,
<br />account for between 72% and 81 % of the variance of
<br />flow, and possess significant skill when applied to cross-
<br />validation testing. The predictor pools for the models
<br />contain between 24 and 38 chronologies, but the stepwise
<br />selection yields four to seven predictor chronologies in the
<br />final models.
<br />[13] The residuals analysis indicated that nonnality of
<br />residuals could not be rejected (Lilliefors test, p < 0.05) for
<br />any of the series. Residuals for one gauge, Colorado-Cisco,
<br />showed borderline significance of autocorrelation at a I-year
<br />lag. For three of the four gauges, residuals had a significant
<br />(p < 0.05) downward trend, suggesting greater tree growth
<br />than expected from flow in recent decades. A scatterplot
<br />indicated that the variance of residuals increased with the
<br />predicted values for the Colorado-Cisco. As neither square-
<br />
<br />3 of 16
<br />
|