Laserfiche WebLink
<br />OLS regression analyses were performed <br />using the Statit! statistical procedures <br />ALLREG and SREGRES (Statware, Inc., 1990, <br />p. 6-10 - 6-27). Initial selections of significant <br />explanatory variables for the OLS regression <br />models were performed by using the ALLREG <br />procedure. The ALLREG procedure uses an <br />all-possible subsets regression to identify the <br />best-possible combinations of explanatory <br />variables on the basis of the Mallows' Cp <br />statistic (Mallows, 1973). The SREGRES <br />procedure, based on stepwise-regression <br />algorithms, then was used to perform an OLS <br />regression analysis on each best-possible <br />combination of explanatory variables. The final <br />selection of explanatory variables was based on <br />the following criteria as described by Koltun and <br />Roberts (1990, p. 11): <br /> <br />1. The selection of explanatory variables, as <br />well as the signs and magnitudes of their <br />respective regression coefficients, need to be <br />hydrologically valid in the context of flood <br />runoff. This criterion takes precedence over all <br />other criteria. <br /> <br />2. All explanatory <br />statistically significant <br />confidence level. <br /> <br />variables should be <br />at the 95-percent <br /> <br />3. The selection of explanatory variables, <br />within the constraints of criteria 1 and 2, should <br />minimize the prediction error sum of squares <br />[the PRESS statistic, an index of the prediction <br />error associated with the regression equation <br />(Allen, 1971; Montgomery and Peck, 1982)], <br />maximize the coefficient of determination (R2, a <br />measure of the proportion of the variation in the <br />response variable accounted for by the <br />regression equation), and mlIllmlze the <br />standard error of estimate. Correlation between <br />explanatory variables and the vClriance inflation <br />factor (VIF) (Marquardt, 1970; Montgomery and <br />Peck, 1982) was used to assess multicollinearity <br />in the regression models. <br /> <br />Weighted Least-Squares Regression <br /> <br />A weighted least-squares (WLS) regression <br />technique described by Tasker (1980 p. 1107- <br />1109) was used to develop the final <br /> <br />1 Use of brand names in this report is for identification <br />purposes only and dol's nut constitute endorsement by <br />the U.S. Geological Survey. <br /> <br />multiple-regression equations for both the <br />drainage-basin and channel-geometry flood- <br />estimation methods. WLS regression analyses <br />improved R2 values and standard errors of <br />estimate obtained in the majority of the OLS <br />regression analyses. Tasker (1980, p. 1107) <br />reports that OLS regression assumes that the <br />time-sampling variance in the response- <br />variable estimates (design-flood discharges) are <br />the same for each gaging station used in the <br />analysis (assumption of homoscedasticity). In <br />hydrologic regression, this assumption usually <br />is violated because the reliability of response- <br />variable estimates depends primarily on the <br />length of the observed annual-peak discharge <br />records. The error associated with <br />response-variable estimates is inversely <br />proportional to record length (Choquette, 1988, <br />p. 16-17). WLS regression adjusts for the <br />variation in the reliability of the response- <br />variable estimates by using a weighting <br />function to account for differences in the lengths <br />of observed annual-peak discharge records at <br />gaging stations. The weighting function <br />(Tasker, 1980, p. 1107) is based on theory and on <br />analysis of residuals from the initial OLS <br />regression equation. The weighting function <br />assumes that the response variable (the <br />design-flood discharge) is determined by fitting <br />the logarithms (base 10) of the observed <br />annual-peak discharges to a Pearson Type-III <br />distribution. <br /> <br />The variance of the response variable, QT, is <br />estimated to determine the appropriate weight <br />factors for the WLS regression analysis. To be <br />effective, the weight factors should be inversely <br />proportional to the variance of QT' The variance <br />of QT' for a selected l' -year recurrence interval, <br />can be partitioned into two componentsnthe <br />variance due to OLS regression-model error, co' <br />and the variance due to time-sampling error, ti, <br />which is related to the standard deviation and <br />weighted skew coefficient of the logarithms of <br />the observed annual-peak discharges and to the <br />number of years of effective record length <br />(Tasker, 1980, p, 1107-1109; Choquette, 1988, p. <br />16-171. The weight function has the form <br /> <br />It" ::: <br />, <br /> <br />(4) <br /> <br />(Co + [I) <br /> <br />where Wi <br /> <br />'5 the weight factor for the ith <br />gaging station used in the <br /> <br />l <br />l <br /> <br />8 ESTIMATING DESIGN-FLOOD DISCHARGES FOR STREA..'\1S IN IOWA <br />