Laserfiche WebLink
<br />20 <br /> <br />TECHNIQUES OF WATER-RESOURCES INVESTIGATIONS <br /> <br />7 <br /> <br />. <br /> <br />iable is small relative to the sampling error, <br />(3) the variable is so highly correlated with <br />one or more other variables in the regression <br />that the real effect is divided among them and <br />no one variable shows a significant effect, <br />and (4) the range of the variable sampled may <br />be too small to define .a significant effect. <br />A regression equation does not imply a <br />cause-and-effect relation between the inde- <br />pendent variables and the dependent variable. <br />Both may be influenced by some other factor <br />not readily measured. However, there should <br />be some physical tie between the variables :if <br />the results can be considered meaningful. <br />Selection of a regression model usually begins <br />with a graphical analysis. A model which <br />plots as a straight line is commonly used unless <br />there is strong evidence to the contrary. <br />If the sample data exist near an asymptote <br />or near a maximum or minimum point on the <br />curve, a simple model may be inadequate to <br />describe the relation and a more sophisticated <br />one may not be justified unless many data <br />are available. An example showing the char- <br />acteristics of three common models when <br />applied to data defined near zero is given in <br />figure 13. Physical considerations suggest that <br />neither b nor Q, should be less than zero and <br />that the line should be curved. The zero <br />limitation can be obtained by using the var- <br />iables log b and log Q, and thus making the <br />curve asymptotic to zero on both axes. The <br />addition of a term (log Q,)' will provide the <br />necessary curvature. The regression equation <br />using these three variables is the top one on <br />figure 13. It is not a good fit to the data. <br />Next, assume that it is not necessary that <br />the curve be asymptotic to Q,=O. Then a semi- <br />log model using the variables log b, Q" and (t, <br />would be appropriate. But the equation based <br />on this model reaches a maximum too soon and <br />is a very poor fit. As a last resort assume a <br />simple model with the variables b, Q" and Q;. <br />This equation is a good fit to the data, largely <br />because of the locations of the data. An addi- <br />tional point of b=O at Q,=lO or more would <br />have brought the curve below b=O at Q,=7. <br />The curve shown on figure 13 reaches a mini- <br />mum at Q,=7 and increases beyond. <br />The mechanics of computing the regression <br />equation, the standard error, and the tests of <br /> <br />logb=O.50~1.49 log Q]-0.40 (log Q7i <br /> <br />6 <br /> <br />5 <br /> <br />4 <br /> <br /> <br />. b=3.44-1.05 Q7+0 .083Q/ <br /> <br />o <br /> <br /> <br />02345678910 <br />Q, <br /> <br />Figure 13.-Equations and graphs of three models based on <br />the plotted data, <br /> <br />significance have been described. One important <br />task remains, that of evaluating the results. <br />First, the analyst should recognize that the <br />regression equation developed, even though it <br />. is a good fit to the data, is not necessarily <br />correct :if extrapolated. For example, the curve <br />corresponding to the bottom equation of figure <br />13 is a good fit to the seven points but increases <br />directly with Q, for values of Q, greater than 7. <br />On the other hand, the dashed curve of figure 13 <br />fits the lower five points but becomes asymptotic <br />to zero as Q, increases. Available information <br />does not indicate which extrapolation is more <br />nearly correct. <br />The signs of all significant regression coeffi- <br />cients should be in accord with physical <br />principles. The regression is not necessarily <br />incorrect :if they are not; the nonconformity <br />may be due to interrelations among the inde- <br />pendent variables. Such a regression is useful <br />for estimating values of the dependent variables <br />from known values of the independent variables, <br />and the reliability of the results, if within the <br />defined range of the regression, can be com- <br />puted. <br />The more difficult problem of determining <br />whether a particular variable is related to the <br />dependent variable may not have a definite <br />answer. Even though a regression coefficient <br />is statistically significant, there is a small <br />probability that this result occurred by chance. <br />Other samples could produce conflicting results. <br />On the other hand, :if many regressions produce <br />nonsignificant coefficients of a particular vari- <br /> <br />. <br /> <br />. <br />