Laserfiche WebLink
<br />SOME STATISTICAL TOOLS IN HYDROLOGY <br /> <br />33 <br /> <br />. <br /> <br />chlLIlce if there were no real difference between <br />periods. The double asterisk on the mean <br />square for between periods (in the analysis-of- <br />varilLIlce table) denotes statistical significance <br />above the 0.01 level. <br />Now consider a similar problem, to deter- <br />mine whether mean annual precipitation at <br />three stations is different. The data are given <br />in the following table. <br /> <br />Precipitation, in inches, at- <br /> <br />Ye." <br /> <br />Site 1 <br /> <br />Site 2 <br /> <br />Site 3 <br /> <br />I' <br /> <br />1945_____________ <br />1946_____________ <br />1947_____________ <br />1948_____________ <br />1949_____________ <br />1950_____________ <br />1951_____________ <br />1952_____________ <br /> <br />47.5 <br />348 <br />42.2 <br />59.9 <br />51. 7 <br />42.5 <br />40.5 <br />47.8 <br />366.9 <br />45.9 <br /> <br />40.6 <br />36. 1 <br />37.5 <br />52. 3 <br />42. 2 <br />40.6 <br />38. 3 <br />45. 8 <br />333.4 <br />41. 7 <br /> <br />48.2 <br />40.2 <br />37.8 <br />58.2 <br />43.3 <br />41. 4 <br />42.3 <br />48.2 <br />359.6 <br />45. 0 <br /> <br />Sums__n_n <br />ynnn_n_______ <br /> <br />. <br /> <br />From the data in the table we ClLIl make the <br />following calculations: <br /> <br />T=1,059.9 <br />T'fN=46,807.8 <br />2:Y~=47,784.0 <br />2:T,'fn=46,885.4 <br /> <br />The lLIlalysis-of-varilLIlce table is <br /> <br />Source <br /> <br />Sum of <br />squares <br /> <br />Mean <br />square <br /> <br />Degrees <br />of <br />freedom <br /> <br />Among sites_______ <br />Within sitee_______ <br />TotaL___n <br /> <br />77.6 <br />898. 6 <br />976. 2 <br /> <br />2 38. 8 <br />21 42. 9 <br />23 nn___.. <br /> <br />. <br /> <br />F2.21=38,8f42.9<1; therefore there is no dif- <br />ference statistically among the three means. <br />A perusal of hydrologic literature will turn <br />up very few applications of analysis of variance. <br />Most analyses of varilLIlce are based on data <br />from a designed experiment, lLIld it is this ap- <br />lication for which the best results are obtained. <br />Hydrologic data are usually parts of a time <br />series which may not be stationary. Thus the <br />individual values may not be entirely inde- <br />pendent as required for a valid lLIlalysis of <br /> <br />variance. In the example comparing mean run- <br />offs for two periods of record, it was concluded <br />that a real difference existed between periods. <br />But there is no physical reason to expect a <br />change in this basin. The earlier period was <br />one of high precipitation; the later period in- <br />cluded the drought of the thirties. It is also <br />possible that some of the annual rnnoffs were <br />serially correlated. Thus the characteristics <br />of the data tend to discredit the results of this <br />particular application of the analysis of <br />variance. <br />In the last example, the precipitation at site <br />2 is greater than that at site 1 for every year <br />.shown, yet the lLIlalysis of variance shows no <br />difference in melLIlS. (An analysis of variance <br />between site 1 and site 2, only, shows a dif- <br />ference at a probability level of about 0.25.) <br />The annual precipitations at a site may be <br />independent, but the precipitations at the sev- <br />eral sites for the same year are not. Therefore <br />the requirements of the method are not met, <br />and the results must be accepted with <br />reservation. <br />The two examples given utilize a very simple <br />statistical model. For more complicated prob- <br />lems, several models may be considered. Selec- <br />tion of the appropriate one is difficult for the <br />"part-time" statisticilLIl. Many statistics texts <br />treat lLIlalysis of variance in detail. See Bennett <br />lLIld Franklin (1954), Brownlee (1960), and <br />Dixon and Massey (1957). In general, an <br />lLIlalysis of varilLIlce made by someone not <br />thoroughly familiar with the process should be <br />reviewed by a statisticilLIl for suitability of the <br />model and correctness of the interpretation. <br /> <br />Analysis of covariance <br /> <br />The lLIlalysis of variance of the runoff data <br />for two periods (see p. 32) indicated that the <br />population means were probably different, yet <br />other information, particularly precipitation <br />records, leads to the opposite conclusion. The <br />precipitation data can be incorporated in the <br />lLIlalysis by using lLIl analysis of covariance. <br />This method includes concepts from analysis of <br />variance and regression and is applicable where <br />a variable represents a measurement for each <br />individual as opposed to a variable which can <br />only be separated into a few categories. <br />