Laserfiche WebLink
<br />. <br /> <br />SOME STATISTICAL TOOLS IN HYDROWGY <br /> <br />33 <br /> <br />chance if there were no real diflerence between <br />periods. The double asterisk on the mean <br />square for between periods (in the analysis-of- <br />variance table) denotes statistical significance <br />above the 0.01 level. <br />Now consider a similar problem, to deter- <br />mine whether mean annual precipitation at <br />three stations is diflerent. The data are given <br />in the following table. <br /> <br />Precipitation, in inches, at- <br /> <br />Year <br /> <br />Site 1 <br /> <br />Site 2 <br /> <br />Site 3 <br /> <br />. <br />, <br />1 <br /> <br />1945_____________ 40. 6 48. 2 47. 5 <br />1946_ _ ----------- 36.1 40. 2 34- 8 <br />1947_____________ 37. 5 37. 8 42. 2 <br />1948_____________ 52. 3 58.2 59.9 <br />1949_____________ 42. 2 43.3 51. 7 <br />1950_____________ 40. 6 41. 4 42.5 <br />1951_____________ 38.3 42.3 40.5 <br />1952_____________ 45.8 48.2 47. 8 <br /> Sums______~ 333.4 359.6 366.9 <br />ynn __ __ _ ____ ___ 41. 7 45.0 45. 9 <br /> <br />. <br /> <br />From the data in the table we can make the <br />following calculations: <br /> <br />T=1,059.9 <br />T'/N=46,807.8 <br />~Y:,=47,784.0 <br />~T,'fn=46,885.4 <br /> <br />The analysis-of-variance table is <br /> <br />Source <br /> <br />Sum of Degrees <br />squares of <br />freedom <br /> <br />Mean <br />square <br /> <br />Among sites_ _ _ _ _ __ <br />Within sites_____n <br />TotaL_____ <br /> <br />77.6 <br />898.6 <br />976. 2 <br /> <br />2 38. 8 <br />21 42.9 <br />23 n___n__ <br /> <br />I. <br /> <br />F'.'1=38.8/42.9<1; therefore there is no dif- <br />ference statistically among the three means. <br />A perusal of hydrologic literature will turn <br />up very few applications of analysis of variance. <br />Most analyses of variance are based on data <br />from a designed experiment, and it is tms ap- <br />lication for which the best results are obtained. <br />Hydrologic data are usually parts of a time <br />series which may not be stationary. Thus the <br />individual values may not be entirely inde- <br />pendent as required for a valid analysis of <br /> <br />. <br /> <br />variance. In the example comparing mean run- <br />offs for two periods of record, it was concluded <br />that a real diflerence existed between periods. <br />But there is no physical reason to expect a <br />change in this basin. The earlier period was <br />one of high precipitation; the later period in- <br />cluded the drought of the thirties. It is also <br />possible that some of the annual runoffs were <br />serially correlated. Thus the characteristics <br />of the data tend to discredit the results of this <br />particular application of the analysis of <br />variance. <br />In the last example, the precipitation at site <br />2 is greater than that at site 1 for every year <br />shown, yet the analysis of variance shows no <br />diflerence in means. (An analysis of variance <br />between site 1 and site 2, only, shows a dif- <br />ference at a probability level of about 0.25.) <br />The annual precipitations at a site may be <br />independent, but the precipitations at the sev- <br />eral sites for the same year are not. Therefore <br />the requirements of the method are not met. <br />and the results must be accepted with <br />reservation. <br />The two examples given utilize a very simple <br />statistical model. For more complicated prob- <br />lems, several models may be considered. Selec- <br />tion of the appropriate one is difficult for the <br />Ifpart-time" statistician. Many statistics texts <br />treat analysis of variance in detail. See Bennett <br />and Franklin (1954), Brownlee (1960), and <br />Dixon and Massey (1957). In general, an <br />analysis of variance made by someone not <br />thoroughly familiar with the process should be <br />reviewed by a statistician for suitability of the <br />model and correctness of the interpretation. <br /> <br />Analysis of covariance <br /> <br />The analysis of variance of the runoff data <br />for two periods (see p. 32) indicated that the <br />population means were probably diflerent, yet <br />other information, particularly precipitation <br />records, leads to the opposite conclusion. The <br />precipitation data can be incorporated in the <br />analysis by using an analysis of covariance. <br />This method includes concepts from analysis of <br />variance and regression and is applicable where <br />a variable represents a measurement for each <br />individual as opposed to a variable which can <br />only be separated into a few categories. <br />