Laserfiche WebLink
(the lines to the boundary points of the boxplot), and <br />the quartile skew (the relative position of the median to <br />the 25th and 75th percentiles). For nutrient- and sedi- <br />ment -data groups that had five to nine data values, only <br />the individual data values were plotted, and only the <br />medians were reported in tables. For pesticides, all val- <br />ues were plotted, but statistical comparisons were not <br />done for groups that had less than five values, and only <br />the medians were reported in tables. For data groups <br />that had 10 to 14 values, the boxes and median line <br />were drawn for boxplots, and the 25th and 75th percen- <br />tiles were added to the tables. For data groups that had <br />15 or more values, boxplots and boundary lines were <br />drawn, and the 10th and 90th percentiles of the data dis- <br />tributions were added to tables. Where appropriate, a <br />line was drawn on boxplots to indicate the USEPA <br />Maximum Contaminant Level (MCL) for each constit- <br />uent. <br />The Kruskal- Wallis test was used to compare the <br />means of different groups of data. This is a non -para- <br />metric test that compares real and chance differences in <br />groups of data; the null hypothesis states that no real <br />difference exists. For this study, this test was done at <br />the 0.05 level of significance, which represents the <br />maximum probability of rejecting the null hypothesis <br />when it is actually true. Multiple -stage tests (as <br />described in Helsel and Hirsch, 1992) were performed <br />to detect differences between various combinations of <br />groups. <br />The LOWESS, or LOcally WEighted Scatterplot <br />Smoothing method (Cleveland, 1979) was used to <br />highlight trends or patterns of selected nutrient concen- <br />trations over time. To test for statistically- significant <br />trends, the Seasonal Kendall test (Hirsch and others, <br />1982) was done at the 0.05 level of significance. <br />Water - quality trends were only analyzed for sites with: <br />(1) instantaneous flow data and at least one nutrient <br />sample per month during 1980 -92 and (2) at least one - <br />half of the data present in the first and last thirds of the <br />record. Concentrations were flow- adjusted to remove <br />the variability due to differences in streamflow <br />(Lanfear and Alexander, 1990). <br />Nutrient loads were estimated using nutrient - <br />transport models. Models were based on data collected <br />during water years 1980 -92. Data requirements for <br />load calculation were as follows: (1) 35 or more obser- <br />vations for a nutrient constituent from the period 1980 <br />to 1992; (2) at least 3 of these observations from sam- <br />ples collected during the top decile of flow; and <br />(3) daily streamflow data available for years of interest. <br />Models were developed by multiple regression of <br />nutrient - constituent load on independent variables <br />including: (1) streamflow, because constituent concen- <br />trations often are related to streamflow; (2) time, to <br />compensate for long -term trends; and (3) sine and <br />cosine of time to compensate for seasonal variations. <br />Censored data were modified using the adjusted maxi- <br />mum likelihood estimator (Cohn and Gilroy, 1992). <br />421 Number of observations p > 0.05 <br />90th percentile <br />75th percentile- <br />Percentiles are statistics <br />upper quartile <br />which describe the varia- <br />bility in a data set. The <br />nth percentile is the value <br />that has at most n percent <br />of the observations less <br />than that value. For example, <br />50th percentile - <br />the 25th percentile has 25% <br />median value <br />of the observations less <br />than that value. <br />If 9 or less data points <br />exist, the data points <br />themselves are represented <br />25th percentile- <br />instead of a box. <br />lower quartile <br />10th percentile <br />KRUSKAL -WAWS TEST-Hypothesis <br />test to examine real versus chance <br />differences in data. The test <br />involves a null hypothesis stating <br />that no real difference exists. <br />An alpha value, or level of signif- <br />icance, is used in the hypothesis <br />test representing the maximum <br />probability of rejecting the null <br />hypothesis when it is actually true. <br />An alpha value of 0.05 is used. <br />p > 0.05 - Probability represent- <br />ing the attained significance <br />level. If the p -value is smaller <br />than or equal to the alpha value, <br />the null hypothesis is rejected <br />and significant differences are <br />assumed to exist among the <br />data. <br />Figure 11. Description of components of a boxplot and related statistical tests. <br />DATA - ANALYSIS METHODS 33 <br />