rvid L Otis, f~ 4;
<br />e
<br />!~......
<br />.Johnson 763
<br />1. Marcluff 773
<br />Halmata,
<br />:o Restani
<br />8. Isaacs
<br />RE
<br />781
<br />~fi ~0 ~ h C} n~ ,
<br />THE INSIGNIFICANCE OF STATISTICAL SIGNIFICANCE TESTING
<br />DOUGLAS H. JOHNSON,' U.S. Geological Survey, Biological Resources Division, Northern Prairie Wildlife Research Center,
<br />Jamestown, ND SB401, U6A
<br />~~
<br />794 - Key words: $avesian approaches, rnnfidence interval, null hypothesis, P-value, power analysis, scientific hy-
<br />pothesis -test. statistical hypothesis test. ~ •
<br />3. Christie
<br />MODEL
<br />~ A. Klaus
<br />l K. Carrie
<br />-JAY .. .
<br />H. Below
<br />q. Wagers
<br />ERN
<br />~. Mitchell
<br />:orquodale
<br />fE-
<br />~. Hartless
<br />ITRAL
<br />S. Wilson
<br />EXAS...
<br />K. Fritzell
<br />~,
<br />an Cassini
<br />803
<br />815
<br />824
<br />833
<br />843
<br />853
<br />861
<br />872
<br />880
<br />889
<br />895
<br />ioeiery. 5410 Grosvenor line.
<br />e~. and -includes all WiiDUFE
<br />,ues can tx obtained from the
<br />~. rho Wildlire Society. solo
<br />~ersi(y, Stillwater, OK 7407A.
<br />ct Biology. SWY College of
<br />w Hampshire Street. Lawrcncc.
<br />•nsvennr hoe, Bethesda. ~iD
<br />Statistical testing of hypotheses in the wildlife
<br />field has increased dramatically in recent years.
<br />Even more recent is an emphasis on power
<br />analysis associated with hypothesis testing (The
<br />Wildlife Society 1990. While this trend was oc-
<br />curring, statistical hypothesis testing was being
<br />deemphasized in some other disciplines. As an
<br />example, the American Psychological Assoc%a-
<br />tion seriously debated a ban on presenting re-
<br />sults of such tests in the Association's scientific
<br />journals. That proposal yvas rejected, not be-
<br />cause it lacked merit, but due to its appearance
<br />of censorship ~ Vleehl 199; ).
<br />The issue yeas lughlighted at the 1998 annual
<br />conference of The Wildlife Socieh-. in $uffalo.
<br />~ieyv fork. inhere the Biumetric•s ~Vo.rkin;
<br />Croup sponsored a half-day symposium on
<br />Evaluatin_ the Rule of Hypothesis Testin~-
<br />Poyver _~nalysis in Wildlife Science. Sp?akers at
<br />that session a~ho addressed statistical hypothesis
<br />testinr, yvr,re yimlally unanimous in their opin-
<br />ion that the tool wars oyenlsed. Misused. and
<br />ulten inappropriate.
<br />~[v ohjecrives are to hrieHv describe statisti-
<br />cal h~-pothesis testing, discuss common b~rt in-
<br />correct interpretations of resulting P-values.
<br />Mention some shortcolnin~s of hypothesis test-
<br />ier„ indicate. yylw hypothesis testim~ is conduct-
<br />ed, and outline some alternatives.
<br />l E-mail: dougl:l.~lLjohnsonC~nbs.~uy
<br />pence of Paper).
<br />WHAT 1S STATISTICAL HYPOTHESIS
<br />TESTING?
<br />Abstrnet: llespite their wide use in scientific journals such as The jorrnral of [Yikllife ~larut~rrnenf, statistical
<br />-' hypothesis tests add ven• little value to the products of research. Indeed, they frequently conftue the inter-
<br />'` pretation of data. This paper describes how statistical 1>,ypothrsis tests are often viewed, and then contracts
<br />that interpretation yyith the correct one. I discuss the arbitrariness of P-values, conclusions that the null hv-
<br />pothesis is tore, power analysis, and distinctions behveen statistical and biological significance. Statistical hy-
<br />- pothesis testing, itl yvhiclt the null hypothesis about the properties of a population is almost alyvavs )mown a
<br />:~ priori to be false, is contrasted with scientific hypothesis testing, which ecamines a credible null hypothesis
<br />~`_' about phenomena in nature. More meaningful alternatives are briefly outlined, including estimation and mn-
<br />fidence intervals for determining the importance of factors, decision theory for guiding actions in the face of
<br />uncertainh~, and Bavesian approaches to hypothesis testing and other statistical prlctices.
<br />JOURNAL OF WILDLIFE MANAGEMENT 63(3):763-772
<br />Four basic steps constitute statistical hypoth-
<br />esis testing. First, one develops a null hypothesis
<br />about some phenomenon or parameter. This null
<br />hppothe;is is generally the opposite of the re-
<br />search h~gothesis, which is what the investigator
<br />truly believes and wants to demonstrate. Re-
<br />search hypotheses may- be venerated either in-
<br />duc~vely'; from a study of obsen•ations already
<br />made. or deductiyeh•, Jeri`ing from theon•. Nest,
<br />data are collected that bear on the issue. typically
<br />by an experiment or by samnliu;. (dull ltypoth-
<br />eses often are developed after the data are in
<br />hand and have been nnnma~ed through. but
<br />th:lt~s another topic.) .~ statistic:i test of the cull
<br />hypothesis then is conducted, yy'nich generates a
<br />P-y:uue. Fin:tllr. the cluestiun oC •a•hat that valve
<br />mr:ms re!atiyr to the u~lll hypothesis is consicl-
<br />ere~l. Sey era! intrrj~ret:ltious of P ~,iten are rnacle.
<br />Sometimes P is yieyyecl as the pml,ahilih• that
<br />the results obtained yyere chle to chance. Small
<br />values are taken to indicate that the results yvere
<br />not just a happenstance. A lar~~e yallle ~f P. say
<br />for a test that µ = Q, yyonld suvuest that the
<br />mean r actualh~ rea,rclyd was due to chance,
<br />and µ cnuld be assuMed to hr zero iSchmidt
<br />anti I [enter 199 ~ ).
<br />Other times, 1-P is cuusidered the reliabilih•
<br />of the result: that is. the probahilih• of ~ettina-
<br />c:;
<br />RECEIVE
<br />
|