Lady tasting tea
In the design of experiments in statistics, the lady tasting tea is a famous randomized experiment devised by Ronald Fisher and reported in his book The Design of Experiments (1935).[1] The experiment is the original exposition of Fisher's notion of a null hypothesis, which is "never proved or established, but is possibly disproved, in the course of experimentation".[2][3]
The lady in question claimed to be able to tell whether the tea or the milk was added first to a cup. Fisher proposed to give her eight cups, four of each variety, in random order. One could then ask what the probability was for her getting the specific number of cups she identified correct, but just by chance.
Fisher's description is less than 10 pages in length and is notable for its simplicity and completeness regarding terminology, calculations and design of the experiment.[4] The example is loosely based on an event in Fisher's life. The lady in question was Muriel Bristol, and the test used was Fisher's exact test.
The experiment
- The experiment provided the Lady with 8 randomly ordered cups of tea—4 prepared by first adding milk, 4 prepared by first adding the tea. She was to select the 4 cups prepared by one method.
- This offered the Lady the advantage of judging cups by comparison.
- The Lady was fully informed of the experimental method.
- The null hypothesis was that the Lady had no such ability.
- Note that in Fisher's approach, there is no alternative hypothesis;[2] this is instead a feature of the Neyman–Pearson approach.
- The test statistic was a simple count of the number of successes in selecting the 4 cups.
- The null hypothesis distribution was computed by the number of permutations. The number of selected permutations equalled the number of unselected permutations.
- Using a combination formula, with total cups and cups chosen, there are possible combinations.
Success count | Permutations of selection | Number of permutations |
---|---|---|
0 | oooo | 1 × 1 = 1 |
1 | ooox, ooxo, oxoo, xooo | 4 × 4 = 16 |
2 | ooxx, oxox, oxxo, xoxo, xxoo, xoox | 6 × 6 = 36 |
3 | oxxx, xoxx, xxox, xxxo | 4 × 4 = 16 |
4 | xxxx | 1 × 1 = 1 |
Total | 70 |
- The critical region was the single case of 4 successes of 4 possible based on a conventional probability criterion (< 5%; 1 of 70 ≈ 1.4%).
If and only if the Lady properly categorized all 8 cups was Fisher willing to reject the null hypothesis – effectively acknowledging the Lady's ability at a 1.4% significance level (but without quantifying her ability). Fisher later discussed the benefits of more trials and repeated tests.
David Salsburg reports that a colleague of Fisher, H. Fairfield Smith, revealed that in the test, the woman got all eight cups correct.[5][6] The chance of someone who just guesses getting all correct, assuming she guesses that four had the tea put in first and four the milk, would be only 1 in 70 (the combinations of 8 taken 4 at a time).
In popular science, Salsburg published a book entitled The Lady Tasting Tea,[5] which describes Fisher's experiment and ideas on randomization. Deb Basu wrote that “the famous case of the ‘lady tasting tea’” was “one of the two supporting pillars … of the randomization analysis of experimental data.”[7]
See also
References
- ↑ Fisher 1971, II. The Principles of Experimentation, Illustrated by a Psycho-physical Experiment.
- 1 2 Fisher 1971, Chapter II. The Principles of Experimentation, Illustrated by a Psycho-physical Experiment, Section 8. The Null Hypothesis.
- ↑ OED quote: 1935 R. A. Fisher, The Design of Experiments ii. 19, "We may speak of this hypothesis as the 'null hypothesis', and it should be noted that the null hypothesis is never proved or established, but is possibly disproved, in the course of experimentation."
- ↑ Fisher, Sir Ronald A. (1956) [The Design of Experiments (1935)]. "Mathematics of a Lady Tasting Tea". In James Roy Newman. The World of Mathematics, volume 3. Courier Dover Publications. ISBN 978-0-486-41151-4.
- 1 2 Salsburg (2002)
- ↑ Box, Joan Fisher (1978). R.A. Fisher, The Life of a Scientist. New York: Wiley. p. 134. ISBN 0-471-09300-9.
- ↑ Basu (1980a, p. 575; 1980b)
- Fisher, Ronald A. (1971) [1935]. The Design of Experiments (9th ed.). Macmillan. ISBN 0-02-844690-9.
- Basu, D. (1980a). "Randomization Analysis of Experimental Data: The Fisher Randomization Test". Journal of the American Statistical Association 75 (371): 575–582. doi:10.2307/2287648. JSTOR 2287648.
- Basu, D. (1980b). "The Fisher Randomization Test", reprinted with a new preface in Statistical Information and Likelihood : A Collection of Critical Essays by Dr. D. Basu ; J. K. Ghosh, editor. Springer 1988.
- Kempthorne, Oscar. "Intervention experiments, randomization and inference". In Malay Ghosh and Pramod K. Pathak. Current Issues in Statistical Inference – Essays in Honor of D. Basu. Hayward, CA.: IMS. pp. 13–31. doi:10.1214/lnms/1215458836.
- Salsburg, D. (2002) The Lady Tasting Tea: How Statistics Revolutionized Science in the Twentieth Century, W.H. Freeman / Owl Book. ISBN 0-8050-7134-2