Talk:Simpson's paradox

From Wikipedia, the free encyclopedia

This article is within the scope of WikiProject Mathematics, which collaborates on articles related to mathematics.

Mathematics rating: B Class, Mid Priority. Field: Probability and statistics.

Simpson's paradox has had a peer review by Wikipedia editors which is now archived. It may contain ideas you can use to improve this article.

I'd like to change the first few paragraphs of this article to make it friendlier to folks afraid of math, and was wondering what other people thought. Here's a possibility:

Simpson's paradox is a statistical paradox described by E. H. Simpson in 1951, in which the accomplishments of several groups seem to be reversed when the groups are combined. This seemingly impossible result is encountered surprisingly often in social science and medical statistics.

As an example, suppose two people, Ann and Bob, are let loose on Wikipedia. In the first test, Ann improves 60 percent of the articles she edits while Bob improves 90 percent of the articles he edits. In the second test, Ann improves just 10 percent of the articles she edits while Bob improves 30 percent.

Both times, Bob improved a much higher percentage of articles than Ann - yet when the two tests are combined, Ann has improved a much higher percentage than Bob!

The result comes about this way: In the first test, Ann edits 100 articles, improving 60 of them, while Bob edits just 10 articles, improving 9 of them. In the second test, Ann edits only 10 articles, improving 1 of them, while Bob edits 100 articles, improving 30 of them. When the two tests are added together, both edited 110 articles, yet Ann improved 61 of them (55 percent) while Bob improved only 39 of them (35 percent)!
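As a quick check, the combined totals can be recomputed from the per-test counts quoted in the proposal, which gives 61 of 110 for Ann and 39 of 110 for Bob. The sketch below is only an illustration; the names `results` and `combined` are made up for this note and do not come from the article.

```python
# (articles improved, articles edited) per test, as quoted in the proposal
results = {
    "Ann": [(60, 100), (1, 10)],
    "Bob": [(9, 10), (30, 100)],
}

def combined(tests):
    """Add up improved and edited counts across all tests."""
    improved = sum(i for i, _ in tests)
    edited = sum(e for _, e in tests)
    return improved, edited

for name, tests in results.items():
    per_test = [f"{i / e:.0%}" for i, e in tests]
    improved, edited = combined(tests)
    print(name, per_test, f"combined {improved}/{edited} = {improved / edited:.0%}")
```

Bob has the higher rate in each test, yet Ann has the higher combined rate, because each editor's total is dominated by a different test.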

Seems reasonable enough to me, although I wouldn't say "accomplishments" for "successes". "Success" in statistical jargon is not necessarily a positive thing! How about "ratings" instead?

I presume you are intending to leave the remaining paragraphs unchanged? -- Securiger

That was my thought, yes. So I'll go ahead and do this, then. DavidWBrooks 13:13, 17 Feb 2004 (UTC)

(However, looking it over again, I'll do my arithmetic correctly before I post it! Oops ... DavidWBrooks)

Is it a problem that the example explicitly refers to Wikipedia? (I'm thinking WP:SELF.) Avram 21:24, 10 March 2006 (UTC)

1 Nice work
2 The same paradox?
3 How is this a paradox?
4 One of the finer Wiki entries
5 A word
6 The kidney case
7 Suggested addition to aid paradoxical comprehension
8 How is the Electoral College an example of Simpson's paradox?
9 Do we need the fake example?
10 Correlation/Causation
11 Vector vs. Line

Nice work

I have recently been browsing the logic & game theory articles. This is the best I have seen so far. Congratulations to all concerned.

John Moore 309 12:36, 24 April 2006 (UTC)

I just read this article too, having come from Texture filtering and I am very impressed! This article is brilliant! --137.205.76.219 15:48, 27 January 2007 (UTC)

The same paradox?

I wonder if this is the same paradox and if it could be used as an example. I find it very easy to understand — and from real life.

Assume a population with 50% men and 50% women, and that competence is spread in the same way in both groups. Imagine a situation where women are required to have more competence to get a promotion to management. You will then notice that women on the management level are more competent than male managers and that women in sub-management are more competent than men on the same level. This seems paradoxical at first considering that, on the whole, women and men are equally competent. Samulili

It's a nice example. In order to convince myself (and perhaps others) that it's the same paradox, I'll now assume that on average, the women are slightly less competent than the men (no offence, just to sharpen the paradox and make it clearer that Simpson is involved), and I'll add some numbers:

Suppose we have 100 men and 100 women. 18 of the men are highly competent, and 14 of them are in the management. Of the 82 less competent men, 6 are in the management. 17 of the women are highly competent, but only 8 of them are in the management. Of the 83 less competent women, 2 are in the management. Then, of the women in the management, 8/10=80% are highly competent, and of the sub-management women, 9/90=10% are highly competent. Of the men, only 14/20=70% of those in the management group are highly competent, and only 4/80=5% in the sub-management group are highly competent. So, in both groups, more of the women than of the men are highly competent, but combined, only 17/100=17% of the women are highly competent, while 18/100=18% of the men are.
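The counts above can be checked mechanically. This is a minimal sketch using only the numbers given in the paragraph; the names `counts`, `share`, and `overall` are illustrative.

```python
# counts[group][level] = (highly competent, total at that level)
counts = {
    "women": {"management": (8, 10), "sub-management": (9, 90)},
    "men": {"management": (14, 20), "sub-management": (4, 80)},
}

def share(high, total):
    """Fraction of a group that is highly competent."""
    return high / total

def overall(levels):
    """Pool both levels for one group."""
    high = sum(h for h, _ in levels.values())
    total = sum(t for _, t in levels.values())
    return high, total

for group, levels in counts.items():
    for level, (high, total) in levels.items():
        print(f"{group:5s} {level:14s} {high}/{total} = {share(high, total):.0%}")
    high, total = overall(levels)
    print(f"{group:5s} {'overall':14s} {high}/{total} = {share(high, total):.0%}")
```

Within each level the women's share is higher, while pooled over both levels the men's share is higher, which is the reversal described above.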

Conclusion: This is indeed a Simpson paradox, and the only change compared to that suggested above is that I made it a little sharper by making the women less competent over all instead of just equally competent. However, I like the original better, and I think someone should go ahead and add it to the article. I'm afraid it takes skills beyond mine to write it in a simple way that makes it clear that it is a Simpson's paradox.--Niels Ø 20:04, 2 May 2006 (UTC)

How is this a paradox?

For Ann, the time that she royally screwed up barely counts, while the time that she did poorly counts the most. For Bob, the time that he royally screwed up hugely affected his total, while the time that he did amazing barely counts at all. I don't quite see why the results are surprising. Anyone care to enlighten me?

It all makes sense in the end, but it's still initially surprising for most people who are not aware of the explanation or suspect it. If you only know the partial percentages, then the total percentages would come as a surprise to most people. Obviously, once the weights are introduced, the initial surprise is exchanged for comprehension, but then a paradox is only a seemingly self-contradictory statement anyway, so I see nothing wrong with calling this a paradox. -Kvaks 01:09, 2 September 2005 (UTC)
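The role of the weights can be written out explicitly. In the notation below (chosen for this note, not taken from the article), $p_1, p_2$ are an editor's success rates in the two tests and $n_1, n_2$ are the numbers of articles edited; the figures are the per-test counts from the Ann and Bob example.

```latex
\[
  p_{\text{total}} = \frac{n_1 p_1 + n_2 p_2}{n_1 + n_2}
\]
\[
  \text{Ann: } \frac{100 \cdot 0.60 + 10 \cdot 0.10}{110} = \frac{61}{110} \approx 0.55,
  \qquad
  \text{Bob: } \frac{10 \cdot 0.90 + 100 \cdot 0.30}{110} = \frac{39}{110} \approx 0.35
\]
```

Each total is a weighted average of the two per-test rates, so it sits close to whichever test carried the larger $n$.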

It's not strictly a paradox, since there is a straightforward solution. But it's widely known by that name, so we ought to keep it. --best, kevin ···Kzollman | Talk··· 04:19, September 2, 2005 (UTC)

I do not support the idea that the phenomenon is not "really" a paradox. Many good paradoxes are based on representing a situation in such a way that a false conclusion seems obvious.--Niels Ø 08:18, 6 October 2006 (UTC)

One of the finer Wiki entries

The storytelling conceit, complete with sly reference to those other Simpsons, "Bart" and "Lisa," works well for me. This kind of explanation helps me in explaining a concept to others, even as I work to fully grasp it myself. The inclusion of the Wikipedia within the definition does not seem overly self-referential, as one observer has worried. Entries like this are the reason I seek out Wikipedia's take on things before looking to other, traditional sources. Thanks for an entertaining and elucidating entry! Matthew Treder 18:42, 2 May 2006 (UTC)

Agreed. The examples are clear, well written, and logical. And the references to Bart & Lisa Simpson are not only clever and fun, they also make it EXTREMELY easy for many people to remember this phenomenon as well as its associated name. If we name them Dick & Jane it would be far less memorable. How great it is when practicality and humor intersect! Jon Miller

Indeed! --WikiSlasher (talk) 13:01, 11 December 2007 (UTC)

Why did I not get this earlier... mattbuck (talk) 14:22, 11 December 2007 (UTC)

I'm new to commenting here, so I apologize if I'm doing this wrong.

The question was raised as to whether or not it's appropriate for this article to reference Wikipedia [WP:Self]. I believe it may be, but should certainly be discussed. The point of avoiding self references, as I read that guideline, is to not use phrases such as "elsewhere on this site" or "in another Wikipedia article". The point is NOT to pretend that Wikipedia doesn't exist.

The article could reference bowling or mowing lawns or a great host of other activities where the characters' performance can be quantified. I suspect the Wikipedia reference was used simply because the author assumes that those reading it will be familiar with the process.

However, I don't believe that the act of editing Wikipedia articles is a good example of much of anything, because most people I know who read Wikipedia have never edited anything. I've been reading for years and only today created an account to post anything. So the example took a little more effort for me to understand than many other possible analogies would have.

And, continuing that thought and going back to the self reference guideline, the plan as I have understood it is to eventually do a printed Wikipedia. Regardless of the form, any time this article appears outside the wikipedia.org website the chances of the reader understanding the example become greatly diminished.

In other words, I like the example used here, but a different example may be more comprehensible and practical.

Ha! That's funny! Thanks for putting Bart and Lisa in the Simpson's paradox. --69.67.229.185 03:02, 26 August 2006 (UTC)

A word

The Lisa-Bart example ends in this sentence: But it is possible to retell the story so that it appears obvious that Bart is more diligent. Would it not be more natural to say "tell" instead of "retell", since it is the original statement of the situation that appears to have this conclusion?--Niels Ø 08:18, 6 October 2006 (UTC)

Good point. Thanks. I've changed that line to something that I think is even better: But it is possible to have told the story in a way which would make it appear obvious that Bart is more diligent. --Keeves 12:13, 6 October 2006 (UTC)

The kidney case

I expanded the text on the two factors at the end of the section to relate more specifically to the medical example. Reading what I've written, it seems natural to ask: Why did doctors give the inferior treatment B to the milder cases, when A is better in those cases too? I have not consulted the references on this case story, but perhaps someone who has (or will) can answer my question. I imagine one of two answers: (i) Before this particular investigation, they did not know that B was inferior even in the milder cases. (ii) Treatment A is more expensive, and is therefore primarily given to those patients who need it the most. In fact, if there are no other confounding variables involved, and if A is more expensive than B, then, within a given budget, the largest number of cures is obtained by treating as many as possible from the large-stone-group with A.--Niels Ø 13:29, 13 October 2006 (UTC)

Thanks for your changes; it reads more clearly. I don't have access to the original study, but from the review and title it appears to compare surgery, ultrasound and/or using catheters. Unsurprisingly the open surgery (treatment A) is the most effective, and is probably the most expensive, with the greatest post-treatment complications. TobyK 13:36, 31 October 2006 (UTC)

Suggested addition to aid paradoxical comprehension

existing section under 'Explanation by example' subtitle

[Who is more accomplished? Lisa and Bart's mutual friends think Lisa is better—her overall success rate is higher. But it is possible to have told the story in a way which would make it appear obvious that Bart is more diligent.]

append with the addition of

+ [However, some will note that the use of statistical analysis to present a biased view is not uncommon, for example in politics. On close inspection, one may find that Bart's edits are of a higher quality, elucidating complex subjects poorly understood by the general populace. Although Lisa and Bart's mutual friends think Lisa is better, history may judge Bart's legacy to humanity to be more significant.]

This may help answer those who fail to comprehend the paradoxical nature

Teeteetee 09:51, 2 March 2007 (UTC)

How so? The quality of the edits is unrelated to the paradox we're dealing with here; it's entirely about the number of edits.--Niels Ø (noe) 09:56, 2 March 2007 (UTC)

Extracted from the article's sub-section. . . .

" worth of work/Success/managed/achieved successful/worse/we feel/disappointed/accomplished/mutual friends think/better/diligent "

Are these "entirely about the number of edits" ? Teeteetee 19:34, 4 March 2007 (UTC)

OK, I didn't put that as clearly as I should have. The point is, we need not distinguish very good edits from minor improvements; that's not what the example is about. Whether they elucidate complex subjects is utterly irrelevant. However, the words accomplished and diligent that you quote may be misleading for the same reason: They seem to suggest that some edits not merely improve articles but display particular diligence, which (though of course true) is, as I said, utterly irrelevant.--Niels Ø (noe) 20:25, 4 March 2007 (UTC)

I do not understand your meaning.

I have tried several times to understand.

If you could avoid criticising existing aspects of the article I might better understand.

....

Do you agree with the following statement ?

"If Bart only edited one article (and that one edit brought about world peace), Lisa's lifetime of editing thousands of articles may statistically appear better (to friends, family, politicians, religious leaders, and others viewing the statistical view), but may be judged by history to be worth less than Bart's one edit."

Teeteetee 11:52, 8 March 2007 (UTC)

Sure, but it's got nothing to do with Simpson's paradox. The Bart-and-Lisa example is solely about the number of edits that were improvements, and the number that were not. It does not distinguish between large improvements and small improvements.--Niels Ø (noe) 16:24, 8 March 2007 (UTC)

By using "it" (in the sentence above "It does not distinguish..."), I assume you mean Simpson's Paradox.

If so, you appear to be writing "Simpson's Paradox does not distinguish between large improvements and small improvements"

....

or, put alternatively,

When Simpson's Paradox occurs improvements can be difficult to distinguish.

Teeteetee 17:29, 12 March 2007 (UTC)

If you are seriously suggesting changes to the article, I think you should either be bold and make those changes, or explain clearly at this talk page what you'd like to change, and why. I've no idea what your point is.--Niels Ø (noe) 22:01, 12 March 2007 (UTC)

Thank you for the advice, but I was bold on 1 March 2007. Also, I hoped I had clearly explained my suggestion above (at 09:51, 2 March 2007).

My original article edit can be found here> [1] at the end of the 'Explanation by example' section. Teeteetee 12:31, 13 March 2007 (UTC)

Well, I believe I have made my concerns clear, whereas I do not understand what your point is. Do you think your contribution is related to Simpson's paradox, or does it merely offer an alternative angle on the Lisa-and-Bart example, an angle unrelated to Simpson's paradox? Do you actually understand Simpson's paradox, or are you trying to understand it?--Niels Ø (noe) 12:57, 13 March 2007 (UTC)

I believe I understand Simpson's paradox.

I also believe context aids understanding.

I was attempting to provide others with some context. Teeteetee 13:50, 3 April 2007 (UTC)

Then I am at a loss. I am certain I understand Simpson's paradox, and I am certain it (in the Bart-Lisa-example) has nothing to do with distinguishing between large and small improvements. The context is clear (wikipedia editing, some edits being improvements, others not). Adding more context - irrelevant to the paradox - will confuse matters by having readers trying to understand how it is relevant. Please explain, what is the point?--Niels Ø (noe) 14:49, 3 April 2007 (UTC)

How is the Electoral College an example of Simpson's paradox?

In both the Lisa/Bart example and the kidney stones example, there is a 3x2 table with 6 entries. How can the Electoral College data be presented in this way? There are the 2 parties, so that's the "2" dimension. But what is the "3" dimension?

Example	the "2" dimension	the "3" dimension
Lisa / Bart	Lisa / Bart	Week 1 / Week 2 / Total
kidney stones	Treatment A/B	small stones / large stones / together
Electoral College	Rep / Dem	??? / ??? / total number of Electoral College votes

--Occultations 21:46, 15 May 2007 (UTC)

I suspect the analogy is that one can "win" the nationwide popular vote but, under certain circumstances, lose in the College. (The College cannot reproduce the paradox exactly, since the outcome in each state is related to the difference in votes only through the sign of the difference, not its magnitude. One could not lose the College if every state was won.) Baccyak4H (Yak!) 03:07, 16 May 2007 (UTC)
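To make the popular-vote point concrete, here is a small sketch with an entirely hypothetical three-state election. The states, elector counts, vote totals, and the candidate labels P and Q are all invented for illustration. It shows only the mismatch between the popular vote and the College, not Simpson's paradox itself.

```python
# Hypothetical election; every name and number here is invented.
# Each entry: (state, electors, votes for P, votes for Q)
states = [
    ("North", 5, 90, 10),
    ("East", 6, 45, 55),
    ("West", 6, 45, 55),
]

def tally(states):
    """Return nationwide popular totals and winner-take-all elector totals."""
    popular = {"P": 0, "Q": 0}
    electoral = {"P": 0, "Q": 0}
    for _name, electors, votes_p, votes_q in states:
        popular["P"] += votes_p
        popular["Q"] += votes_q
        # Only the sign of the margin matters, not its size (ties not handled).
        winner = "P" if votes_p > votes_q else "Q"
        electoral[winner] += electors
    return popular, electoral

popular, electoral = tally(states)
print("popular:", popular)
print("electoral:", electoral)
```

Here P leads the popular vote 180 to 120 but trails 5 to 12 in electors. Because each state contributes only the sign of its margin, a candidate who carried every state would necessarily collect every elector, so a full reversal of the Simpson kind cannot occur.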

I've removed the Electoral College example; it's not an example of Simpson's paradox. Unless, that is, someone can show how it fits the 3x2 table pattern. --Occultations 12:53, 28 May 2007 (UTC)

Do we need the fake example?

We have four different real-world examples now, some with statistics. Do we need the "bart/lisa" fake example to explain it any more? At the very least, I'd like to move the real examples up above the pretend one - I think lots of people stop reading when the article lurches into "explaining" mode. - DavidWBrooks 23:41, 22 May 2007 (UTC)

I was about to make an almost identical heading. It's a pretty asinine self-reference in addition to being original research. Milto LOL pia 04:32, 23 May 2007 (UTC)

I agree with the removal of fake examples (as I've just done with the baseball example). This section should be moved below the examples, and then transformed into a general discussion of what may cause the paradox to appear (talking about weighted averages, confounding variables, etc). Schutz 07:12, 23 May 2007 (UTC)

Then I'll do the move, and we can do the transformation later. - DavidWBrooks 10:00, 23 May 2007 (UTC) .. oops, never mind: somebody already did.

You're still welcome to do the transformation now that I have done the move :-) Schutz 13:44, 23 May 2007 (UTC)

But that will require thought and skill - I hoped I could get away with a nice, mindless move. - DavidWBrooks 14:00, 23 May 2007 (UTC)

Too late :-) I'll think about the transformation, but, as you say, it requires quite a bit of thinking first. Before that, I'll add a few more references and reformat the examples, and hopefully (if I can get around to doing it), add 2 images. Schutz 21:27, 23 May 2007 (UTC)

I have re-added the example after User:Miltopia removed it, since the consensus above was for now to move the example rather than delete it. We all agree that we have enough real examples and do not need fake examples on top of that; however, this section is the only one that goes beyond giving an example and also discusses the question of weighted averages. I don't think it is very good, or that it covers everything it should, but at the moment it is better than nothing. If nothing happens with it in the near future, then it can be removed. Schutz 07:44, 24 May 2007 (UTC)

Correlation/Causation

Would it be an idea to add Correlation does not imply causation into the 'See also' section? Apologies if this has already been covered, I don't find any references to it. Flex Flint 08:57, 17 July 2007 (UTC)

Vector vs. Line

I reverted a diff [2] changing vector to line in one instance. First, the section it's in is called "Vector Interpretation", so referring to vectors is the expected language of that section. Second, the word change was made in only one instance, making the whole paragraph internally inconsistent as it switched from line in the first instance to vector in all others. qitaana (talk) 22:17, 26 February 2008 (UTC)
