Talk:Birthday problem
From Wikipedia, the free encyclopedia
Archives |
/Archive 1 |
[edit] "Near matches" - careless comment?
It is very careless to say that
"Thus in a family with six members, it is more likely than not that two members will have a birthday within a week of each other."
While i realise that it's tempting to make the group of six people a family, birthdays within a family are generally far from independent, especially if siblings are included!
Given the article it belongs to, i think the statement is out of place.
—The preceding unsigned comment was added by Rileen (talk • contribs) 12:51, 9 August 2006 (UTC).
-
- Good point. Fixed. Thanks. --Keeves 18:33, 9 August 2006 (UTC)
-
-
- Glad to see that - this was my first (which is why i also forgot to mention my name) teeny-weeny contribution to Wikipedia. Thanks! --Rileen, 11 August 2006
-
This is a valid statement within the context of the 'uniform birthday distribution' ideal set forth at the start of article. —The preceding unsigned comment was added by Qe2eqe (talk • contribs) 18:17, 17 April 2007 (UTC).
[edit] 29 February
It makes the article significantly more complex to take into account 29 February, so most of it doesn't — except for bits of the first paragraph. I think the article should simply state early on that 29 February will be completely ignored, and then it should abide by that.
Does anyone object to this?
Ruakh 17:09, 13 August 2006 (UTC)
- I had thought of that exact idea yesterday, and I'm glad you suggested it. I have now added such a paragraph, and I hope it will be noticable enough to bring this back-and-forth to an end. --Keeves 21:32, 13 August 2006 (UTC)
-
- Hmm. Your paragraph is very clear, but I worry that it might be a bit much; after all, it's not really the central point of the article, but really just a caveat clarifying that the article is about an abstract mathematical concept and does not necessarily apply perfectly to real life. It seems like a single sentence could conceivably suffice. (Further, it's not true that "it is just as likely that a randomly-chosen person's birthday will be January 21 or October 5 or almost any other day of the calendar"; different times of year actually have fairly different birth rates.) Ruakh 03:12, 14 August 2006 (UTC)
[edit] Mystery!
"23"! --nlitement [talk] 13:37, 30 September 2006 (UTC)
[edit] Approximation based on Taylor Series expansion
Isn't the expression obtained for p_bar(n) =approx= 1*e^-1/356*e^-2/365*...*e^-(n-1)/365 in the "Approximation" subsection based on the inequality "(1-x) < (e^-x)"? It doesn't seem to be based on the Taylor Series expansion of the exponential function; instead, it directly uses the exponential function. —The preceding unsigned comment was added by Wiki user 618 (talk • contribs) 03:13, 9 November 2006 (UTC).
- But exp(-x) = 1 - x + x^2/2 - ... IS the Taylor expansion of exp(-x). Cut the expansion off after the term in x, and you get exp(-x) = 1-x. You can see that 1-x < exp(-x), by looking at the quadratic term of the Taylor expansion (the Taylor expansion is an alternating series). So you are basically saying the same thing, but the article actually shows why the inequality holds. —The preceding unsigned comment was added by 134.58.253.131 (talk • contribs) 11:30, 12 February 2007 (UTC).
[edit] Near matches?
Can we either cite or insert the math for the "near match" birthdays? The information is stated without proof. arctic 00:28, 27 November 2006 (UTC)
- You're right to be concerned; I just wrote a quick Perl script to investigate this probabilistically, and it seems like the correct values for the table would be:
-
-
within k days # people required 0 23 1 14 2 11 3 9 4 8 5 8 ← not 7, as the page currently has 7 7 ← not 6, as the page currently has
-
- (Note: that was only probabilistic, but I used some basic techniques to ensure accurate results, so I'm fairly confident those are the correct values. As you say, though, a source or mathematical justification is quite necessary here. BTW, lest anyone be concerned that my random number generator might be biased — that's a valid concern, and it might well be, but if that were affecting the results, you'd typically expect greater synchronicity, hence lower values for the number of people necessary to ensure a 50% probability of a near match. The only bias that would produce higher values is a bias against synchronicity; that is, if the generator has a greater-than-random tendency against yielding recently-yielded numbers. I'll run some tests to ensure that that's not the case here, then comment back.)
- Ruakh 01:49, 27 November 2006 (UTC)
-
- Okay, I ran some tests, and it doesn't look like my random number generator has any bias against synchronicity. So, I think the table in the article is wrong. Ruakh 02:11, 27 November 2006 (UTC)
-
-
- The table was indeed wrong, so I corrected it (and cited a source for the data). It's unfortunate that the error remained so long. --Sopoforic 03:16, 21 February 2007 (UTC)
-
[edit] Collision counting
This was useful to me testing a password generation programme, and I couldn't find anywhere else in Wikipedia that gave this information. The hash collision article could perhaps link directly to this section.
I suggest that the section's usefulness would be improved by the addition that for 1 << n << d, as is typical, the formula reduces to n2 / 2d.
Also, knowing the standard deviation would be useful, as I'd like to assess how reasonable my observed number of collisions is. Jlittlenz 05:07, 8 December 2006 (UTC)
I am also interested in more detail on the counting issue. In particular, rather than just knowing the expected number of collisions, I'm wondering what is the distribution of the maximum number of collisions for any particular day across the year. To put it in hash terms, if you do get a hash collision, you typically use a tree or a linked list to store all the data with the same hash value. The question is, how many items do you have in your tree or linked list, in particular how long is the longest list going to be? It would be good to have a formula p(n,d,m) where n is the number of people/hash entries, d is the number of days per year/size of the hash table, and m is the maximum number of collisions. For example p(1,d,0) = 1 for all d, since there will be 0 collisions with only one person! Conversely p(1,d,m) = 0 for all m > 0, more generally p(n,d,m) = 0 for all m >= n—Preceding unsigned comment added by 213.123.216.147 (talk) 13:38, 18 October 2007 (UTC)
[edit] Forward vs. reverse?
Is there any reason for the notion that the "birthday problem" is computing p given n, while the "reverse birthday problem" is computing n given p (as implied in #Reverse problem)? Insofar as the two are separate problems, it seems to me that the ordinary birthday problem is computing the minimal n such that p > ½, so the "reverse birthday problem" would be computing p given n. —RuakhTALK 16:23, 8 December 2006 (UTC)
[edit] 366 People isn't necessarily 100%
In the article, it claims, although it cannot actually be 100% unless there are at least 366 people.[1] However, the real number should be 367, because there are 366 days in a leap year. Someone could be born on February 29th, and then no one would have the same birthday out of 366 people. Correct me if I'm wrong, Thanks, RAmen, Demosthenes 21:48, 15 February 2007 (UTC)
- Yes, you are correct, but check the footnote. We're assuming 365 days in a year for sake of simplicity in calculations. — Edward Z. Yang(Talk) 22:48, 15 February 2007 (UTC)
[edit] Featured Article
I think this is an interesting subject, great article i think it's featured article quality comments? I dont know how to tag it for that status myself.. —The preceding unsigned comment was added by 83.49.105.6 (talk) 20:56, 12 March 2007 (UTC).
[edit] Conundrum, not paradox
Paradox? Don't you mean conundrum. a paradox is something I thought to be impossible!! And this is not. —The preceding unsigned comment was added by 131.111.8.98 (talk • contribs) 00:05, 18 April 2007 (UTC).
- To quote the third sentence of the article:
I have only come across this before referred to as the 'Birthday Surprise' which is a better description since it is not a paradox. You should mention that it is also known as 'Birthday Surprise' (as it has in note 4 and the Klamkin and Newman reference).62.255.240.100 12:18, 15 May 2007 (UTC)
- Look up "paradox" in a dictionary. The first definition usually is something like, "A true statment that seems not to make sense at first glance." Then, somewhere further down the list of definitions, you'll find the one that says, "a mathematical proposition that can neither be proven, nor disproven." —Preceding unsigned comment added by 192.55.12.36 (talk) 20:08, 29 February 2008 (UTC)
[edit] My IB Math Studies Project
Hi people, I have already contributed in the past to this article, and having done my math studies project on this topic, I would like to contribute with my testings.
By the way why is my name always deleted from the references?? I contributed to this article —The preceding unsigned comment was added by 90.0.168.169 (talk) 19:15, 30 April 2007 (UTC).
- Hi,
- Thanks for your contributions!
- Please note that the purpose of the "references" section is to show the sources from which Wikipedians take their information, not to show the Wikipedians who took their information from those sources.
- Unless you can cite a specific claim in the article that comes from your high school math project and not from the other references, it looks (and is) really awkward to list a contributor's high school math project in the references. And if there is a claim that comes from your high school math project and not from the other references, then we should consider removing that claim until such time as you get your research published in a peer-reviewed journal; verifiability from reputable sources is a fundamental element of Wikipedia. (This is not in any way to criticize your math project; I'm familiar with the IB program, and am sure that your project was very impressive by high school standards. Kudos to you. But that doesn't make it appropriate as a reference here. After all, just because something was an IB project, that doesn't mean that it garnered a good grade, or that it would stand up to professional scrutiny. IB projects are first and foremost a learning experience.)
Hi again As I was doing my math project, ergo in the past year and a half, I periodically updated this article with what I found with my researches. Unfortunately I didn't know that my project would be sent to IB to be verified. This means that they may think I have copied, even though it is the exact contrary :( That's why I would like my name to appear. Anyways thanks for the answer —The preceding unsigned comment was added by 90.0.46.7 (talk • contribs) 12:32, 1 May 2007 (UTC).
So is it ok to put me in the references? —The preceding unsigned comment was added by 90.0.46.7 (talk • contribs) 16:08, 1 May 2007 (UTC).
And In the External link can I put back the test I did with the players? —The preceding unsigned comment was added by 90.0.46.7 (talk • contribs) 20:36, 1 May 2007 (UTC).
[edit] Nonsense.
i didn't believe that these calculations are correct so i made an experiment: i rounded up 3 groups of people. every group constisted of 50 people so there SHOULD be a 97% chance that 2 of them celebrate their birthday on the same day (in each group).
guess what: there weren't 2 guys with the same birthday in none of the groups. in fact there weren't even any guys in these 150 persons who had their birthday on the same day.
mathematically, the chance of that happening is close to 0 (like 0,0x%). so the result of the experiment would be far more than highly unlikely. still, it proves that this calculation (like everything in maths) is just another pathetic attempt of human beings to define the nature of our environment. —The preceding unsigned comment was added by Sonandzon (talk • contribs) 21:26, 5 May 2007 (UTC).
- Re: Nonsense
- Unsigned, juvenile, biased, and disparaging commentary will be (and has been, here) deleted. The validity of basic statistical and mathematical methodology is beyond reproach. Lesotho 23:09, 5 May 2007 (UTC)
-
-
- Sonandzon, I think this is an interesting study. If what you found is repeatable, it shows that although the mathematics is correct, the assumptions behind it are not. Why don't you do a serious experiment and publish your findings in a peer reviewed conference or journal? -Pgan002 00:26, 22 June 2007 (UTC)
-
- Re: Nonsense: You are obviously lying. If you would have done what you claim you would have found lots of matching pairs.Likebox 14:38, 18 October 2007 (UTC)
-
- Please don't feed trolls. (In this case it's not a big deal, as (s)he's presumably moved on, but it's a bad habit to get into. If you don't know what I'm talking about, please read Wikipedia:What is a troll?.) —RuakhTALK 03:25, 19 October 2007 (UTC)
[edit] Wrong
I disagree with the mathematics on which this article is based.
The probability that in a group of two, both share the same birthday = 1/366.
The probablility that in a group of three, one pair share the same birthday (three people make 3 pairs = (1/366) + (365/366)*(1/365) + (365/366)*(364/365)*(1/364) = 3/366
The probability that in a group of four (i.e. 6 pairs) that one pair share the same birthday is (1/366) + (365/366)*(1/365) + (365/366)*(364/365)*(1/364) + (365/366)*(364/365)*(363/364)*(1/363) + (365/366)*(364/365)*(363/364)*(362/363)*(1/362) + (365/366)*(364/365)*(363/364)*(362/363)*(361/362)*(1/361) = 6/366
From this it can be seen that for a group of n people, there will be n(n − 1) / 2 pairs, and that the probability that one pair share the same birthday will thus be (n(n − 1) / 2) / 366.
Hence the probability that a single pair in a group of n people will share the same birthday will be at least 50% when there are 133 pairs, because 133=366/2. This occurs when n = 19 (approximate to nearest whole).
The probability that a single pair in a group of n people will share the same birthday is 100% when there are 366 or more pairs of people: this occurs when n = 27 (approximate to nearest whole). --Justificatus 15:47, 6 August 2007 (UTC)
- The article explicitly skips leap years, but it sounds like you're calculating based on one. Mmernex 16:02, 6 August 2007 (UTC)
- You're close, but a bit off. The problem is, it's possible for three people to all share a birthday, and you're counting each case of this as though it were three cases of two people sharing a birthday, when for our purposes it's actually equivalent to just one case of two people sharing a birthday. (And, analogously with larger groups.) In more general terms, remember that P ( A ∨ B ) = P ( A ) + P ( B ) − P ( A ∧ B ); hence, P ( A ∨ B ) ≠ P ( A ) + P ( B ) (except in the special case that A and B are mutually exclusive). —RuakhTALK 18:39, 6 August 2007 (UTC)
-
- There may be something more deeply wrong with Justificatus' argument than you realize. Read that last paragraph again. It says that the probability of a shared birthday in a random group of 27 people is 100%. That is to say, it's guaranteed. That is to say, there is no possible way to choose 27 people such that each of them has a different birthday.
-
- Justificatus "disagrees" with the math on this page. But, this is not some ivory tower abstraction that we're talking about here, this should be basic common sense. 192.55.12.36 (talk) 20:53, 29 February 2008 (UTC)
-
-
- I agree with all of your comment except the "more deeply wrong" part. Superficially, the problem with his argument is that its result is nonsensical; but I think "your result is wrong because it contradicts this obvious fact: ___" is less helpful than "your reasoning is flawed because you're making this erroneous assumption: ___". Hence, I tried to explain why his reasoning was flawed, rather than how I knew it was flawed. —RuakhTALK 23:06, 19 April 2008 (UTC)
-
[edit] 365 different birthdays -- what am I missing?
I wish to take issue with the statement, "After having met people with n different birthdays (n < 365), the chance that the next person you meet has a colliding birthday is (365-n)/365."
Suppose I am just getting started. I have met one person, and recorded their birthday. The number of different birthdays n is then 1.
The quoted text asserts that there are 364 chances out of 365 that the second person I meet will have the same birthday as the first one. This seems counter-intuitive. Likewise, if my list of birthdates is long, and I have n = 364, then the equation predicts that there is only one chance in 365 that the next person I meet has a birthday that is already on my list.
In summary, the equation says that at the beginning when n is small, I will have a large number of collisions, and as the size of my list of birthdays grows, the likelihood of a collision will decrease. --Jamesglong 18:14, 13 August 2007 (UTC)
- Heh, good catch. I'm not sure how to fix this, though; in general, that section is assuming some things I'm not sure about. I think for now I'll label it "disputed", and we can work on fixing it up. —RuakhTALK 18:49, 13 August 2007 (UTC)
- I think that (365-n)/365 is the probability that the next person you meet will have a non-colliding birthday. After you've collected one birthday, there is a good chance (364 in 365) that the next person will have a different birthday. Once you've collected 365 birthdays, you cannot collect any new birthdays and the probability is zero.--143.215.153.96 19:53, 14 August 2007 (UTC)
[edit] Coupon collector's problem
I didn't find a page for this problem on wikipeida, altough it is related to this article. There are N coupons on a desk, and you have to pick from them randomly (without taking the picked one from the desk). What is the expected amount of picks, until you have picked al of them at least once.
Google found the answer N*(ln N + Euler's constant) here —Preceding unsigned comment added by 81.183.27.21 (talk) 17:20, August 24, 2007 (UTC)
[edit] Pigeonhole principle
The pigeon hole principle says that there MUST be at least two people with the same birthdays if you have 367 or more people. Imagine 366 people (allowing for leap years), with one birthday on each of the 366 days of the (leap) year. When you introduce the 367th. person, this persons birthday can only happen on one of the already occupied birthdays. —Preceding unsigned comment added by 80.2.196.26 (talk) 18:22, 14 September 2007 (UTC)
- Please read the first footnote. :-) —RuakhTALK 21:43, 14 September 2007 (UTC) —Preceding unsigned comment added by Ruakh (talk • contribs)
[edit] How about sharing the same birthday date?
I think one of the earlier posts discussing a test with groups of 50 may have been looking for the same birthday date (month/day/year) as opposed to simply the same day (month/day). Can you compute the odds for the same date? I suppose the distribution of people would impact the odds - but, can you make estimates based on assumptions for your groups ("even distribution of ages 20-60", or "distribution mirroring earth's population at each age")? Now that seems like a fun excercise. And one I'm not capable of completing. :-(
69.134.204.75 20:58, 14 September 2007 (UTC)
- Well, the "even distribution of ages 20–60" version is not hard: it's the same as the normal birthday problem, but assuming 14,975 distinct days instead of 365. With that assumption, it would take 145 people before you have a 50% chance that two had the same DOB — still amazingly few, IMHO. The latter version would obviously requiring knowing that distribution, which I do not. (And the math would be significantly more complicated, too. I'm actually not sure exactly how one could go about that, other than a brute-force or probabilistic method.) —RuakhTALK 21:53, 14 September 2007 (UTC) —Preceding unsigned comment added by Ruakh (talk • contribs)
[edit] GA Review
This article does not meet the Good Article criteria at this time, and will not be listed. Currently, I'd probably grade it as somewhere on the border of Start-class and B-class. The main issues are some major organizational and prose issues, as well as insufficient reference citations. I am also unclear whether the items mentioned under 'references' were really used to back up anything in the article at all, or are, in fact, just extra books that some might want to use to look up additional information about the subject. In which case, the section should be renamed to further reading.
There's a considerable amount of text in the 'notes' section. Why isn't this used in the article itself. These are clearly not footnotes! It would help editors to review WP:CITE for guidelines on how to properly use inline citations to cite material in an article.
The section header titles are fairly long, and not very clear and concise. It would be advisable to shorten and simplify these so that they are easier to understand when reading the table of contents. Some sections might be reorganized and merged with others as well, and check that all second, third, and fourth level subsection headers are absolutely necessary.
Many sections seem overly dependent on some very complex equations with very little text to actually describe the significance and meaning of these equations. This is likely to scare off many readers. Equations should also have a citation, so that we can verify that what is being included is clearly not original research. The citation should not appear next to the equation itself, it should appear in the sentence preceding and referring to the equation, usually immediately after a colon, such as in "the probability that two birthdays coincide can be attributed to the following equation:[1]
The lead section actually does a reasonably good job of introducing the topic, but could better summarize the article. It might help to review WP:LEAD for tips on this. Also, the 'birthday attack' is mentioned in the lead, and briefly in the text, but there is no source for this information, and the text is pretty vague.
Hopefully, this will help editors improve the article. Good luck! Dr. Cash 04:59, 2 October 2007 (UTC)
- footnote86.42.66.94 (talk) 00:59, 10 January 2008 (UTC)
[edit] Knapsack Problem?
The formulation of the knapsack problem on this page is quite different from the knapsack problem with which I am familiar, which is as it is described in its own Wikipedia page. A bit of research reveals that what is described here appears to be the partition problem. The partition problem is a special case of the subset-sum problem, which is itself a special case of the knapsack problem. Thus, I suppose the characterization in this article is not strictly incorrect, but it is certainly misleading. I apologize if this discussion ends up at the wrong place; this is my first contribution to Wikipedia and I am not quite sure what I am doing. —Preceding unsigned comment added by 216.164.23.210 (talk) 03:24, 26 October 2007 (UTC)
- It ended up in exactly the right place! It's a good idea to sign your talk entries (but not article changes) - writing four tildes (~) or clicking the signature button over the edit window will do that. If you are not logged in, it will just add your IP number and a timestamp (click "Show preview" below the edit window to see the effect).
- I've changed the article to say "a variant of the knapsack problem"; I think it's correct this way, though it could still be improved upon. Go ahead, be bold, and do so, if you see a way!--Niels Ø (noe) 08:12, 26 October 2007 (UTC)
[edit] Solution for "near matches"
Hi,
Can anyone point me to a solution for the "near matches" version (i.e. birhdays within one week of each other)?
Thanks, nyenyec ☎ 17:28, 10 November 2007 (UTC)
[edit] Understanding the problem, not
The section #Understanding the problem is rather confusing to me, and I'm not a total math idiot, although I am a math idiot. In particular, I can't make much sense of the following paragraphs:
- The actual birthday problem asks whether any of the 23 people have a matching birthday with any of the others — not one in particular. (See "#Same birthday as you" below for an analysis of this much less surprising alternative problem.)
- The section #Same birthday as you appears to contradict this, and even if not, it doesn't help understand the problem.
- Since in every group of 23 people there are 23*22/2=253 pairs, which is more than half of the number of days in the year, the chance that one of these pairs has a matching birthday is not small. For 28 people, the number of pairs exceeds the number of days, and the probability of matching is considerably greater.
- "not small"? How great exactly?
- 20*19/2=190 pairs is also more than half of the number of days in the year. Why then is exactly 23 used? Should be explained.
- 28? Where does that number come into play?
- "considerably greater"? How great exactly? How is this relevant to understanding the problem?
I dorftrottel I talk I 03:22, December 3, 2007
- Both the sentences you quote are takes at making the computational result, following from a rather complicated analysis, less counter-intuitive. While, in a group of 23, the chances that someone (anyone in the group) has the same birthday as a previously identified person are low, the chances that someone (anyone in the group) has the same birthday as someone else (anyone else in the group) are higher. The second sentence makes this somewhat quantitative, though still less quantitative (and less complicated) than the exact calculation. If you in a list of 23 persons compare the birthday of the first person on the list to the others, you have 22 chances of success, but if you compare each and everyone to the others, you have 253 chances. (Why 253? Well, you can compare each of the 23 to each of the other 22, yielding 23*22 comparisons, but then you'll have made all comparisons twice (comparing A to B and comparing B to A), so we divide by two, 23*22/2=253.) Each of these many chances is small, because there are 365 days in a year. In a group of 28, there are 28*27/2=378, which exceeds 365. It is thus not so surprising that you have a fair chance of at least one success.
- I think this is a good point to make in the article; possibly it could and should be made more clear. If you understand it now, you may be the right one to improve on it!--Niels Ø (noe) 08:27, 3 December 2007 (UTC)
- Thanks a lot for the explanations! I'm not confident enough (yet) to edit the article, but I'll do some more reading and get back to it later. I dorftrottel I talk I 07:45, December 4, 2007
- This explanation can also cause misunderstandings.
In a list of 23 persons, if you compare the birthday of the first person on the list to the others, you have 22 chances of success, but if you compare each to the others, you have 253 chances. This is because in a group of 23 people there are 23*22/2=253 pairs, which is more than half of the number of days in the year. So the chance that one of these pairs has a matching birthday is not small.
- The numbers are correct, but arguing that the number of pairs is more than half of the number of days in the year would also fit for just 22 people! Furthermore some readers might be inclined to conclude that if the number of pairs exceeds the number of days we would have a sure hit. Maybe it should be pointed out in the main article that this isn't so. --Stevemiller (talk) 17:55, 11 March 2008 (UTC)
- This explanation can also cause misunderstandings.
- Thanks a lot for the explanations! I'm not confident enough (yet) to edit the article, but I'll do some more reading and get back to it later. I dorftrottel I talk I 07:45, December 4, 2007
[edit] Reordering and clarification
It seems to me the Halmos argument section should come before the approximations, since it follows from the discussion in the first paragraph of "Understanding the problem".
It's also not clear how the Poisson approximation is derived. What does the Binomial distribution have to do with the problem at hand? —Preceding unsigned comment added by Justin.mauger (talk • contribs) 19:53, 4 January 2008 (UTC)
[edit] Extremely confusing, text butchered?
Something must've been removed, since you go straight on to the approximations without actually stating the problem mathematically. Am I missing something? —Preceding unsigned comment added by 85.228.97.83 (talk) 21:54, 7 January 2008 (UTC)
- It did indeed get butchered, by this anonymous edit and failure to properly revert afterwards. I believe I've just repaired it. —Keenan Pepper 03:33, 24 January 2008 (UTC)
-
- At one point in the section you restored, the text says "It is easier to first calculate the probability p(n) that all n birthdays are different." Why are we bothering with an assumption that all birthdays are different? As unlikely as it is, they could all be the same day. They each could be any day. This section should be deleted if it isn't seen to be useful to the reader. At the very least, it should be made more plain that this "easier" method is just a step toward understanding the actual problem. Binksternet (talk) 04:18, 24 January 2008 (UTC)
-
-
- Um, I don't think you understand what it says. It's not assuming that all birthdays are different. If you assumed all birthdays are different, then no two birthdays could be the same, so the probability of any two birthdays being the same is zero, and there's nothing to calculate. It's calculating the probability that all the birthdays are different, and calling this probability . Then the probability that any two birthdays are the same is , because "all birthdays are different" is the logical negation of "at least two birthdays are the same". This is not an "easier" method and does not make any simplifying assumptions. It calculates the exact answer. —Keenan Pepper 05:20, 24 January 2008 (UTC)
-
[edit] Pool of possible birthdays stays 365
Some of the article text and prose references assumed that if you document one person's birthday then for the next person you document you could expect their birthday to come from a pool of birthdays reduced by one. However, the pool of possible birthdays is always the same (in the simplified problem format presented here.) Every single new person you see will always have 365 (actually 365.24219) possible birthdays. Binksternet (talk) 23:26, 21 January 2008 (UTC)
- The pool of birthdays that you need to hit in order not to have a match is reduced by 1. --Storkk (talk) 13:34, 3 February 2008 (UTC)
-
- Why? Binksternet (talk) 17:28, 3 February 2008 (UTC)
-
-
- Because you're trying to not have a match, meaning that if you were to hit the same birthday as someone else, you would have a match, so there's one less possibility each time. For example if the first person was born on Jan 1, in order to not have a match you'd need to be born on a day other than Jan 1, which, assuming we're not talking about leap years, is 364 possible days.131.247.152.4 (talk) 14:57, 5 May 2008 (UTC)
-
[edit] Seasonal peaks and troughs
This whole thing works only in theory, which the article does not address. There are seasonal peaks and troughs that vary geographically, which alter the real-life probabilities discussed in this article. For example, there is a peak of birthdays in April/May in Canada, and a trough in December/January. So the probability of any given birthday falling on a particular date is not 1/365. Am I wrong here? 76.71.33.251 (talk) 02:33, 20 February 2008 (UTC)
- Nevermind... I see that the article indeed addresses this. Thanks! 76.71.33.251 (talk) 02:39, 20 February 2008 (UTC)
[edit] Partition Problem problem
The text for this section says "...If there are only two or three weights, the answer is very clearly no..." But surely weights of 1,3 & 4 grams will balance - 1,3 vs 4 ? --83.105.33.91 (talk) 12:16, 17 March 2008 (UTC)
- You don't understand. It says each weight is an integer number of grams randomly chosen between one gram and one million grams. The question is whether it's possible for a majority of the cases. I'll try to clarify the article. —Keenan Pepper 02:00, 18 March 2008 (UTC)
[edit] Birthday Pair Problem
I am comparing this article to page 322 in Applied Cryptography by Bruce Schneier and either the book and my math are wrong or the following line is in correct.
"...23 people there are 23*22/2=253 pairs"
It should read "...23 people there are 23*22=506 pairs." Perhaps the division by 2 is present due to pair AB being equivalent to pair BA. If that is the case the article should be more specific because mathematically there are 506 and not 253 pairs. —Preceding unsigned comment added by Rabunbike (talk • contribs) 18:27, 26 March 2008 (UTC)
- You have 506 ordered pairs and 253 pairs without a need to order. You are not doing the test twice on each pair, but once on each pair of people. That is why 253 pairs (no order) are correct for this problem. -- 87.162.75.104 (talk) 10:26, 19 April 2008 (UTC)
[edit] Presidents of the United States
We have thus far had 42 Presidents of the United States and have yet to have a matching birthday pair. The odds that two of our Presidents will share a birthday is rapidly(relatively speaking) approaching 1.0, is it not? —Preceding unsigned comment added by NCJRB (talk • contribs) 01:34, 5 May 2008 (UTC)
- That's simply not true. James Polk and Warren G. Harding share the birthday November 2. It is probably fairly unlikely that there would be only one such match (other than the obvious fact that Grover Cleveland shares a birthday with himself), but still there's one. There are also considerably more matching pairs in their death days.Eebster the Great (talk) 03:04, 7 May 2008 (UTC)
[edit] Same birthday as you
In the graph next to the same birthday as you section, shouldn't q(n) = 1 when n = 365 (assuming n is the number of people other than you) or n = 366 (assuming n is the number of people including you) ? —Preceding unsigned comment added by 131.247.152.4 (talk) 15:00, 5 May 2008 (UTC)
- No, it seemed that way to me at first, too, but that would only be the case if nobody else shared a birthday. You can imagine a group of you and 200 million other people, but all 200 million of the other people were born January 1 and you were born June 30. However, it's rather surprising how slowly that probability q(n) grows.Eebster the Great (talk) 03:04, 7 May 2008 (UTC)
[edit] Thought I had a grasp on this but I don't.
Alright, I read the section that said 23 different people means 253 different pairs, which made sense to me. However, I was doing the calculations upwards, and you end up with more than 365 different pairs in the with only 28 or so people range. So why isn't the probability a hundred percent at that point? —Preceding unsigned comment added by 69.62.140.50 (talk • contribs) 03:17, 8 May 2008 (UTC)
- You're comparing apples and oranges — number of pairs with number of days. It's possible to have a group of 365 people, one having each possible birthday (not counting 29 February); in that case you'd have 66,430 pairs of people, but each pair having two distinct birthdays. The "253 different pairs" thing is a bit of an intuitive hint that even with only 23 people, there are more possibilities than it might seem at first, but it's not a valid way to determine what the actual probability is of a match. (The problem is that the pairs aren't independent; if A and B share a birthday, and B and C share a birthday, then A and C must share a birthday, and conversely, if A and B don't share a birthday, and B and C don't share a birthday, then there's a slightly elevated chance that A and C do share a birthday.) I think this part of the article needs to be tweaked a bit, as you're not the first person to come out of it with that same question. —RuakhTALK 04:08, 8 May 2008 (UTC)