Suppose that the average number of cars passing by a building, $X$, is 3 cars per minute. Assuming that $X$ follows the Poisson distribution with mean $\lambda=3$, the probability that 1 car passes per minute is $$P(X=1)=e^{-3}\frac{3^1}{1!}\approx0.15.$$ On the other hand, let $Y$ be the number of cars passing by the building in an hour. Then $Y$ follows the Poisson distribution with mean $60\lambda=180$, so the probability that 60 cars pass by the building in an hour is $$P(Y=60)=e^{-180}\frac{180^{60}}{60!}\approx1.67\times10^{-25},$$ which is much smaller than $P(X=1)$. But all I did was to change the unit from minute to hour, and intuitively, the probability of 1 car per minute should be the same as the probability of 60 cars per hour. Somehow the choice of unit seems to affect the Poisson process significantly, which is odd. I suspect that I'm not fully understanding the Poisson process. Can anyone explain in layman's term?
3 Answers
intuitively, the probability of 1 car per minute should be the same as the probability of 60 cars per hour.
This intuition is incorrect. When $\lambda = 3$ this means that the mean number of cars per minute is $3$, and so having only $1$ car pass per minute is somewhat unusual; it means fewer cars than usual passed in a minute. When you go up to an hour, the mean number of cars is $180$, and having only $60$ cars pass during that time is much more unusual; it means that there had to be unusually few cars for sixty minutes in a row, which is much less likely than just having unusually few cars for a single minute. In fact
$$ \left( 3e^{-3} \right)^{60} \approx 2.8 \times 10^{-50}$$
so the second probability is a bit higher than the first one to the power of sixty (since there are more ways to get $60$ cars in an hour than just to have one car per minute; some of those minutes there could be fewer cars and other times there could be more). But you can see it's roughly-ish the same ballpark.
The question is a bit like asking what the difference is between flipping heads once and flipping heads $60$ times in a row. The answer is to flip heads $60$ times in a row you have to get very very lucky for a long time! These just aren't the same distribution and the Poisson distribution for larger values of $\lambda$ isn't a scaled version of the Poisson distribution for smaller values of $\lambda$ either.
Edit: Some comment about "units" is also in order. I think your intuition is that the parameter $\lambda$ is like a velocity, so it should be expressible in different units (cars per minute, cars per hour) without changing what's going on. This is not correct! Actually the parameter $\lambda$ is and must be dimensionless or else the $e^{-\lambda}$ in the definition of the Poisson distribution doesn't make sense. $\lambda$ is just a number, and the Poisson distribution for small values of $\lambda$ behaves quite differently than for large values of $\lambda$. You can see this concretely in the fact that the mean is $\lambda$ but the standard deviation is $\sqrt{\lambda}$ (which also would not make sense if $\lambda$ had units).
So how do we square this with the impression that "cars per minute" sounds like a quantity with units? The answer is that to get a specific Poisson distribution we have to pick a specific time window; if $3$ cars pass per minute then we can consider the number of cars that pass in a minute and get $P(3)$ or the number of cars that pass in an hour and get $P(180)$, which means that the rate of $3$ cars passing per minute is being multiplied by a number $t$ of minutes, so the time units cancel. The quantity $3t$ is then dimensionless and this is the actual parameter of the Poisson distribution.
- 468,795
-
@Quaochu Yuan That's interesting. Thanks for the insight. But the argument doesn't seem as convincing if I compare $P(X=3)\approx0.224$ and $P(Y=180)\approx0.030$. Nothing unusual happens here, since 3 and 180 are the means of $X$ and $Y$, respectively. It is quite noticeable that $P(Y=180)$ is quite sizable. – ashpool Oct 09 '23 at 09:19
-
3@ashpool "But the argument doesn't seem as convincing" - different questions need different arguments... – AakashM Oct 09 '23 at 15:05
-
5@ashpool: The standard deviation of $X$ is $\sqrt3$, so the event $X=1$ is slightly more than $1$ standard deviation away from the mean. The standard deviation of $Y$ is $\sqrt{180}$, so the event $Y=60$ is almost $9$ standard deviations away from the mean. Does that help convince you that you're doing much more than changing the units? – Greg Martin Oct 09 '23 at 17:29
-
5@ashpool: For $X=3$ and $Y=180$, the explanation is simply that the bigger the expected number of events is, the smaller the probability of getting exactly that number (or exactly any number) of events becomes. Compare this with, say, rolling dice: if you roll two six-sided dice, the probability of their sum being 7 is pretty high (1 in 6). If you roll 200 dice, the probability of their sum being exactly 700 is a lot lower (about 1.65%, FWIW). Yes, the sum will probably be somewhere near 700, but most likely not exactly 700. – Ilmari Karonen Oct 09 '23 at 18:41
-
3@ashpool: here the coin-flipping analogy is the difference between flipping $1$ heads out of $2$ coins and flipping $50$ heads out of $100$ coins. This is still the mean number of heads but the more coins you flip the harder it is to get exactly the mean value, as opposed to approximately. (You can check that the decay rate in both of these cases is about the same - roughly $\frac{1}{\sqrt{n}}$, the inverse of the standard deviation - and this is due to the central limit theorem.) – Qiaochu Yuan Oct 09 '23 at 18:56
-
Thank you all for the very illuminating comments. I'm very much convinced now :) – ashpool Oct 09 '23 at 23:02
-
1@ashpool: I also added some comments about "units." I think you were under the impression that $\lambda$ is a unital parameter but actually it is unitless; I've added details in the answer. – Qiaochu Yuan Oct 10 '23 at 00:19
-
I think of it as $\lambda = \rho \Delta V$ where $\rho$ is the rate (cars/minute, particles/mL etc.) and $\Delta V$ is the volume element (1 minute, 60 minutes, 100 mL etc.) . This is how I translate a rate (or density) into the $\lambda$ parameter of Possion distribution -- you need to pick the volume element under consideration; then the distribution is for the count of entities in that selected volume. – Dave Oct 10 '23 at 13:43
A simple analogy: Roll a single fair (cubic) die. There are 6 possible outcomes (1, 2, 3, 4, 5, 6), each with probability $\frac{1}{6} = 0.1666666...$.
Now roll 60 dice. Their total can be anything between 60 and 360. That's 301 distinct values. Furthermore, their distribution is no longer uniform.
- The lowest total value (60) can be achieved only by having 1's on all 60 dice, which has a probability of $(\frac{1}{6})^{60} \approx 2.046 \times 10^{-47}$. It's the same for the highest total value (360), which can be achieved only by rolling 6's on all 60 dice.
- The most likely total value (210) has a probability of approximately $0.030077$. But this is only about 18% of the corresponding maximum probability when rolling 1 die. The more dice you have, the probability of achieving any exact sum will get lower.
- However, the probability of getting a sum near 210 — between 180 and 240, inclusive — rises to $0.50462$. Contrast with the $\frac{1}{3}$ probability of rolling either 3 or 4 on a single die.
The more dice you roll, the less likely any one exact total gets, and the more the distribution clusters towards the center and away from the extremes. So, you can't just assume that conducting a probability trial 60 times will have the same probabilities as conducting it once.
- 18,262
If the expected rate of passage is $\lambda$ cars per unit of time, then the probability of $n$ cars passing in $t$ units of time is $$\mathbb P(N=n)=e^{-\lambda t}\frac{(\lambda t)^n}{n!}$$
so it does not matter whether
- you say $\lambda=3$ cars per minute and $t=60$ minutes,
- or you say $\lambda=180$ cars per hour and $t=1$ hour.
In both cases you have $\lambda t=180$ cars expected to pass in the time period and so $$\mathbb P(N=n)=e^{-180}\frac{180^n}{n!}.$$
- 169,616