Please explain in simple terms two questions down at the bottom? I've an exercise:
In the lottery of 100 tickets half of them are winning. How many lottery tickets need to be bought to reach a probability of wining up to 0.999?
The solution to this exercise is:
Let $X_i$ be a probability of winning of ticket i, that is $P(X_i) = p$. Where p is a probability value and equals 0.5. Then, the probability of getting AT LEAST one winning ticket in n tickets equals to sum of independent events $X_1, X_2, \cdots , X_n$, or $P(X_1, X_2, \cdots , X_n) = 1 - (1-p)^n $.
Calculations:
$1 - (1 - p)^n > \ge 0.999 $
Given half of tickets are winning, therefore p = 0.5.
$(1 - 0.5)^n \le 1 - 0.999$
Logorithmizing we get:
$n \times \log(1 - 0.5) \le \log(1 - 0.999)$
Given that $\log(1 - 0.5)$ is negative we change $\le$ to $\ge$:
$n \ge$ $\frac{\log(1 - 0.999)} {\log(1 - 0.5)}$
$ n \ge$ $\frac{\log(0.001)} {\log(0.5)}$
$n \ge 10$
2 questions I can't really wrap my head around
When buying one ticket out of 100, the probability of a ticket changes, because 1st ticket's probability of winning is $\frac{50}{100}$ and the 2nd's is $\frac{49}{99}$ and so on (which is hypergeometric distribution I guess?). So how does it happen that the $p = 0.5$ remains constant when we deal with AT LEAST scenario in this question? Why p remains 0.5, whereas 5 tickets are bought?
Each time we buy a lottery, each ticket is an independent event. But yet it is joint event. I assume it is joint because, perhaps of the first question, each time one ticket is purchased, the probability of winning second time changes? I don't get it.
And I might be mistakenly mixing and mashing some concepts now by saying that: if they are joint and independent should I not somehow take into account their intersection? Because if I work with a sum of independent events $X_1, X_2, \cdots , X_n$, shall we not substract their intersection?