If I roll, say, 20 dice, what is the probability that at least 5 of them will be the same?
Specifically, I am not asking for the probability of e.g. rolling 5 sixes out of 20 dice. For that I believe I could use the binomial distribution and arrive at ~12.9%
I have made a Monte Carlo simulation using Python, where I rolled 20 dice a million times. From each iteration (1 iteration = rolling 20 times), I took the highest number of occurrences of the same number, ignoring what number it was. Then I calculated the number of times each number of occurrences was the highest. Then I calculated cumulative probability of each max number of occurrences. From my simulation, I arrived at ~92.8% probability that at least 5 of 20 rolls are the same.
I would love to see how this could be calculated using a specific formula, similar to the binomial distribution, so that I could reproduce it and be able to calculate e.g. probability of having at least 10 the same out of 30 etc.
Many thanks in advance for your advice!