How many samples you need to take a win rate seriously
The "not enough samples" thing gets thrown around a lot. Clearly at low sample sizes, the sample may not represent the true probability due to random variances. As sample sizes grow, we approach the true probability of the event. Somewhere along the way, we get to "enough" samples to make a claim. There isn't really a single "point" where you have "enough" samples. What happens is the margin of error above and below what you measured the probability to be gets smaller and smaller. Based on that margin of error, you can make claims with a confidence level. The confidence interval shows the lower and upper limits of what our sample could represent, and is defined though choosing a confidence level. That is often 95% or 99%.
This would read something like this:
"after flipping a coin 10 times and getting 5 heads, we are 95% sure that the probability of a heads is between 19.0% and 81.0%" "after flipping a coin 50 times and getting 25 heads, we are 95% sure that the probability of a heads is between 36.1% and 63.9%"
I did calculate those numbers, and it obviously shows how much better our idea of the chance of heads gets with an increase in sample size.
Due to random variances, we will not usually see the true probability in the sample. Say you flip a coin 30 times, much more often than not, you will not get exactly 15 heads. However, the vast majority of the time, you will get CLOSE to 15 heads. The margin of error deviating away +/- from what we measured in our sample is called the confidence interval. We can calculate the lower and upper values of the confidence interval using the formula in the link I provided.
So lets look at
. She has 26 games, with a win rate of 69.23%. Lets use a confidence level of 95%. We have:
p = 0.6923 n = 26 z1- α/2 = 1.96 for 95% confidence
Plug that into the nice forumula, and we are now 95% sure that
's win rate is between 51.49% and 86.97%
So, we are very confident that her win rate is NOT 50%. However it is possible she is reasonably balanced (51.5% win rate) and we happened to get a very win heavy sample so far at worlds at random. We will see after more samples what her win rate looks like. I imagine it will remain VERY high, and maybe within a 95% confidence we can say her win rate is >55%, which is by most peoples standards completely broken