English explorer, Inventor, Anthropologist
(1822–1911)
He studied medicine and mathematics at Cambridge University.
He invented the phrase “nature versus nurture”
In his will, he donated funds for a professorship in genetics to University College London.
We take each ball independently
In every level, the ball bounces either left or right
We represent these options as -1 and 1
At the last level the position is the sum of all bounces
We will simulate each ball one by one
Here M
is the number of “left-right” choices made by the ball
In other words, M
is the number of levels
M
, larger varianceIt is easy to see that the population mean is 0 for any M
The previous plot shows that the variance is M
Thus, standard deviation will be sqrt(M)
M
=5 Notice that the x
is not an integer anymore
M
=50M
=500M
=5000This “bell-shaped” curve is found in many experiments, especially when they involve the sum of many small contributions
It is called Gaussian distribution, or also Normal distribution
“The sum of several independent random variables converges to a Normal distribution”
The sum should have many terms, they should be independent, and they should have a well defined variance
(In Biology sometimes the variables are not independent, so be careful)
Here outcomes are real numbers
Any real number is possible
Probability of any \(x\) is zero (!)
We look for probabilities of intervals
≈95% of normal population is between \(-2\cdot\text{sd}(\mathbf x)\) and \(2\cdot\text{sd}(\mathbf x)\)
≈99% of normal population is between \(-3\cdot\text{sd}(\mathbf x)\) and \(3\cdot\text{sd}(\mathbf x)\)
Instead of simulating the Galton machine several times, we can simulate the Normal distribution using the R function
The parameter n
is mandatory. It is the sample size
You can also change the mean and the standard deviation of the simulation