Demonstration of the central limit theorem

Provides a "guided tour" of the Central Limit Theorem, simulating multiple throws of a die to illustrate the theorem. Concepts are explained in notes, and graphs show the results of simulations. The theorem states that if random samples of size n are drawn again and again from a population with a finite mean, mu(y), and standard deviation, sigma(y), then when n is large, the distribution of the sample means will be approximately normal with mean equal to mu(y), and standard deviation equal to (sigma(y))/sqrt(n).

Download the Macro

Be sure that Minitab knows where to find your downloaded macro. Choose File > Options > General. Under Macro location, browse to the folder where you save macro files.

Important

If you use an older web browser, clicking the Download button may open the file in QuickTime, which shares the .mac file extension with Minitab macros. To save the macro instead, right-click the Download button and choose Save target as.

Download CLT.mac

Running the Macro

Note

The macro produces data in the worksheet. Please confirm that an empty worksheet is active before running the macro.

To run the macro, choose View > Command Line/History and type

%CLT

Click Run.

The Central Limit Theorem states that if random samples of size n are drawn again and again from a population with a finite mean, mu(y), and standard deviation, sigma(y), then when n is large, the distribution of the sample means will be approximately normal with mean equal to mu(y), and standard deviation equal to (sigma(y))/sqrt(n).
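For the fair die simulated by this macro, the population values in the theorem can be worked out by hand. The following is a short worked sketch; the symbols \mu_y and \sigma_y are notation chosen here for mu(y) and sigma(y), and \sigma_{\bar{y}} is an assumed name for the standard deviation of the mean of n tosses.

```latex
\mu_y = \frac{1+2+3+4+5+6}{6} = 3.5
\qquad
\sigma_y^2 = \frac{1}{6}\sum_{k=1}^{6}(k-3.5)^2 = \frac{35}{12}
\qquad
\sigma_y = \sqrt{\frac{35}{12}} \approx 1.7078

\sigma_{\bar{y}} = \frac{\sigma_y}{\sqrt{n}} = \sqrt{\frac{35}{12\,n}}
```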

Let's examine the effects of the Central Limit Theorem with the following experiment. Suppose you toss a fair die 1000 times. You would expect to get about an equal number of 1's, 2's, and so on. Let's examine the distribution of 1000 tosses. This is shown in Graph 1.
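The first experiment can also be reproduced outside Minitab. The following is a minimal Python sketch, not the macro's own code; the function name `toss_counts` and the seed value are illustrative assumptions.

```python
import random
from collections import Counter


def toss_counts(num_tosses=1000, seed=1):
    """Count how often each face appears in num_tosses tosses of a fair die."""
    rng = random.Random(seed)  # seeded so that the run is repeatable
    tosses = [rng.randint(1, 6) for _ in range(num_tosses)]
    return Counter(tosses)


counts = toss_counts()
for face in range(1, 7):
    # Each face should appear roughly num_tosses / 6 times.
    print(face, counts[face])
```

Each count should be in the neighborhood of 1000/6, or about 167, which is why Graph 1 looks roughly flat.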

Press Enter to proceed.

Now suppose you were to toss the die two times and take the average of the two tosses. You will repeat this experiment 1000 times also. Let's see what the distribution of the averages of two tosses looks like. This is shown in Graph 2.
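The averaging step can be sketched in Python as follows. This is an illustration under assumed names (`sample_means`, `tosses_per_sample`), not the code inside CLT.mac.

```python
import random


def sample_means(tosses_per_sample, num_samples=1000, seed=1):
    """Return num_samples averages, each taken over tosses_per_sample tosses."""
    rng = random.Random(seed)  # seeded so that the run is repeatable
    means = []
    for _ in range(num_samples):
        total = sum(rng.randint(1, 6) for _ in range(tosses_per_sample))
        means.append(total / tosses_per_sample)
    return means


two_toss_means = sample_means(2)
print(sorted(set(two_toss_means)))  # the distinct averages that occurred
```

With two tosses the average can only take the values 1.0, 1.5, ..., 6.0, and the middle values can be produced by more combinations of faces than the extremes, which is what gives the histogram its mound.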

Press Enter to proceed.

Did you notice that with only two tosses the distribution of the averages was already becoming mound-shaped? Suppose that you now toss the die three times and take the average of the three tosses. Again, you will repeat this experiment 1000 times. Let's see what effect this has on the distribution of the averages. This is shown in Graph 3.

Press Enter to proceed.

Again, the shape of the distribution is quite close to that of a normal distribution. Did you notice anything else that was happening to the distribution?

Let's toss the die five times and take the average. Again, you will repeat this experiment 1000 times. This is shown in Graph 4.

Press Enter to proceed.

Have you begun to notice any patterns in what is happening yet?

Let's continue to increase the number of tosses that we are averaging. This time you will toss the die 10 times and take the average of the 10 tosses. This is shown in Graph 5.

Press Enter to proceed.

By now you should see two phenomena as you increase the number of tosses. First, you should see that the shape of the distribution of averages is really beginning to take on the shape of a normal distribution. Second, you should see that as the number of tosses increases, the distribution becomes narrower and narrower. Let's continue increasing the number of tosses. This time you will toss the die 20 times. This is shown in Graph 6.
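The narrowing can be checked numerically as well as visually. The sketch below, with assumed names `simulate_averages` and `summarize`, repeats the experiment for each sample size used in this tour and reports the mean and sample standard deviation of the averages. The exact numbers depend on the random seed and will not match the macro's output.

```python
import random
from statistics import mean, stdev

SAMPLE_SIZES = (1, 2, 3, 5, 10, 20, 30)


def simulate_averages(tosses_per_sample, rng, num_samples=1000):
    """Average tosses_per_sample die tosses, repeated num_samples times."""
    return [
        sum(rng.randint(1, 6) for _ in range(tosses_per_sample)) / tosses_per_sample
        for _ in range(num_samples)
    ]


def summarize(seed=1):
    """Map each sample size to (mean, standard deviation) of its averages."""
    rng = random.Random(seed)  # seeded so that the run is repeatable
    results = {}
    for n in SAMPLE_SIZES:
        averages = simulate_averages(n, rng)
        results[n] = (mean(averages), stdev(averages))
    return results


summary = summarize()
for n, (avg, sd) in summary.items():
    print(f"n={n:2d}  mean={avg:.3f}  stdev={sd:.4f}")
```

The means should stay near 3.5 for every sample size, while the standard deviation shrinks as n grows.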

Press Enter to proceed.

You should by now be adequately convinced of the effects that increasing the sample size has on the distribution of sample averages. You will increase the sample size one more time to reinforce this thought. This time you will toss the die 30 times. This is shown in Graph 7.

Press Enter to proceed.

Let's review what you have seen.

You will draw the histograms for samples of size 2, 5, 10, 20, and 30 together in one plot to see the changes in the distribution.

Press Enter to proceed.

The Central Limit Theorem tells us what you should have seen, theoretically. Let's compare this to what you actually did see:

          Theoretical Results      Observed Results
          -------------------      ----------------
Sample              Standard                Standard
Size      Mean      Deviation      Mean     Deviation
------    ----      ---------      -----    ---------
 1        3.5       1.707825       3.453    1.7041
 2        3.5       1.207615       3.527    1.2320
 3        3.5       0.986013       3.546    0.9503
 5        3.5       0.763763       3.481    0.7532
10        3.5       0.540062       3.506    0.5289
20        3.5       0.381879       3.510    0.3891
30        3.5       0.311805       3.507    0.3148
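The theoretical columns follow directly from the theorem: the mean is always mu(y) = 3.5, and the standard deviation is sigma(y)/sqrt(n). A small Python sketch of that calculation is shown below; the function names are illustrative assumptions, and the macro itself may compute these values differently.

```python
import math

SAMPLE_SIZES = (1, 2, 3, 5, 10, 20, 30)
FACES = range(1, 7)  # faces of a fair six-sided die


def die_mean():
    """Population mean mu(y) of a single toss."""
    return sum(FACES) / len(FACES)


def die_sd():
    """Population standard deviation sigma(y) of a single toss."""
    mu = die_mean()
    variance = sum((face - mu) ** 2 for face in FACES) / len(FACES)
    return math.sqrt(variance)


def sd_of_sample_mean(n):
    """Standard deviation of the average of n tosses: sigma(y) / sqrt(n)."""
    return die_sd() / math.sqrt(n)


for n in SAMPLE_SIZES:
    print(f"{n:>6}  {die_mean():.1f}  {sd_of_sample_mean(n):.6f}")
```

The printed values should agree with the Theoretical Results columns to roughly five decimal places.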