Interpret the key results for Power and Sample Size for 2 Proportions

Complete the following steps to interpret Power and Sample Size for 2 Proportions. Key output includes the comparison proportion, the sample size, the power, and the power curve.

Step 1: Examine the calculated values

Using the values of the two power function variables that you entered, Minitab calculates the comparison proportion, the sample size, or the power of the test.

Comparison p

Minitab calculates the comparison proportion. The difference between the comparison proportion and the baseline proportion is the minimum difference for which you can achieve the specified level of power for each sample size. Larger sample sizes enable the test to detect smaller differences. You want to detect the smallest difference that has practical consequences for your application.

Sample size

Minitab calculates how large your sample must be for a test with your specified power to detect the difference between the baseline proportion and comparison proportion. Because sample sizes are whole numbers, the actual power of the test might be slightly greater than the power value that you specify.

If you increase the sample size, the power of the test also increases. You want enough observations in your sample to achieve adequate power. But you don't want a sample size so large that you waste time and money on unnecessary sampling or detect unimportant differences to be statistically significant.

Power

Minitab calculates the power of the test based on the specified comparison proportion and sample size. A power value of 0.9 is usually considered adequate. A value of 0.9 indicates you have a 90% chance of detecting a difference between the population proportions when a difference actually exists. If a test has low power, you might fail to detect a difference and mistakenly conclude that none exists. Usually, when the sample size is smaller or the difference is smaller, the test has less power to detect a difference.

Results

Sample SizePowerComparison p
10000.90.669724
10000.90.528190
The sample size is for each group.
Key Results: Comparison p, Sample Size, Power

In these results, based on a sample size of 1,000 and a power value of 0.9, Minitab calculates that the test can detect comparison proportions of approximately 0.67 and 0.53. Because the baseline rate is 0.6, the comparison proportions show that you will be able to detect a difference of approximately 0.07 in either direction.

Step 2: Examine the power curve

Use the power curve to assess the appropriate sample size or power for your test.

The power curve represents every combination of power and comparison proportion for each sample size when the significance level is held constant. Each symbol on the power curve represents a calculated value based on the values that you enter. For example, if you enter a sample size and a power value, Minitab calculates the corresponding comparison proportion and displays the calculated value on the graph.

Examine the values on the curve to determine the difference between the comparison proportion and the baseline proportion that can be detected at a certain power value and sample size. A power value of 0.9 is usually considered adequate. However, some practitioners consider a power value of 0.8 to be adequate. If a hypothesis test has low power, you might fail to detect a difference that is practically significant. If you increase the sample size, the power of the test also increases. You want enough observations in your sample to achieve adequate power. But you don't want a sample size so large that you waste time and money on unnecessary sampling or detect unimportant differences to be statistically significant. If you decrease the size of the difference that you want to detect, the power also decreases.

In this graph, the power curve shows that for a sample size of 1000 and a power of 0.9, the test can detect a difference between the comparison proportion and baseline proportion of approximately 0.07 in either direction. As the comparison proportion approaches the baseline proportion (0.6, in this graph), the power of the test decreases and approaches α (also called the significance level), which is 0.05 for this analysis.