Because sample sizes are whole numbers, the actual power of the test might be slightly greater than the power value that you specify.
The definitions and interpretation in this topic apply to an equivalence test that uses the default alternative hypothesis for the difference (Lower limit < test mean - reference mean < upper limit or the default alternative hypothesis for the ratio (Lower limit < test mean / reference mean < upper limit).
Power for difference: | Test mean - reference mean |
---|---|
Null hypothesis: | Difference ≤ -1 or Difference ≥ 1 |
Alternative hypothesis: | -1 < Difference < 1 |
α level: | 0.05 |
Assumed standard deviation: | 0.31 |
Difference | Sample Size | Power |
---|---|---|
0.8 | 25 | 0.727010 |
These results show that if the sample size is 25 in each group and the difference is 0.8, the power of the test to demonstrate equivalence is approximately 0.73. Because the power of the test is not adequate to accommodate a difference of 0.8, you should increase the sample size, if possible. You can also use the power curve to determine at what smaller difference value the test may achieve adequate power (0.9) with the specified sample size.
Use the power curve to assess the appropriate sample size or power for your test.
The power curve represents every combination of power and difference (or ratio) for each sample size when the significance level and the standard deviation (or coefficient of variation) are held constant. Each symbol on the power curve represents a calculated value based on the values that you enter. For example, if you enter a sample size and a power value, Minitab calculates the corresponding difference (or ratio) and displays the calculated value on the graph.
Examine the values on the curve to determine the difference (or ratio) between the test mean and the reference mean that can be accommodated at a certain power value and sample size. A power value of 0.9 is usually considered adequate. However, some practitioners consider a power value of 0.8 to be adequate. If an equivalence test has low power, you might fail to demonstrate equivalence even when the population means are equivalent. If you increase the sample size, the power of the test also increases. You want enough observations in your sample to achieve adequate power. But you don't want a sample size so large that you waste time and money on unnecessary sampling or detect unimportant differences to be statistically significant. Usually, differences (or ratios) that are closer to the equivalence limits require more power to demonstrate equivalence.