Find definitions and interpretation guidance for every result that is provided in the Test table of the equivalence test for a 2x2 crossover design.

The null and alternative hypotheses are mutually exclusive statements about a population. An equivalence test uses sample data to determine whether to reject the null hypothesis.

- Null hypothesis
- Minitab tests one or both of the following null hypotheses, depending on the alternative hypothesis you chose:
- The difference (or ratio) between the mean of the test population and the mean of the reference population is greater than or equal to the upper equivalence limit.
- The difference (or ratio) between the mean of the test population and the mean of the reference population is less than or equal to the lower equivalence limit.

- Alternative Hypothesis
- The alternative hypothesis states one or both of the following:
- The difference (or ratio) between the mean of the test population and the mean of the reference population is less than the upper equivalence limit
- The difference (or ratio) between the mean of the test population and the mean of the reference population is greater than the lower equivalence limit.

Use the null and alternative hypotheses to verify that the equivalence criteria are correct and that you have selected the appropriate alternative hypothesis to test.

The significance level (denoted by alpha or α) is the maximum acceptable level of risk for rejecting the null hypothesis when the null hypothesis is true (type I error). For example, if you perform an equivalence test using the default hypotheses, an α of 0.05 indicates a 5% risk of claiming equivalence when it is not actually true.

The α-level for an equivalence test also determines the confidence level for the confidence interval. By default, the confidence level is (1 – α) x 100%. If you use the alternative method of calculating the confidence interval, the confidence level is (1 – 2α) x 100%.

Use the α-level to decide whether to reject or fail to reject the null hypothesis (H_{0}).

If the p-value is less than the α-level, then you reject H_{0} and claim that your results are statistically significant.

The degrees of freedom (DF) indicate the amount of information that is available in your data to estimate the values of the unknown parameters, and to calculate the variability of these estimates.

Minitab uses the degrees of freedom to calculate the test statistic. Degrees of freedom are affected by the sample size. Increasing your sample size provides more information about the population, which increases the degrees of freedom.

The test statistic evaluates the size of the difference between two population means relative to the variation of the sample. If equivalence criteria are expressed in terms of a difference between the test mean and reference mean, or a ratio of test mean/reference mean using a lognormal transformation, the t-value measures the difference between the sample reference mean and the sample test mean in units of standard error. If equivalence criteria are expressed in terms of a ratio between the test mean and the reference mean, the t-value measures the difference between the sample test mean and a proportion of the reference mean, relative to the variability of both samples.

You can use the t-value to determine whether to reject the null hypothesis. However, most people use the p-value or the confidence interval because they are easier to interpret.

Generally, the greater the magnitude of difference or ratio relative to the sampling variability, the greater the absolute value of the t-value for the test, and the stronger the evidence against the null hypothesis.

The t-value for each test is used to calculate its corresponding p-value. If the p-value associated with this t-value is less than your significance level, you reject the null hypothesis and conclude that the results are statistically significant. For more information, see the section on the P-value for the test.

The p-value is a probability that measures the evidence against the null hypothesis. Lower probabilities provide stronger evidence against the null hypothesis.

The null hypothesis depends on which alternative hypothesis you selected for the test. For more information, go to Hypotheses for Equivalence Test for a 2x2 Crossover Design.

Use the p-value for the test to determine whether there is enough evidence to reject the null hypothesis and accept the alternative hypothesis. Compare each p-value with the significance level (also denoted alpha or α). Usually, an α of 0.05 works well.

When testing for equivalence using the default hypotheses, Minitab tests two null hypotheses about the difference (or ratio) between the test mean and the reference mean: 1) the difference (or ratio) of the population means is greater than the lower equivalence limit and 2) the difference (or ratio) of the population means is less than the upper equivalence limit.

- P-value ≤ α: The difference (or ratio) is within the equivalence limit
- If the p-value is less than or equal to α, you reject the null hypothesis and conclude that the difference (or ratio) between the population means is within the equivalence limit.
- P-value > α: The difference (or ratio) is not within the equivalence limit
- If the p-value is greater than α, you fail to reject the null hypothesis. You do not have enough evidence to conclude that the difference (or ratio) between the population means is within the equivalence limit.

To demonstrate equivalence, p-values for both null hypotheses must be below the α-level. If the p-value for either test is greater than the α-level, you cannot claim equivalence.
###### Tip

To visually evaluate the results of an equivalence test, examine the results on the equivalence plot, which is easier to interpret than the p-values.