Data considerations for Analyze Variability

To ensure that your results are valid, consider the following guidelines when you collect data, perform the analysis, and interpret your results.

The design must be a 2-level factorial design: If you do not have a 2-level factorial design, use Test for Equal Variances.
The response should be standard deviations of repeat or replicates measurements: Repeat measurements are taken during the same experimental run or consecutive runs, while replicate measurements are taken during identical but distinct experimental runs. You enter repeats across rows of multiple columns, while you enter replicates down a single column.; If you want to analyze the location effect, you can store the means of repeats and then use Analyze Factorial Design.

The data must include at least 2 factors, which can be either continuous or categorical

If you have only one categorical factor and no continuous predictors, use Test for Equal Variances.

Ensure that the measurement system produces reliable response data

If the variability in your measurement system is too great, your experiment may lack the power to find important effects.

Each observation should be independent from all other observations

If your individual observations are dependent, your results might not be valid. Consider the following points to determine whether your observations are independent:

If an observation provides no information about the value of another observation, the observations are independent.
If an observation provides information about another observation, the observations are dependent.

The experimental runs should be randomized

Randomization reduces the chance that uncontrolled conditions will bias the results. Randomization also lets you estimate the inherent variation in materials and conditions so that you can make valid statistical inferences based on the data from your experiment.

In some situations, randomization may lead to an undesirable run order. For instance, factor level changes can be difficult, expensive, or take a long time to produce a stable process. Under these conditions, you may want to randomize with a split-plot design to minimize the level changes.

Collect data using best practices

To ensure that your results are valid, consider the following guidelines:

Make certain that the data represent the population of interest.
Collect enough data to provide the necessary precision.
Record the data in the order it was collected.

The model should provide a good fit to the data

If the model does not fit the data, the results can be misleading. In the output, use the residual plots, the diagnostic statistics for unusual observations, and the model summary statistics to determine how well the model fits the data.