The sample size (N) is the total number of observations in each group.
The sample size affects the confidence interval and the power of the test.
Usually, a larger sample yields a narrower confidence interval. A larger sample size also gives the test more power to detect a difference.
The mean of the observations within each group. The mean describes each group with a single value identifying the center of the data. It is the sum of all the observations with a group divided by the number of observations in that group.
The mean of each sample provides an estimate of each population mean. The differences between sample means are the estimates of the difference between the population means.
Because the difference between the group means are based on data from a sample and not the entire population, you cannot be certain it equals the population difference. To get a better sense of the population difference, you can use the confidence interval.
The standard deviation is the most common measure of dispersion, or how spread out the data are around the mean. The symbol σ (sigma) is often used to represent the standard deviation of a population. The symbol s is used to represent the standard deviation of a sample.
The sample standard deviation of a group is an estimate of the population standard deviation of that group. The standard deviations are used to calculate the confidence intervals and the p-values. Larger sample standard deviations result in less precise (wider) confidence intervals and lower statistical power.
Analysis of variance assumes that the population standard deviations for all levels are equal. If you cannot assume equal variances, use Welch's ANOVA, which is an option for One-Way ANOVA.
These confidence intervals (CI) are ranges of values that are likely to contain the true mean of each population. The confidence intervals are calculated using the pooled standard deviation.
Because samples are random, two samples from a population are unlikely to yield identical confidence intervals. But, if you repeat your sample many times, a certain percentage of the resulting confidence intervals contain the unknown population parameter. The percentage of these confidence intervals that contain the parameter is the confidence level of the interval.
Use the confidence interval to assess the estimate of the population mean for each group.
For example, with a 95% confidence level, you can be 95% confident that the confidence interval contains the group mean. The confidence interval helps you assess the practical significance of your results. Use your specialized knowledge to determine whether the confidence interval includes values that have practical significance for your situation. If the interval is too wide to be useful, consider increasing your sample size.
Paint | N | Mean | StDev | 95% CI |
---|---|---|---|---|
Blend 1 | 6 | 14.73 | 3.36 | (11.37, 18.10) |
Blend 2 | 6 | 8.57 | 5.50 | (5.20, 11.93) |
Blend 3 | 6 | 12.98 | 3.73 | (9.62, 16.35) |
Blend 4 | 6 | 18.07 | 2.64 | (14.70, 21.43) |
In these results, each blend has a confidence interval for its mean hardness. The multiple comparison results for these data show that Blend 4 is significantly harder than Blend 2. That Blend 4 is harder than Blend 2 does not show that Blend 4 is hard enough for the intended use of the paint. The confidence interval for the group mean is better for judging whether Blend 4 is hard enough.
The pooled standard deviation is an estimate of the common standard deviation for all levels. The pooled standard deviation is the standard deviation of all data points around their group mean (not around the overall mean). Larger groups have a proportionally greater influence on the overall estimate of the pooled standard deviation.
A higher standard deviation value indicates greater spread in the data. A higher value produces less precise (wider) confidence intervals and low statistical power.
Minitab uses the pooled standard deviation to create the confidence intervals for both the group means and the differences between group means.
Group | Mean | Standard Deviation | N |
---|---|---|---|
1 | 9.7 | 2.5 | 50 |
2 | 12.1 | 2.9 | 50 |
3 | 14.5 | 3.2 | 50 |
4 | 17.3 | 6.8 | 200 |
The first three groups are equal in size (n=50) with standard deviations around 3. The fourth group is much larger (n=200) and has a higher standard deviation (6.8). Because the pooled standard deviation uses a weighted average, its value (5.488) is closer to the standard deviation of the largest group.