The sample size (N) is the total number of observations in the sample.
The sample size affects the confidence interval and the power of the test.
Usually, a larger sample size results in a narrower confidence interval. A larger sample size also gives the test more power to detect a difference. For more information, go to What is power?.
The median is the midpoint of the data set. This midpoint value is the point at which half the observations are above the value and half the observations are below the value. The median is determined by ranking the observations and finding the observation that are at the number [N + 1] / 2 in the ranked order. If the number of observations are even, then the median is the average value of the observations that are ranked at numbers N / 2 and [N / 2] + 1.
The median of each sample is an estimate of the population median of each sample.
The difference is the difference between the medians of the two samples.
Because the difference is based on sample data and not on the entire population, it is unlikely that the sample difference equals the population difference. To better estimate the population difference, use the confidence interval.
The confidence interval provides a range of likely values for the population difference. Because samples are random, two samples from a population are unlikely to yield identical confidence intervals. But, if you repeated your sample many times, a certain percentage of the resulting confidence intervals or bounds would contain the unknown population difference. The percentage of these confidence intervals or bounds that contain the difference is the confidence level of the interval. For example, a 95% confidence level indicates that if you take 100 random samples from the population, you could expect approximately 95 of the samples to produce intervals that contain the population difference.
An upper bound defines a value that the population difference is likely to be less than. A lower bound defines a value that the population difference is likely to be greater than.
The confidence interval helps you assess the practical significance of your results. Use your specialized knowledge to determine whether the confidence interval includes values that have practical significance for your situation. If the interval is too wide to be useful, consider increasing your sample size.
Difference | CI for Difference | Achieved Confidence |
---|---|---|
-1.85 | (-3, -0.9) | 95.52% |
In these results, the estimate of the population median for the difference in the number of months that paint persists on two highways is −1.85. You can be 95.52% confident that the difference between the population medians is between −3.0 and −0.9.
The Mann-Whitney statistic (W-Value) is the sum of the ranks of the first sample.
Minitab uses the Mann-Whitney statistic to calculate the p-value, which is a probability that measures the evidence against the null hypothesis.
Because the interpretation of the Mann-Whitney statistic depends on the sample size, use the p-value to make a decision about the test. The p-value has the same meaning for any sample size.
The p-value is a probability that measures the evidence against the null hypothesis. A smaller p-value provides stronger evidence against the null hypothesis.
Use the p-value to determine whether the difference in population medians is statistically significant.
A tie occurs when the same value is in both samples. If your data has ties, Minitab displays a p-value that is adjusted for ties and a p-value that is not adjusted. The adjusted p-value is usually more accurate than the unadjusted p-value. However, the unadjusted p-value is the more conservative estimate because it is always greater than the adjusted p-value for a specific pair of samples.