Examine the center and spread of the distribution. Assess how the sample size may affect the appearance of the boxplot.
Hold the pointer over the boxplot to display a tooltip that shows these statistics. For example, this boxplot of resting heart rates shows that the median heart rate is 71. Most subjects have a resting heart rate that is between 64 and 80, but some subjects have heart rates that are as low as 48 and as high as 100.
Investigate any surprising or undesirable characteristics on the boxplot. For example, a boxplot may show that the median length of wood boards is much lower than the target length of 8 feet.
Because the quartiles of the boxplot are calculated values, they may not be actual observations in your data set. For more information on how to interpret the quartiles, go to What are quartiles?. For specific information on how the quartiles are calculated, go to Methods and formulas for Graphical Summary.
The sample size can affect the appearance of the graph.
A boxplot works best when the sample size is at least 20. If the sample size is too small, the quartiles and outliers shown by the boxplot may not be meaningful. If the sample size is less than 20, consider using an Individual value plot instead.
Skewed data indicate that data may be nonnormal. Outliers may indicate other conditions in your data.
When data are skewed, the majority of the data are located on the high or low side of the graph. Skewness indicates that the data may not be normally distributed.
These boxplots illustrate skewed data. The boxplot with right-skewed data shows wait times. Most of the wait times are relatively short, and only a few wait times are long. The boxplot with left-skewed data shows failure time data. A few items fail immediately and many more items fail later.
If you know that your data are not naturally skewed, investigate possible causes. If you want to analyze severely skewed data, read the data considerations topic for the analysis to make sure that you can use data that are not normal.
Outliers, which are data values that are far away from other data values, can strongly affect your results. Often, outliers are easiest to identify on a boxplot.
Hold the pointer over the outlier to identify the data point.
Try to identify the cause of any outliers. Correct any data entry or measurement errors. Consider removing data values that are associated with abnormal, one-time events (special causes). Then, repeat the analysis.
If your boxplot has groups, assess and compare the center and spread of groups.
Look for differences between the centers of the groups.
Look for differences between the spreads of the groups.