Interpret all statistics and graphs for Moving Average

Find definitions and interpretation guidance for every statistic and graph that is provided with moving average.

Length

The number of observations in the time series.

NMissing

The number of missing values in the time series.

Moving Average Length

The moving average length is the number of consecutive observations that Minitab uses to calculate the moving averages. For example, for monthly data, a value of 3 indicates that the moving average for March is the average of the observations from March, February, and January.

The moving average length adjusts the amount of smoothing. Usually, you should smooth the data enough to reduce the noise (irregular fluctuations) so that the pattern is more apparent. However, don't smooth the data so much that you lose important details. Lower values produce a less smooth line. Higher values produce a smoother line.
[Figures: moving average plots with Moving average = 2 and Moving average = 6]
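
The calculation described above can be sketched in a few lines of Python. This is an illustrative sketch only, not Minitab's implementation: the function name `moving_average` and the `sales` values are hypothetical, and the sketch assumes a trailing (non-centered) moving average with no missing values.

```python
def moving_average(series, length):
    """Trailing moving average: entry t averages the `length` most recent
    observations ending at t. The first length - 1 entries are None
    because too few observations are available."""
    result = []
    for t in range(len(series)):
        if t + 1 < length:
            result.append(None)
        else:
            window = series[t + 1 - length : t + 1]
            result.append(sum(window) / length)
    return result

# Hypothetical monthly values, January through June.
sales = [10, 14, 12, 18, 11, 16]

ma2 = moving_average(sales, 2)  # shorter length: follows the data more closely
ma3 = moving_average(sales, 3)  # longer length: smoother line

print(ma2)
print(ma3)  # the third entry is the March average of Jan, Feb, and Mar
```

With a length of 3, the March value is the average of 10, 14, and 12. Increasing the length averages over more observations, which is what produces the smoother line.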

MAPE

The mean absolute percent error (MAPE) expresses the error as a percentage of the actual values. Because the MAPE is a percentage, it can be easier to understand than the other accuracy measures. For example, if the MAPE is 5, the fits are off by 5% on average.

However, sometimes you may see a very large value of MAPE even though the model appears to fit the data well. Examine the plot to see if any data values are close to 0. Because MAPE divides the absolute error by the actual data, values close to 0 can greatly inflate the MAPE.
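
The following sketch illustrates both points: how MAPE is computed and how a single actual value close to 0 can inflate it. The function name `mape` and all data values are hypothetical. The sketch assumes that no actual value is exactly 0, because the calculation divides by the actual values.

```python
def mape(actual, predicted):
    """Mean absolute percent error, in percent.
    Periods without a prediction (None) are skipped.
    Assumes no actual value is exactly 0."""
    pairs = [(y, f) for y, f in zip(actual, predicted) if f is not None]
    return 100 * sum(abs((y - f) / y) for y, f in pairs) / len(pairs)

# Every fit is off by 5% of the actual value.
actual = [100, 110, 90, 105]
fits = [95, 104.5, 94.5, 99.75]
print(mape(actual, fits))

# Same absolute error (4.5) in the third period, but the actual value
# is now close to 0, so that period alone contributes 900%.
actual_near_zero = [100, 110, 0.5, 105]
fits_near_zero = [95, 104.5, 5.0, 99.75]
print(mape(actual_near_zero, fits_near_zero))
```

The second MAPE is far larger even though the absolute errors are identical, which is why a plot of the data is worth checking before you reject a model on MAPE alone.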

Interpretation

Use to compare the fits of different time series models. Smaller values indicate a better fit. If a single model does not have the lowest values for all 3 accuracy measures, MAPE is usually the preferred measurement.

The accuracy measures are based on one-period-ahead residuals. At each point in time, the model is used to predict the Y value for the next period. The one-period-ahead residuals are the differences between these predicted values (fits) and the actual Y values. The accuracy measures therefore indicate the accuracy you might expect when you forecast 1 period beyond the end of the data. They do not indicate the accuracy of forecasts more than 1 period out. If you use the model for forecasting, don't base your decision solely on the accuracy measures. Also examine the fit of the model to ensure that the fits and the forecasts follow the data closely, especially at the end of the series.

MAD

The mean absolute deviation (MAD) expresses accuracy in the same units as the data, which helps conceptualize the amount of error. Outliers have less of an effect on MAD than on MSD.
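
As a minimal sketch, MAD is the average of the absolute one-period-ahead errors. The function name `mad` and the data values are hypothetical.

```python
def mad(actual, predicted):
    """Mean absolute deviation, in the same units as the data.
    Periods without a prediction (None) are skipped."""
    pairs = [(y, f) for y, f in zip(actual, predicted) if f is not None]
    return sum(abs(y - f) for y, f in pairs) / len(pairs)

# Hypothetical observed values and fits, for example in units sold.
actual = [12, 15, 11, 14]
fits = [10, 14, 13, 13]

print(mad(actual, fits))  # average miss, in units sold
```

Here the absolute errors are 2, 1, 2, and 1, so the fits miss by 1.5 units on average.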

Interpretation

Use to compare the fits of different time series models. Smaller values indicate a better fit.

The accuracy measures are based on one-period-ahead residuals. At each point in time, the model is used to predict the Y value for the next period. The one-period-ahead residuals are the differences between these predicted values (fits) and the actual Y values. The accuracy measures therefore indicate the accuracy you might expect when you forecast 1 period beyond the end of the data. They do not indicate the accuracy of forecasts more than 1 period out. If you use the model for forecasting, don't base your decision solely on the accuracy measures. Also examine the fit of the model to ensure that the fits and the forecasts follow the data closely, especially at the end of the series.

MSD

The mean square deviation (MSD) measures the accuracy of the fitted time series values. Outliers have a greater effect on MSD than on MAD.
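
The difference in outlier sensitivity can be seen in a small sketch. The function names and error values are hypothetical, and the sketch assumes that both measures divide by the number of one-period-ahead errors.

```python
def mad_from_errors(errors):
    """Mean of the absolute errors."""
    return sum(abs(e) for e in errors) / len(errors)

def msd_from_errors(errors):
    """Mean of the squared errors."""
    return sum(e * e for e in errors) / len(errors)

typical = [1, -1, 2, -2]
with_outlier = [1, -1, 2, -10]  # the last error is replaced by an outlier

print(mad_from_errors(typical), msd_from_errors(typical))
print(mad_from_errors(with_outlier), msd_from_errors(with_outlier))
```

Because MSD squares each error, the single outlier raises MSD by a much larger factor than it raises MAD.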

Interpretation

Use to compare the fits of different time series models. Smaller values indicate a better fit.

The accuracy measures are based on one-period-ahead residuals. At each point in time, the model is used to predict the Y value for the next period. The one-period-ahead residuals are the differences between these predicted values (fits) and the actual Y values. The accuracy measures therefore indicate the accuracy you might expect when you forecast 1 period beyond the end of the data. They do not indicate the accuracy of forecasts more than 1 period out. If you use the model for forecasting, don't base your decision solely on the accuracy measures. Also examine the fit of the model to ensure that the fits and the forecasts follow the data closely, especially at the end of the series.

MA

The moving average values are calculated from consecutive observations. For example, for monthly data with a moving average length of 3, the moving average for March is the average of the observations from March, February, and January.

Predict (also called fits)

The predicted value for time t is equal to the moving average value at time t-1.
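
The one-period lag between the moving average and the fit can be sketched as follows. The function name `one_step_fits` and the data are hypothetical, and the sketch assumes a trailing moving average with no missing values.

```python
def one_step_fits(series, length):
    """Fit for time t = moving average at time t - 1, that is, the
    average of the `length` observations immediately before t.
    The first `length` entries are None because no complete
    moving average exists yet."""
    fits = []
    for t in range(len(series)):
        if t < length:
            fits.append(None)
        else:
            fits.append(sum(series[t - length : t]) / length)
    return fits

# Hypothetical monthly values, January through June.
series = [10, 14, 12, 18, 11, 16]
fits = one_step_fits(series, 3)
print(fits)  # the April fit is the average of Jan, Feb, and Mar
```

With a length of 3, the first fit appears in the fourth period, because that is the first period preceded by a complete moving average.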

Observations whose predicted values differ greatly from the observed values may be unusual or influential. Try to identify the cause of any outliers. Correct any data entry or measurement errors. Consider removing data values that are associated with abnormal, one-time events (special causes). Then, repeat the analysis.

Error

The error values are also called residuals. The error values are the differences between the observed values and the predicted values.
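
As a small sketch of this definition, the error for each period is the observed value minus the predicted value. The function name `errors_from_fits` and the values are hypothetical. Periods without a fit are left as None.

```python
def errors_from_fits(observed, fits):
    """Error = observed value - predicted value (fit).
    Periods without a fit keep None."""
    return [None if f is None else y - f for y, f in zip(observed, fits)]

observed = [10, 14, 12, 18, 11, 16]
fits = [None, None, None, 12.0, 15.0, 13.5]

errors = errors_from_fits(observed, fits)
print(errors)
```

A positive error means that the fit was below the observed value, and a negative error means that the fit was above it.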

Interpretation

Plot the error values to determine whether your model is adequate. The values can provide useful information about how well the model fits the data. In general, the error values should be randomly distributed around 0 with no obvious patterns and no unusual values.

Period

Minitab displays the period when you generate forecasts. The period is the time unit of the forecast. By default, the forecasts start at the end of the data.

Forecast

The forecasts are the fitted values obtained from the time series model. Minitab displays the number of forecasts that you specify. The forecasts begin either at the end of the data or at the point of origin that you specify.

Interpretation

Use forecasts to predict a variable for a specified period of time. For example, a warehouse manager can model how much product to order for the next 3 months based on the previous 60 months of orders.

Examine the fits and the forecasts in the plot to determine whether the forecasts are likely to be accurate. The forecasts should generally follow the data at the end of the series. If the fits shift away from the data at the end of the series, the forecasts may not be accurate. Because the forecasts from moving average are constant, it is important that there is no trend in the data before the forecasts. If there is a trend before the forecasts, the forecasts may not be accurate.

The forecasts from moving average are very conservative because they are based solely on the latest estimate of the level, and no estimate of the trend. You should usually only forecast 6 periods into the future.
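
The following sketch shows why the forecasts are constant and why a trend makes them unreliable. The function name `moving_average_forecasts` and the data are hypothetical. The sketch assumes that forecasts start at the end of the data and that every forecast equals the last moving average.

```python
def moving_average_forecasts(series, length, horizon):
    """Every forecast equals the last moving average (the latest
    estimate of the level), so the forecast line is flat."""
    level = sum(series[-length:]) / length
    return [level] * horizon

# Hypothetical series with a steady upward trend of 2 per period.
trending = [10, 12, 14, 16, 18, 20]

forecasts = moving_average_forecasts(trending, length=3, horizon=4)
print(forecasts)
```

The last observation is 20, but every forecast is 18.0. The forecasts already lag the data at the first forecast period, and the gap widens with each additional period if the trend continues.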

Lower and Upper

The lower and upper prediction limits produce a prediction interval for each forecast. The prediction interval is a range of likely values for the actual observation at the forecast time. For example, with a 95% prediction interval, you can be 95% confident that the prediction interval contains the actual value at the specified time.
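
One common way to build such an interval is to place symmetric limits around the forecast, with a width based on the typical size of the one-period-ahead errors. The sketch below is an assumption for illustration, not a statement of the formula that Minitab uses: it takes the half-width to be a standard normal quantile times the square root of the MSD, which presumes roughly normal errors. The function name `prediction_limits` and the input values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def prediction_limits(forecast, msd, confidence=0.95):
    """Symmetric limits around the forecast.
    Assumption: half-width = z * sqrt(MSD), with z the standard
    normal quantile for the requested confidence level."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    half_width = z * sqrt(msd)
    return forecast - half_width, forecast + half_width

# Hypothetical forecast and MSD.
lower, upper = prediction_limits(forecast=18.0, msd=4.0)
print(lower, upper)
```

Under this assumption, a larger MSD widens the interval, which reflects greater uncertainty about the actual value.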

Moving Average Plot

The moving average plot displays the observations versus time. The plot includes the fits that are calculated from the moving averages, the forecasts, the moving average length, and the accuracy measures. You can also choose to display the smoothed values instead of the fits.

Interpretation

Examine the smoothing plot to determine whether your model fits your data. If the fits closely follow the actual data, the model fits your data.
  • If the model fits the data, you can perform Single Exponential Smoothing and compare the two models.
  • If the model does not fit the data, examine the plot for trends or seasonality. If you see evidence of a trend or seasonality, you should use a different time series analysis. For more information, go to "Which time series analysis should I use?"

On this smoothing plot, the fits closely follow the data, which indicates that the model fits the data.

Histogram of the residuals

The histogram of the residuals shows the distribution of the residuals for all observations. If the model fits the data well, the residuals should be random with a mean of 0, so the histogram should be approximately symmetric around 0.

Normal probability plot of the residuals

The normal plot of the residuals displays the residuals versus their expected values when the distribution is normal.

Interpretation

Use the normal plot of the residuals to determine whether the residuals are normally distributed. However, this analysis does not require normally distributed residuals.

If the residuals are normally distributed, the normal probability plot of the residuals should approximately follow a straight line. The following patterns imply that the residuals are not normally distributed.

  • An S-curve implies a distribution with long tails.
  • An inverted S-curve implies a distribution with short tails.
  • A downward curve implies a right-skewed distribution.
  • A few points lying away from the line imply a distribution with outliers.
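
The coordinates behind a normal probability plot can be sketched as follows: sort the residuals and pair each one with the value expected at that rank under a normal distribution. The function name `normal_plot_points` and the residual values are hypothetical. The plotting position (i - 0.375) / (n + 0.25) is one common convention and is an assumption here; Minitab may use a different one.

```python
from statistics import NormalDist

def normal_plot_points(residuals):
    """Pair each sorted residual with its expected standard normal score.
    Assumption: plotting position (i - 0.375) / (n + 0.25)."""
    n = len(residuals)
    ordered = sorted(residuals)
    std_normal = NormalDist()
    scores = [
        std_normal.inv_cdf((i - 0.375) / (n + 0.25))
        for i in range(1, n + 1)
    ]
    return list(zip(scores, ordered))

# Hypothetical residuals.
residuals = [2.5, -4.0, 6.0, -1.0, 0.5]
points = normal_plot_points(residuals)
for score, residual in points:
    print(round(score, 3), residual)
```

If the residuals are approximately normal, the points fall close to a straight line when the residuals are plotted against the scores.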

Residuals versus fits

The residuals versus fits plot displays the residuals on the y-axis and the fitted values on the x-axis.

Interpretation

Use the residuals versus fits plot to determine whether the residuals are unbiased and have a constant variance. Ideally, the points should fall randomly on both sides of 0, with no recognizable patterns in the points.

The patterns in the following table may indicate that the residuals are biased or have a nonconstant variance.
Pattern | What the pattern may indicate
Fanning or uneven spreading of residuals across fitted values | Nonconstant variance
Curvilinear | A missing higher-order term
A point that is far away from zero | An outlier

If you see nonconstant variance or patterns in the residuals, your forecasts may not be accurate.

Residuals versus order

The residuals versus order plot displays the residuals in the order that the data were collected.

Interpretation

Use the residuals versus order plot to determine how accurate the fits are compared to the observed values during the observation period. Patterns in the points may indicate that the model does not fit the data. Ideally, the residuals on the plot should fall randomly around the center line.

The following patterns may indicate that the model does not fit the data.
Pattern | What the pattern may indicate
A consistent long-term trend | The model does not fit the data
A short-term trend | A shift or a change in pattern
A point that is far away from the other points | An outlier
A sudden shift in the points | The underlying pattern for the data has changed

The following examples show patterns that may indicate that the model does not fit the data.

Residuals systematically decrease as the order of the observations increases from left to right.

A sudden change in the values of the residuals occurs from low (left) to high (right).

Residuals versus variables

The residuals versus variables plot displays the residuals versus another variable.

Interpretation

Use the plot to determine whether the variable affects the response in a systematic way. If patterns are present in the residuals, the other variable may be associated with the response. You can use this information as the basis for additional studies.