Interpret all statistics for Predict

Find definitions and interpretation guidance for every statistic that is provided with the prediction analysis.

95% CI

The confidence interval for the fit provides a range of likely values for the mean response given the specified settings of the predictors.

Interpretation

Use the confidence interval to assess the estimate of the fitted value for the observed values of the variables.

For example, with a 95% confidence level, you can be 95% confident that the confidence interval contains the population mean for the specified values of the variables in the model. The confidence interval helps you assess the practical significance of your results. Use your specialized knowledge to determine whether the confidence interval includes values that have practical significance for your situation. A wide confidence interval indicates that you can be less confident about the mean of future values. If the interval is too wide to be useful, consider increasing your sample size.

95% PI

The prediction interval is a range that is likely to contain a single future response for a selected combination of variable settings.

Interpretation

With a 95% PI, you can be 95% confident that a single response will be contained in the interval given the settings of the predictors that you specified. The prediction interval is always wider than the confidence interval because of the added uncertainty involved in predicting a single response versus the mean response.

For example, a materials engineer at a furniture manufacturing site develops a simple regression model to predict the stiffness of particleboard from the density of the board. The engineer verifies that the model meets the assumptions of the analysis. Then, the analyst uses the model to predict the stiffness.

The regression equation predicts that the stiffness for a new observation with a density of 25 is -21.53 + 3.541*25, or 66.995. While it is unlikely that such an observation would have a stiffness of exactly 66.995, the prediction interval indicates that the engineer can be 95% confident that the actual value will be between approximately 48 and 86.

The prediction interval is always wider than the corresponding confidence interval. In this example, the 95% confidence interval indicates that the engineer can be 95% confident that the mean stiffness will be between approximately 60 and 74.

Fit

Fitted values are also called fits or . The fitted values are point estimates of the mean response for given values of the predictors. The values of the predictors are also called x-values.

Interpretation

Fitted values are calculated by entering the specific x-values for each observation in the data set into the model equation.

For example, if the equation is y = 5 + 10x, the fitted value for the x-value, 2, is 25 (25 = 5 + 10(2)).

Regression equation

Use the regression equation to describe the relationship between the response and the terms in the model. The regression equation is an algebraic representation of the regression line. The regression equation for the linear model takes the following form: y = b0 + b1x1. In the regression equation, y is the response variable, b0 is the constant or intercept, b1 is the estimated coefficient for the linear term (also known as the slope of the line), and x1 is the value of the term.

The regression equation with more than one term takes the following form:

y = b0 + b1x1 + b2x2 + ... + bkxk

In the regression equation, the letters represent the following:
  • y is the response variable
  • b0 is the constant
  • b1, b2, ..., bk are the coefficients
  • x1, x2, ..., xk are the values of the term

If the model contains both continuous and categorical variables, the regression equation table can display an equation for each level of the categorical variable. To use these equations for prediction, you must choose the correct equation, based on the values of the categorical variables, and then enter the values of the continuous variables.

SE Fit

The standard error of the fit (SE fit) estimates the variation in the estimated mean response for the specified variable settings. The calculation of the confidence interval for the prediction uses the standard error of the fit.

Interpretation

The smaller the standard error, the more precise the predicted mean response. For example, an analyst develops a model to predict delivery time. For one set of variable settings, the model predicts a mean for the delivery time of 3.80 days. The standard error of the fit for these settings is 0.08 days. For a second set of variable settings, the model produces the same mean delivery time with a standard error of the fit of 0.02 days. The analyst can be more confident that the mean delivery time for the second set of variable settings is close to 3.80 days.

With the fitted value, the standard error of the fit can be used to create a confidence interval for the mean response. For example, depending on your sample size, a 95% confidence interval extends approximately two standard errors above and below the predicted mean. For the delivery times, the 95% confidence interval for the predicted mean of 3.80 days when the standard error is 0.08 is (3.64, 3.96) days. You can be 95% confident that the population mean is within this range. When the standard error is 0.02, the 95% confidence interval is (3.76, 3.84) days. The confidence interval for the second set of variable settings is narrower because the standard error is smaller.

Variable settings

Minitab Express uses the regression equation and the variable settings to calculate the fit. If the variable settings are unusual compared to the data that was used to estimate the model, then a warning is displayed below the prediction.

Use the variable settings table to verify that you performed the analysis as you intended.

By using this site you agree to the use of cookies for analytics and personalized content.  Read our policy