Interpret the key results for Best Subsets Regression

In best subsets regression, Minitab selects the two models with the highest R2 values that contain one predictor, two predictors, and so on. You can determine which predictors are included in each model based on which columns in the output table are marked with an "X".

Use the goodness-of-fit statistics to determine which model provides the best fit to your data. Before you select a final model, you should examine residual plots and other diagnostic measures to ensure that the model meets the assumptions of the analysis.

R-sq

The higher the R2 value, the better the model fits your data. R2 is always between 0% and 100%.

R2 always increases when you add additional predictors to a model. For example, the best five-predictor model will always have an R2 that is at least as high as the best four-predictor model. Therefore, R2 is most useful when you compare models of the same size.

R-sq (adj)

Use adjusted R2 when you want to compare models that have different numbers of predictors. R2 always increases when you add a predictor to the model, even when there is no real improvement to the model. The adjusted R2 value incorporates the number of predictors in the model to help you choose the correct model.

R-sq (pred)

Use predicted R2 to determine how well your model predicts the response for new observations. Models that have larger predicted R2 values have better predictive ability.

A predicted R2 that is substantially less than R2 may indicate that the model is over-fit. An over-fit model occurs when you add terms for effects that are not important in the population. The model becomes tailored to the sample data and, therefore, may not be useful for making predictions about the population.

Predicted R2 can also be more useful than adjusted R2 for comparing models because it is calculated with observations that are not included in the model calculation.

Mallows' Cp
Mallows' Cp compares the precision and bias of the full model to models with the best subsets of predictors. A Mallows' Cp value that is close to the number of predictors plus the constant indicates that the model produces relatively precise and unbiased estimates.
S

Use S to assess how well the model describes the response. Use S instead of the R2 statistics to compare the fit of models that have no constant.

S is measured in the units of the response variable and represents how far the data values fall from the fitted values. The lower the value of S, the better the model describes the response. However, a low S value by itself does not indicate that the model meets the model assumptions. You should check the residual plots to verify the assumptions.

Consider the following points when you interpret the R2 values:
  • Small samples do not provide a precise estimate of the strength of the relationship between the response and predictors. For example, if you need R2 to be more precise, you should use a larger sample (typically, 40 or more).

  • Goodness-of-fit statistics are just one measure of how well the model fits the data. Even when a model has a desirable value, you should check the residual plots to verify that the model meets the model assumptions.

Response is Heat Flux

VarsR-SqR-Sq (adj)R-Sq (pred)Mallows CpSI
n
s
o
l
a
t
i
o
n
E
a
s
t
S
o
u
t
h
N
o
r
t
h
T
i
m
e

o
f

D
a
y
172.171.066.938.512.328      X 
139.437.126.3112.718.154X       
285.984.881.49.18.9321    XX 
282.080.674.217.810.076      XX
387.485.979.07.68.5978  XXX 
386.584.981.49.78.9110X  XX 
489.187.380.65.88.1698XXXX 
488.086.079.38.28.5550X  XXX
589.987.778.86.08.0390XXXXX
Key Results: R-Sq, R-Sq (adj), R-Sq (pred), Mallows' Cp, S

In these results, there are several models to examine further. The model with all 5 predictors has the lowest value of S and the highest value of adjusted R2, approximately 8 and 88 respectively. A model with 2 predictors and a model with 3 predictors both have the highest predicted R2 values of 81.4%. Before you select the final model, you should examine the models for violations of the regression assumptions using residual plots and other diagnostic measures.