Minitab can use either the (0, 1) or (−1, 0, +1) coding scheme to include categorical variables in the model. The (0, 1) scheme is the default for regression analysis while the (−1, 0, +1) scheme is the default for ANOVA and DOE. The choice between these two schemes does not change the statistical significance of the categorical variables. However, the coding scheme does change the coefficients and how to interpret them.
Verify the coding scheme that is displayed to ensure that you performed the intended analysis. Interpret the coefficients for the categorical variables as follows:
If you chose to standardize the continuous predictors in your model, Minitab provides details about the method in the Continuous predictor standardization table.
Usually, you use standardization to center variables, to scale variables, or both. When you center variables, you reduce multicollinearity caused by polynomial terms and interaction terms, which improves the precision of the coefficient estimates. In most cases, when you scale variables, Minitab converts the different scales of the variables to a common scale, which lets you compare the size of the coefficients.
Use the standardization method table to verify that you performed the analysis as you intended. Depending on your choice for the method, you may have to change the interpretation of the coefficients as follows:
The exact interpretation of the coefficients also depends on other aspects of the analysis, such as the link function.
Use the link function to find a model that best fits your data. Use the goodness-of-fit statistics to compare fits using different link functions. Certain link functions can be used for historical reasons or because they have a special meaning in a discipline.
One advantage of the logit link function is that it provides an estimate of the odds ratio for each predictor in the model.
The output also identifies which level of the response is the reference event.
Use the response information to examine how much data are in the analysis. Larger random samples with many occurrences of each level usually provide more accurate inferences about the population.
Also use the response information to determine which event is the reference event. Interpretation of statistics like coefficients and odds ratios depend on which event is the reference event.
When you use a test data set, the table shows the percentage of the data that are in the test data set. When you use cross-validation, the table shows the number of folds. When you specify a column that specifies which observations are in the test data set or which observations are in each fold, then the table shows the title of the column.
Verify the validation method that is in the results to ensure that you performed the intended analysis.