Misclassification rate vs number of trees plot for Fit Model and Discover Key Predictors with TreeNet® Classification

Note

This command is available with the Predictive Analytics Module. Click here for more information about how to activate the module.

The Misclassification Rate vs Number of Trees Plot plots the misclassification rate on the y-axis and the number of trees on the x-axis. The minimum misclassification rate indicates whether the model is a good classifier. Use the test results to assess the performance of the model to predict new observations. Compare the training results and the test results to see whether there are overfitting problems with the model for the training data set.

When the area under the ROC curve determines the number of trees for the optimal model, then Minitab displays the Area Under ROC Curve vs Number of Trees plot. When the maximum likelihood value determines the number of trees for the optimal model, then Minitab displays the Average-LogLikelihood vs Number of Trees plot.

Interpretation

The misclassification rates must be ≥ 0. Lower values indicate a better classification model. The reference line indicates the optimal misclassification rate for the test data and the number of trees in the model. If the test curve indicates an insufficient model, consider whether to retry the analysis with alternative settings, such as larger or smaller learning rates, or a larger subsample fraction.