This command is available with the Predictive Analytics Module. Click here for more information about how to activate the module.
A team of researchers collects data from the sale of individual residential properties in Ames, Iowa. The researchers want to identify the variables that affect the sale price. Variables include the lot size and various features of the residential property. The researchers want to assess how well the best MARS® model fits the data.
By default, MARS® Regression fits an additive model so all the basis functions in the regression equation use 1 predictor. The first predictor in the list is BF2. BF2 uses the predictor Living Area SF. Because the predictor is in 1 basis function, the predictor has 2 different slopes in the model. The function max(0, 3078 - Living Area SF) defines that the slope is non-zero when the living area is less than 3,078.
In these results, the list of basis functions has 15 basis functions but the optimal number of basis functions is 13. The regression equation contains 13 basis functions. The list of basis functions contains BF7 and BF17, which are the basis functions that identify the missing values. These basis functions are not important on their own because they did not reduce the MSE as much as other basis functions in the search. These 2 basis functions are in the list to show the full calculation of BF10 and BF 19, which are important.
Total predictors | 77 |
---|---|
Important predictors | 10 |
Maximum number of basis functions | 30 |
Optimal number of basis functions | 13 |
Statistics | Training | Test |
---|---|---|
R-squared | 89.61% | 87.61% |
Root mean squared error (RMSE) | 25836.5197 | 27855.6550 |
Mean squared error (MSE) | 667525749.7185 | 775937512.8264 |
Mean absolute deviation (MAD) | 17506.0038 | 17783.5549 |
The model summary table includes measures of how well the model performs. You can use these values to compare models. For these results, the test R-squared is about 88%.