Example of Fit Regression Model

A research chemist wants to understand how several predictors are associated with the wrinkle resistance of cotton cloth. The chemist examines 32 pieces of cotton cellulose produced at different settings of curing time, curing temperature, formaldehyde concentration, and catalyst ratio. The durable press rating, a measure of wrinkle resistance, is recorded for each piece of cotton.

The chemist performs a multiple regression analysis to fit a model with the predictors and eliminate the predictors that do not have a statistically significant relationship with the response.

  1. Open the sample data, WrinkleResistance.MTW.
  2. Choose Stat > Regression > Regression > Fit Regression Model.
  3. In Responses, enter Rating.
  4. In Continuous predictors, enter Conc Ratio Temp Time.
  5. Click Graphs.
  6. Under Effects Plots, check Pareto.
  7. Under Residuals plots, choose Four in one.
  8. In Residuals versus the variables, enter Conc Ratio Temp Time.
  9. Click OK in each dialog box.

Interpret the results

The predictors temperature, catalyst ratio, and formaldehyde concentration have p-values that are less than the significance level of 0.05. These results indicate that these predictors have a statistically significant effect on wrinkle resistance. The p-value for time is greater than 0.05, which indicates that there is not enough evidence to conclude that time is related to the response. The chemist may want to refit the model without this predictor.

The Pareto chart shows that the effects for temperature, catalyst ratio, and formaldehyde concentration are statistically significant at the 0.05 level of significance. The largest effect is catalyst ratio because it extends the farthest. The effect for time is the smallest because it extends the least.

The residual plots indicate that there may be problems with the model.
  • The points on the residuals versus fits plot do not appear to be randomly distributed about zero. There appear to be clusters of points that could represent different groups in the data. The chemist should investigate the groups to determine their cause.
  • The plot of the residuals versus ratio shows curvature, which suggests a curvilinear relationship between catalyst ratio and wrinkles. The chemist should consider adding a quadratic term for ratio to the model.

Regression Equation

Rating=-0.756 + 0.1545 Conc + 0.2171 Ratio + 0.01081 Temp + 0.0946 Time

Coefficients

TermCoefSE CoefT-ValueP-ValueVIF
Constant-0.7560.736-1.030.314 
Conc0.15450.06332.440.0221.03
Ratio0.21710.03166.860.0001.02
Temp0.010810.004622.340.0271.04
Time0.09460.05461.730.0941.00

Model Summary

SR-sqR-sq(adj)R-sq(pred)
0.81184072.92%68.90%62.81%

Analysis of Variance

SourceDFAdj SSAdj MSF-ValueP-Value
Regression447.909611.977418.170.000
  Conc13.92323.92325.950.022
  Ratio131.021631.021647.070.000
  Temp13.60313.60315.470.027
  Time11.98391.98393.010.094
Error2717.79530.6591   
  Lack-of-Fit2517.78360.7113121.940.008
  Pure Error20.01170.0058   
Total3165.7049     

Fits and Diagnostics for Unusual Observations

ObsRatingFitResidStd Resid
94.8003.1781.6222.06R
R  Large residual