Interpret all statistics for Nominal Logistic Regression

Find definitions and interpretation for every statistics that is provided with the nominal logistic regression analysis.

Response information

Minitab displays this information about the response:
Variable
Name of the response variable
Value
Levels of the response variable
Count
Number of observations at each level of the response variable
Total
Number of nonmissing observations

The output also identifies which level of the response is the reference event.

Interpretation

Use the response information to examine how much data are in the analysis. Larger random samples with many occurrences of each level usually provide more accurate inferences about the population.

Also use the response information to determine which event is the reference event. Interpretation of statistics like coefficients and odds ratios depend on which event is the reference event.

Factor information

The factor information table displays the factors in the design, the numbers of levels, and the values of the levels. Factors can assume only a limited number of possible values, known as factor levels. Factor levels can be text or numeric. Numeric factors use a few controlled values in the experiment, even though many values are possible.

Interpretation

Use the factor information table to see the number of levels in the analysis. For example, a quality analyst plans to study factors that could affect plastic strength during the manufacturing process. The analyst includes Additive. Additive is a categorical variable which can be type A or type B.

Factor Information

FactorLevelsValues
Additive2A, B

Factors can be crossed or nested. Two factors are crossed when each level of one factor occurs in combination with each level of the other factor. Two factors are nested when a set of the levels for one factor appear at only one level of a second factor. For example, if a design contains machine and operator, these factors are crossed if all operators use all machines. However, operator is nested in machine if each machine has a different set of operators.

In the factor information table, parentheses indicate nested factors. For example, Standard(Appraiser) indicates that Standard is nested within Appraiser. In this context, the nesting indicates that each appraiser has their own set of standard parts. The factor levels of a nested factor are repeated for each level of nesting, which increases the number of levels for the nested factor. In this example, each appraiser has 5 standards, but because standard is nested in appraiser, standard has 20 different levels.

Factor Information

FactorLevelsValues
Standard(Appraiser)201(Amanda), 2(Amanda), 3(Amanda), 4(Amanda), 5(Amanda), 1(Britt),
2(Britt), 3(Britt), 4(Britt), 5(Britt), 1(Eric), 2(Eric),
3(Eric), 4(Eric), 5(Eric), 1(Mike), 2(Mike), 3(Mike), 4(Mike),
5(Mike)
Appraiser4Amanda, Britt, Eric, Mike

For more information on factors, go to Factors and factor levels, What are factors, crossed factors, and nested factors?, and What is the difference between fixed and random factors?.

Coef

The nominal logistic equation treats each nominal outcome separately. The logistic regression equation is comprised of multiple logit functions, one for each value of the response minus one. Each equation has a unique slope for the predictors. These equations evaluate how the probability of one nominal outcome changes relative to another nominal outcome as the predictor variables change.

Interpretation

Use the coefficients to examine how the probability of an outcome changes as the predictor variables change. The estimated coefficient for a predictor represents the change in the link function for each unit change in the predictor, while the other predictors in the model are held constant. The relationship between the coefficient and the probability of an outcome depends on several aspects of the analysis, including the reference outcome for the response variable and the reference levels for categorical predictors. Generally, positive coefficients make the reference outcome less likely as the predictor increases. Negative coefficients make the reference outcome more likely as the predictor increases. An estimated coefficient near 0 implies that the effect of the predictor is small.

For example, a school administrator wants to assess different teaching methods. She uses age and teaching method to predict which subjects students prefer. The first outcome event is first in the response information table and is the reference outcome for the response variable. For this data, the reference outcome is that the student prefers science. Logit 1 compares the probability a student prefers math to science. In this equation, the p-value for the coefficient for age is greater than 0.7. Such a high p-value suggests that age has little effect on a whether a student prefers math to science.

Logit 2 compares arts to science. In this equation, the coefficient for age is larger than the coefficient that compares math to science. The coefficient for age is positive. As students get older, students are more likely to prefer arts to science.

The interpretation of the coefficients for categorical predictors depends on the reference level for the factor. In the teaching methods data, the two levels for teaching method are "Demonstrate" and "Explain." "Demonstrate" is not in the coefficients table, so "Demonstrate" is the reference level. The p-value for "Explain" in the equation that compares math to science is greater than 0.5. Such a high p-value suggests that the teaching method has little effect on whether a student prefers math to science.

In Logit 2, the coefficient for "Explain" is larger than the coefficient that compares math to science. The p-value for this coefficient is less than 0.05, so this coefficient is statistically significant at the 0.05 level. The coefficient for "Explain" in this equation is positive. When the teaching method is "Explain," the student is more likely to prefer art.

Response Information

VariableValueCount
SubjectScience10(Reference Event)
  Math11 
  Arts9 
  Total30 

Factor Information

FactorLevelsValues
Teaching Method2Demonstrate, Explain

Logistic Regression Table







95% CI
PredictorCoefSE CoefZPOdds RatioLowerUpper
Logit 1: (Math/Science)             
Constant-1.122664.56425-0.250.806     
Teaching Method             
  Explain-0.5631150.937591-0.600.5480.570.093.58
Age0.1246740.4010790.310.7561.130.522.49
Logit 2: (Arts/Science)             
Constant-13.84857.24256-1.910.056     
Teaching Method             
  Explain2.769921.372092.020.04415.961.08234.90
Age1.013540.5844941.730.0832.760.888.66
Log-Likelihood = -26.446

Test of All Slopes Equal to Zero

DFGP-Value
412.8250.012

Goodness-of-Fit Tests

MethodChi-SquareDFP
Pearson6.95295100.730
Deviance7.88622100.640

SE Coef

The standard error of the coefficient estimates the variability between coefficient estimates that you would obtain if you took samples from the same population again and again. The calculation assumes that the sample size and the coefficients to estimate would remain the same if you sampled again and again.

Interpretation

Use the standard error of the coefficient to measure the precision of the estimate of the coefficient. The smaller the standard error, the more precise the estimate.

Z-value

The Z-value is a test statistic that measures the ratio between the coefficient and its standard error.

Interpretation

Minitab uses the Z-value to calculate the p-value, which you use to make a decision about the statistical significance of the terms and the model. The test is accurate when the sample size is large enough that the distribution of the sample coefficients follows a normal distribution.

A Z-value that is sufficiently far from 0 indicates that the coefficient estimate is both large and precise enough to be statistically different from 0. Conversely, a Z-value that is close to 0 indicates that the coefficient estimate is too small or too imprecise to be certain that the term has an effect on the response.

P-value

The p-value is a probability that measures the evidence against the null hypothesis. Lower probabilities provide stronger evidence against the null hypothesis.

Interpretation

To determine whether the association between the response and each term in the model is statistically significant, compare the p-value for the term to your significance level to assess the null hypothesis. The null hypothesis is that the term's coefficient is equal to zero, which implies that there is no association between the term and the response. Usually, a significance level (denoted as α or alpha) of 0.05 works well. A significance level of 0.05 indicates a 5% risk of concluding that an association exists when there is no actual association.
P-value ≤ α: The association is statistically significant
If the p-value is less than or equal to the significance level, you can conclude that there is a statistically significant association between the response variable and the term.
P-value > α: The association is not statistically significant
If the p-value is greater than the significance level, you cannot conclude that there is a statistically significant association between the response variable and the term. You may want to refit the model without the term.
If there are multiple predictors without a statistically significant association with the response, you can reduce the model by removing terms one at a time. For more information on removing terms from the model, go to Model reduction.
If a model term is statistically significant, the interpretation depends on the type of term. The interpretations are as follows:
  • If a continuous predictor is significant, you can conclude that the response level probabilities depend on the predictor.
  • If a categorical predictor is significant, you can conclude that response levels have different probabilities of occurring at that level of the factor than at the reference level of the factor.
  • If an interaction term is significant, you can conclude that the relationship between a predictor and the response level probabilities depends on the other predictors in the term.
  • If a polynomial term is significant, you can conclude that the relationship between a predictor and the response level probabilities depends on the magnitude of the predictor.

Odds ratio

The odds ratio compares the odds of two outcomes. The odds of an outcome are the probability that the comparison outcome occurs divided by the probability that the reference outcome occurs.

Interpretation

Use the odds ratio to understand the effect of a predictor. The interpretation of the odds ratio depends on whether the predictor is categorical or continuous. In the logistic regression table, the comparison outcome is first outcome after the logit label and the reference outcome is the second outcome. The reference outcome is the same for every logit.

Odds ratios for continuous predictors

Odds ratios that are greater than 1 indicate that the comparison outcome is more likely than the reference outcome as the predictor increases. Odds ratios that are less than 1 indicate that the reference outcome is more likely than the comparison outcome.

For example, a school administrator wants to assess different teaching methods. For logit 1, the comparison outcome is math. For logit 2, the comparison outcome is arts. The reference outcome is science. In logit 2, the estimate of the odds ratio is 2.76, which is greater than 1. As age increases, a student is more likely to prefer arts to science. For each additional year of age, the odds that a student prefers arts is 3 times greater than the odds that they prefer science.

Logistic Regression Table







95% CI
PredictorCoefSE CoefZPOdds RatioLowerUpper
Logit 1: (Math/Science)             
Constant-1.122664.56425-0.250.806     
Teaching Method             
  Explain-0.5631150.937591-0.600.5480.570.093.58
Age0.1246740.4010790.310.7561.130.522.49
Logit 2: (Arts/Science)             
Constant-13.84857.24256-1.910.056     
Teaching Method             
  Explain2.769921.372092.020.04415.961.08234.90
Age1.013540.5844941.730.0832.760.888.66

Odds ratios for categorical predictors

For categorical predictors, the odds ratio compares the odds of the comparison outcome at two different levels of the predictor. The comparison level is in the logistic regression table and has an estimated odds ratio. Odds ratios that are greater than 1 indicate that the comparison outcome becomes more likely relative to the reference outcome when the categorical predictor changes from the reference level to the comparison level. Odds ratios that are less than 1 indicate that the comparison outcome becomes less likely relative to the reference outcome when the categorical predictor changes from the reference level to the comparison level.

For example, a school administrator wants to assess different teaching methods. For logit 1, the comparison outcome is math. For logit 2, the comparison outcome is arts. The reference outcome is science. For logit 2, the estimate of the odds ratio for teaching method is 15.96, which is greater than 1. When the teaching methods changes from "demonstrate" to "explain," the odds that a student prefers arts is about 16 times greater than the odds that they prefer science.

Logistic Regression Table







95% CI
PredictorCoefSE CoefZPOdds RatioLowerUpper
Logit 1: (Math/Science)             
Constant-1.122664.56425-0.250.806     
Teaching Method             
  Explain-0.5631150.937591-0.600.5480.570.093.58
Age0.1246740.4010790.310.7561.130.522.49
Logit 2: (Arts/Science)             
Constant-13.84857.24256-1.910.056     
Teaching Method             
  Explain2.769921.372092.020.04415.961.08234.90
Age1.013540.5844941.730.0832.760.888.66

Confidence interval for odds ratio (95% CI)

These confidence intervals (CI) are ranges of values that are likely to contain the true values of the odds ratios. The calculation of the confidence intervals uses the normal distribution. The confidence interval is accurate if the sample size is large enough that the distribution of the sample odds ratios follow a normal distribution.

Because samples are random, two samples from a population are unlikely to yield identical confidence intervals. However, if you take many random samples, a certain percentage of the resulting confidence intervals contain the unknown population parameter. The percentage of these confidence intervals that contain the parameter is the confidence level of the interval.

The confidence interval is composed of the following two parts:
Point estimate
The point estimate is the estimate of the parameter that is calculated from the sample data.
Margin of error
The margin of error defines the width of the confidence interval and is affected by the range of the event probabilities, the sample size, and the confidence level.

Interpretation

Use the confidence interval to assess the estimate of the odds ratio.

For example, with a 95% confidence level, you can be 95% confident that the confidence interval contains the value of the odds ratio for the population. The confidence interval helps you assess the practical significance of your results. Use your specialized knowledge to determine whether the confidence interval includes values that have practical significance for your situation. If the interval is too wide to be useful, consider increasing your sample size.

Test for terms with more than 1 degree of freedom

This test is an overall test that considers all of the coefficients for a categorical predictor simultaneously. The test is for categorical predictors with more than 2 levels.

Interpretation

Use the test to determine whether a categorical predictor with more than 1 coefficient has a statistically significant relationship with the response events. When a categorical predictor has more than 2 levels, the coefficients for individual levels have different p-values. The overall test gives a single answer about whether the predictor is statistically significant.

To determine whether the association between the response events and the categorical predictor is statistically significant, compare the p-value for the test to your significance level to assess the null hypothesis. The null hypothesis is that no association exists between the predictor and the response events. Usually, a significance level (denoted as α or alpha) of 0.05 works well. A significance level of 0.05 indicates a 5% risk of concluding that an association exists when there is no actual association.
P-value ≤ α: The association is statistically significant
If the p-value is less than or equal to the significance level, you can conclude that there is a statistically significant association between the response variable and the predictor.
P-value > α: The association is not statistically significant
If the p-value is greater than the significance level, you cannot conclude that there is a statistically significant association between the response variable and the predictor.

Log-likelihood

Minitab maximizes the log-likelihood function to find optimal values of the estimated coefficients.

Interpretation

Use the log-likelihood to compare two models that use the same data to estimate the coefficients. Because the values are negative, the closer to 0 the value is, the better the model fits the data.

The log-likelihood cannot decrease when you add terms to a model. For example, a model with 5 terms has higher log-likelihood than any of the 4-term models you can make with the same terms. Therefore, log-likelihood is most useful when you compare models of the same size. To make decisions about individual terms, you usually look at the p-values for the term in the different logits.

Test that all slopes are zero

This test is an overall test that considers all of the coefficients for predictors in the model.

Interpretation

Use the test to determine whether at least one of the predictors in the model has a statistically significant association with the response events. Usually, you do not interpret the G statistic or the degrees of freedom (DF). The DF are equal to the number of coefficients for predictors in the model.

To determine whether the association between the response events and the predictors is statistically significant, compare the p-value for the test to your significance level to assess the null hypothesis. The null hypothesis is that all of the coefficients for predictors in the model are zero, which implies that no association exists between the response events and any of the predictors. Usually, a significance level (denoted as α or alpha) of 0.05 works well. A significance level of 0.05 indicates a 5% risk of concluding that an association exists when there is no actual association.
P-value ≤ α: The association is statistically significant
If the p-value is less than or equal to the significance level, you can conclude that there is a statistically significant association between the response variable and at least one of the predictors.
P-value > α: The association is not statistically significant
If the p-value is greater than the significance level, you cannot conclude that there is a statistically significant association between the response variable and any of the terms.

Pearson goodness-of-fit test

The Pearson goodness-of-fit test assesses the discrepancy between the current model and the full model.

Interpretation

Use the goodness-of-fit tests to determine whether the predicted probabilities deviate from the observed probabilities in a way that the multinomial distribution does not predict. The test is not useful when the number of distinct values is approximately equal to the number of observations, but the test is useful when you have multiple observations at the same values of the predictors. If the p-value for the goodness-of-fit test is lower than your chosen significance level, the predicted probabilities deviate from the observed probabilities in a way that the multinomial distribution does not predict. This list provides common reasons for the deviation:
  • Omitted higher-order term for variables in the model
  • Omitted predictor that is not in the model

Deviance goodness-of-fit test

The deviance goodness-of-fit test assesses the discrepancy between the current model and the full model.

Interpretation

Use the goodness-of-fit tests to determine whether the predicted probabilities deviate from the observed probabilities in a way that the multinomial distribution does not predict. The test is not useful when the number of distinct values is approximately equal to the number of observations, but the test is useful when you have multiple observations at the same values of the predictors. If the p-value for the goodness-of-fit test is lower than your chosen significance level, the predicted probabilities deviate from the observed probabilities in a way that the multinomial distribution does not predict. This list provides common reasons for the deviation:
  • Omitted higher-order term for variables in the model
  • Omitted predictor that is not in the model