Example for Fit Cox Model with Fixed Predictors only

Medical researchers want to determine whether the stage of larynx cancer affects the chance of death. The researchers plan to adjust the analysis for a patient's age. The researchers record the stage and age of 90 male cancer patients. Then, the researchers record the number of years between the first treatment and either the patient's death or the end of the study. Finally, the researchers record whether the patient died.

The medical researchers perform Cox regression to evaluate the relationship between death, age, and the stage of the cancer. The researchers also want to estimate the survival probability for a 60-year-old man for each stage.

Note

These data were adapted based on a public data set from Kardaun that can be found in Klein and Moeschberger (2003)1. However, the results in this example do not match the textbook because the textbook uses the Breslow method for handling ties and this example uses the Efron method.

  1. Open the sample data, LarynxCancer.MTW.
  2. Choose Stat > Reliability/Survival > Cox Regression > Fit Cox Model with Fixed Predictors only.
  3. In Response, enter Time.
  4. In Censoring column (optional), enter Death.
  5. In Continuous predictors, enter Age.
  6. In Categorical predictors, enter Stage.
  7. Select Graphs and check Display survival plot for predictor values.
  8. From the drop down, select Enter individual values. Enter the following values in the table.
    Age Stage
    60 I
    60 II
    60 III
    60 IV
  9. Select OK in each dialog box.

Interpret the results

First, the researchers use the goodness-of-fit tests to evaluate the overall fit of the model. The p-values for all 3 tests are below 0.05, so the researchers conclude that the model fits the data well. Then the researchers use the ANOVA table to evaluate the effect of individual terms. The p-value for stage is significant at an α-level of 0.05. Therefore, the medical researchers conclude that the stage of the cancer has a statistically significant effect on the survival of the patient. However, the p-value for age is 0.182, so the effect of age is not significant at an α-level of 0.05. The coefficients for the predictors define an equation that describes the relationship between the stage, the age of the patient, and the survival time.

The researchers use the Relative Risks for Categorical Predictors table to assess the risk between different stages of cancer. For example, the risk of death among patients at stage IV is 5.5 times higher than the risk for patients in Stage I. Moreover, the confidence interval shows that the true risk of death for patients in Stage IV could be as little as 2.4 times or as much as 12.6 times more than the risk for patients at Stage I, at the 95 percent level of confidence. The confidence interval does not contain 1, so the difference between the risk of death for Stage I and Stage IV is statistically significant.

The survival plot displays the survival probability for a 60-year-old man for each stage of cancer over multiple years. A diagnosis of stage IV cancer has the greatest affect on survival probability. The plot shows that after 1 year a 60 year old with stage IV cancer has only a 64% chance of surviving. The other three stages have a probability of 85% or higher. After 2 years, the probability drops to 42% for a patient with stage IV, but remains at 74% or higher for the other three stages.

Method

Cox model typeFixed predictors only
Categorical predictor coding(1, 0)
Tie adjustmentEfron

Censoring Information

Uncensored
Units
Censored
Units
TotalPercent
Censored
50409044.44%
Censoring value: Death = No

Regression Equation

Stage
IRisk Score=0.0 + 0.01903 Age
       
IIRisk Score=0.1400 + 0.01903 Age
       
IIIRisk Score=0.6424 + 0.01903 Age
       
IVRisk Score=1.706 + 0.01903 Age

Coefficients

TermCoefSE CoefZ-ValueP-Value
Age0.01900.01431.330.182
Stage       
  II0.1400.4620.300.762
  III0.6420.3561.800.071
  IV1.7060.4224.040.000

Relative Risks for Continuous Predictors

Unit of
Change
Relative
Risk
95% CI
Age11.0192(0.9911, 1.0481)

Relative Risks for Categorical Predictors

Level ALevel BRelative
Risk
95% CI
Stage     
  III1.1503(0.4647, 2.8477)
  IIII1.9010(0.9459, 3.8204)
  IVI5.5068(2.4086, 12.5901)
  IIIII1.6526(0.6819, 4.0049)
  IVII4.7872(1.7825, 12.8566)
  IVIII2.8968(1.2952, 6.4788)
Risk for level A relative to level B

Model Summary

ModelLog-LikelihoodR-sqAICAICcBIC
Without terms-196.86393.73393.73393.73
With terms-187.7118.65%383.41384.30391.06

Goodness-of-Fit Tests

TestDFChi-SquareP-Value
Likelihood Ratio418.310.001
Wald421.150.000
Score424.780.000

Analysis of Variance



Wald Test
SourceDFChi-SquareP-Value
Age11.780.182
Stage317.920.000
1 Klein, J.P. & Moeschberger, M.L. (2003). Examples of survival data: Death times of male laryngeal cancer patients. Survival Analysis: Techniques for Censored and Truncated Data (2nd ed., pp. 9-10). Springer.