Example for Fit Cox Model in a Counting Process Form

Medical researchers want to determine the success rate of recovery from a bone marrow transplant as a treatment for acute leukemia. Recovery depends on factors such as the patient's Risk Category at the time of transplantation, their Disease Stage, and whether their platelet count returned to normal levels. Risk Category and Disease Stage are fixed predictors because they do not change throughout the study. However, a patient's platelet count is a time-dependent predictor because the count can change during the recovery process.

The medical researchers study 137 patients after they receive the transplant and record the number of days they are disease-free. A patient is not disease-free if they die before their platelet count returns to normal or if their leukemia returns after their platelet count returns to normal. A value of Yes indicates a disease-free patient and is a censored observation. A censored observation is when the event does not occur by the end of the observation time.

The data are in the counting process form, which means that multiple rows represent each patient. Each row describes a time interval when the values of all the variables are constant. Time-dependent predictors change between rows. The intervals begin just after the start time and include the end time.

For example, the following table contains the data for the patient with an ID of 1. The observed values of Risk Category and Disease Stage are the same in every row because those predictors are fixed. Because a normal platelet count can change during the study, each patient requires a new row of data whenever this predictor changes. The first row shows that the patient did not have a normal platelet count in the interval of the first 13 days after the transplant. The second row shows that the patient had a normal platelet count from after day 13 until the end of the study on day 2,081.

ID Risk Category Start Time End Time Disease Free Normal Platelets Disease Stage
1 1 0 13 Yes No Normal
1 1 13 2081 Yes Yes Normal
Note

These data were adapted based on a public data set from Copelan that is in Klein and Moeschberger (2003)1.

  1. Open the sample data, BoneMarrow.MTW.
  2. Choose Stat > Reliability/Survival > Cox Regression > Fit Cox Model in a Counting Process Form.
  3. In Start Time, enter Start Time. In End Time, enter End Time.
  4. In Censoring column (optional), enter Disease Free. In Censoring value, select Yes.
  5. In Categorical predictors, enter Risk Category, Normal Platelets, and Disease Stage.
  6. Select Options. In Case identification (for subject residuals), enter ID.
  7. Select OK in each dialog box.

Interpret the results

First, the researchers use the goodness-of-fit tests to evaluate the overall fit of the model. The p-values for all 3 tests are below 0.05, so the researchers conclude that the model fits the data well. Then the researchers use the ANOVA table to evaluate the effect of individual terms. The p-values for all 3 terms are significant at an α-level of 0.05. Therefore, the medical researchers conclude that the patient's risk category at the time of transplantation, their disease stage, and whether their platelet count is at normal levels all have a statistically significant effect on whether a patient recovers from a bone marrow transplant.

The researchers use the Relative Risks for Categorical Predictors table to assess the risk between different levels of the predictors. For example, the risk of death or recurrence of leukemia among patients with normal platelets is 0.37 times lower than a patient without normal platelets. Moreover, the confidence interval shows that the true risk of death or recurrence for patients with normal platelets could be as little as 0.19 times or as much as 0.7 times less than the risk for patients without normal platelets, at the 95 percent level of confidence. The confidence interval does not contain 1, so the difference between the risk of death or recurrence for patients with and without normal platelets is statistically significant.

Method

Cox model typeCounting process form
Categorical predictor coding(1, 0)
Tie adjustmentEfron
Rows unused1

Censoring Information

Uncensored
Units
Censored
Units
TotalPercent
Censored
8317325667.58%
Censoring value: Disease Free = Yes

Regression Equation

Risk Score=0.0 Risk Category_1 - 0.793 Risk Category_2 - 0.033 Risk Category_3
+ 0.0 Normal Platelets_No - 1.004 Normal Platelets_Yes + 0.0 Disease Stage_High
Risk - 0.696 Disease Stage_Normal

Coefficients

TermCoefSE CoefZ-ValueP-Value
Risk Category       
  2-0.7930.321-2.470.014
  3-0.0330.325-0.100.919
Normal Platelets       
  Yes-1.0040.332-3.020.003
Disease Stage       
  Normal-0.6960.275-2.530.011

Relative Risks for Categorical Predictors

Level ALevel BRelative
Risk
95% CI
Risk Category     
  210.4524(0.2409, 0.8495)
  310.9673(0.5116, 1.8290)
  322.1383(1.2487, 3.6616)
Normal Platelets     
  YesNo0.3666(0.1912, 0.7029)
Disease Stage     
  NormalHigh Risk0.4986(0.2909, 0.8547)
Risk for level A relative to level B

Model Summary

ModelLog-LikelihoodR-sqAICAICcBIC
Without terms-373.30746.59746.59746.59
With terms-358.6011.47%725.20725.71734.88

Goodness-of-Fit Tests

TestDFChi-SquareP-Value
Likelihood Ratio429.390.000
Wald432.470.000
Score435.220.000

Analysis of Variance



Wald Test
SourceDFChi-SquareP-Value
Risk Category29.770.008
Normal Platelets19.130.003
Disease Stage16.410.011
1 Klein, J.P. & Moeschberger, M.L. (2003). Semiparametric proportional hazards regression with fixed covariates. Survival Analysis: Techniques for Censored and Truncated Data (2nd ed., pp. 243-293). Springer.