The Cox regression model and Aalen’s additive model are applied to the data. Estimates of the coefficients \(\:{\alpha\:}_{r}\left(t\right)\)for\(\:\:r=\text{0,1},2,\dots\:,8\), and \(\:{\beta\:}_{r}\left(t\right)\) for \(\:r=\text{1,2},\dots\:,8\) for Aalen’s additive model and Cox regression model respectively are shown in Table 1. The table also contains standard errors and p-values for the time-invariant effect for Aalen’s additive model and p-values for the constant covariate effect for the Cox regression model.
Table 1
Estimated parameters for the Aalen's additive model and Cox regression model: First output
covariates | Aalen’s additive model | Cox regression model |
| Coef\(\:{\alpha\:}_{r}\left(t\right)\) | RR | SE | z | p-value | Coef \(\:{\beta\:}_{r}\left(t\right)\) | RR | SE | z | p-value |
factor(Gender)1 | 0.059 | 1.061 | 0.059 | 1.380 | 0.169 | 1.091 | 2.977 | 0.457 | 2.386 | 0.017 |
WHOSBL | -0.062 | 0.940 | 0.046 | -2.220 | 0.027 | -1.435 | 0.238 | 0.283 | -1.538 | 0.124 |
Age | -0.000 | 1 | 0.004 | -0.079 | 0.937 | -0.031 | 0.969 | 0.027 | -1.113 | 0.266 |
factor(CD4BL)3 | -0.046 | 0.955 | 0.111 | -0.679 | 0.497 | -0.242 | 0.785 | 0.625 | -0.388 | 0.698 |
factor(CD4BL)4 | -0.083 | 0.920 | 0.108 | -1.300 | 0.195 | -0.992 | 0.371 | 0.611 | -1.623 | 0.105 |
factor(CD4BL)5 | -0.132 | 0.876 | 0.104 | -2.170 | 0.030 | -1.548 | 0.213 | 0.616 | -2.511 | 0.012 |
BMI | 0.002 | 1.002 | 0.006 | 0.328 | 0.743 | -0.015 | 0.985 | 0.040 | -0.362 | 0.717 |
factor(Reaction)1 | 0.005 | 1.005 | 0.061 | 0.082 | 0.935 | 0.027 | 1.027 | 0.419 | 0.064 | 0.949 |
factor(DTB)1 | -0.074 | 0.929 | 0.049 | -1.430 | 0.151 | -0.951 | 0.386 | 0.415 | -2.294 | 0.022 |
factor(TBB4Enrl)1 | 0.036 | 1.037 | 0.032 | 1.120 | 0.261 | 0.363 | 1.438 | 0.230 | 1.580 | 0.114 |
Key: Coef = coefficient; SE = standard error; z = z score or z statistic or critical value; RR = relative risk |
The exponent coefficients or the relative risks in Table 1 are interpreted as additive effects or multiplicative effects for the Aalen model and the Cox model respectively.
The results show that for Aalen's additive model, two covariates turned to be significant at the 5% level, that is, WHO stage baseline (WHOSBL) and CD4BL5 (below 200 cells/mm3). The coefficient associated with WHOSBL is -0.062 with a standard error of 0.046 and hence a z-statistic of -2.220, giving a significant p value of 0.027. The associated relative risk (RR) is 0.940, implying that a unit increase in WHOSBL reduces the rate of immune deterioration by 0.940. The CD4BL5, has a coefficient is -0.132 and a standard error of 0.104, giving a z-statistic of -2.170 and hence a significant p-value of 0.030. The RR is 0.876, implying that a patient with \(\:0\:\le\:CD4\:<\:200\) has a slightly lower rate of immune deterioration than a patient with\(\:\:500\:\le\:CD4\:<\:750\). The Cox regression model is also fitted to the same data and three covariates turned out to be significant at 5% level, that is, Gender (male=1; female=0), CD4BL5 (individuals with CD4 baseline below 200 cells/mm3) and DTB1 (developed TB during treatment=1; did not develop TB during treatment=0). The coefficient of Gender is 1.091 with a standard error of 0.457, giving a z-statistic of 2.386 and hence a significant p-value of 0.017. The RR is 2.977, implying males increase the rate of immune deterioration by 2.977 when compared to females. The coefficient of CD4BL5 is -1.548 with a standard error of 0.616, giving a z-statistic of -2.511 and hence a significant p-value of 0.012. The RR is 0.213, implying that a patient with \(\:0\:\le\:CD4\:<\:200\) has a much lower rate of immune deterioration than a patient with\(\:\:500\:\le\:CD4\:<\:750\). The coefficient of (DTB)1 is -0.951 with a standard error of 0.415, giving a z-statistic of -2.294 and hence a significant p-value of 0.022. The RR is 0.386, implying that a patient who developed TB on ART (DTB) has a lower rate of immune deterioration than a patient who did not develop TB.
The results show that the relative risk of HIV progression, indicated by immune deterioration, for males, is 1.061 times or 2.977 times greater than that of their female counterparts if the Aalen additive model or Cox models, respectively, is used. Patients with a CD4 baseline below 200 cells/mm3 (CD4BL5) have a 0.876 times smaller risk of immune deterioration than patients who start treatment with a CD4 baseline between 500 and 750 cells/mm3 (CD4BL2). This is the lowest rate compared to all the other categories of higher CD4 baseline. Patients who react to treatment React1 have a 1.005 times greater risk of immune deterioration than patients who did not react to treatment.
The test for the significance of the models fitted in Table 1 is also performed at 5% level. The overall fit of Aalen’s additive model is investigated by the supremum-test of significance and the test for time-invariant effect is investigated using the Kolmogorov-Smirnov test and the results are shown in Table 2 below.
Table 2
Test for the significance of Aalen’s additive model
Test for non-significant effects: | Test for time-invariant effects: |
Supremum-test of sig | p-value H0: \(\:{\beta\:}_{j}\left(t\right)=0\) | Kolmogorov-Smirnov test | p-value H0: constant effect | Cramer von Mises test | p-value H0: constant effect |
\(\:{\beta\:}_{0}\left(t\right)\)=6.34 | 0.000 | \(\:{\beta\:}_{0}\left(t\right)\)=3.07 | 0.000 | \(\:{\beta\:}_{0}\left(t\right)\)=55.2 | 0.000 |
The output shown above contains a test for non-significance and time-invariance effects of the baseline rate for the fitted Aalen's additive model. The results from Table 2 above show that the tests are statistically significant as shown by small p-values (all equal to 0.000).
A goodness-of-fit test is also performed for the fitted Cox regression model and is investigated using three asymptotically equivalent test statistics; the likelihood ratio test, the Wald test, and the Score (logrank) test. Under the null hypothesis; \(\:{H}_{j}:{\beta\:}_{j}\left(t\right)=\beta\:\), these three statistics are asymptotically chi-squared distributed. The results are presented in Table 3 below.
Table 3
Test for the significance of Cox’s regression model
Statistic | Test statistic | Degrees of freedom | p-value |
Likelihood ratio test | 26.52 | 10 | p = 0.003104 |
Wald test | 27.66 | 10 | p = 0.002047 |
Score (logrank) test | 30.06 | 10 | p = 0.0008389 |
Concordance = 0.712 (se = 0.044 ); R-square(R2) = 0.244 (max possible = 0.999 ) |
The overall fit of the Cox model is investigated by the likelihood ratio, Wald and Score tests. The test is statistically significant with p-values of 0.003104, 0.002047, and 0.0008, respectively.
The analysis is done using a backward stepwise selection which starts with a model with all covariates and the removal of some covariates is based on the p-values. The first step, shown in Table 1, shows that for both models the variables Reaction1 (individuals who developed some adverse reaction to treatment) and body mass index (BMI) are non-significant (p-values above 0.9 and p-values above 0.7 respectively) to both the time-varying effect and the constant covariate effect of HIV/aids progression for individuals on drug therapy. Therefore, in step 2 the variable Reaction is removed from the analysis since it has the highest p-value.
The results from analysis for step 2 are not shown but they show the significance of the goodness of fit for both models. Results from step 2 still maintain non-significance for the variable Body Mass index (BMI) during treatment for both Aalen's regression model and Cox regression model. For this reason, BMI is removed from the models. The removal of BMI resulted in most of the variables having significant effects except for CD4BL3 (corresponding with CD4 baseline between 350 and 500 cells/mm3), TBB4Enrl1 (individuals who had TB before initiation of antiretroviral drugs), and WHO stage baseline. This results in the variable TB before enrolment being removed from the models. In the third step, the variable WHO stage baseline is removed from both models.
Results from Aalen's additive model and Cox regression model that best describe the HIV progression of individuals on drug therapy are summarised in Table 4 below.
Table 4
Aalen's additive model and Cox regression model: Final models
Covariates | Aalen’s additive model | Cox regression model |
| Coef | RR | SE | z | p-value | Coef | RR | SE | z | p-value |
factor(Gender)1 | 0.086 | 1.090 | 0.050 | 2.56 | 0.011 | 0.959 | 2.609 | 0.351 | 2.74 | 0.006 |
Age | -0.003 | 0.997 | 0.002 | -1.46 | 0.145 | -0.029 | 0.971 | 0.015 | -1.87 | 0.062 |
factor(CD4BL)3 | -0.117 | 0.890 | 0.098 | -1.86 | 0.063 | -0.705 | 0.494 | 0.449 | -1.57 | 0.117 |
factor(CD4BL)4 | -0.147 | 0.863 | 0.091 | -2.80 | 0.005 | -1.349 | 0.259 | 0.400 | -3.37 | 0.000 |
factor(CD4BL)5 | -0.231 | 0.794 | 0.089 | -4.60 | 0.000 | -1.941 | 0.144 | 0.392 | -4.95 | 0.006 |
factor(DTB)1 | -0.097 | 0.908 | 0.040 | -2.35 | 0.019 | -0.909 | 0.403 | 0.317 | -2.86 | 0.004 |
Key: Coef = coefficient; SE = standard error; z = z score or z statistic or critical value, RR = relative risk |
Removal of the WHO stage baseline results in most of the variables being significant except for Age and CD4BL3 (corresponding to individuals who initiated treatment with a CD4 baseline between 350 and 500 cells/mm3). The p-values of the coefficients are 0.011, 0.145, 0.063, 0.005, 0.000, and 0.019 for gender, age, CD4 baseline 3, CD4 baseline 4 Cd4 baseline 5, and development TB respectively for Aalen’s additive regression model. This shows that there is an indication of some possible effect of some early commencing of ART (CD4BL3 = between 350 and 500 cells/mm3) and developing TB to HIV progression at 10% level. There is a significant effect for starting ART with CD4 baseline 4 and 5 (CD4 cell counts below 350 cells/mm3). The effect of gender is also significant to HIV progression at a 5% level.
For the Cox proportional hazards model, the p-values for the coefficients are 0.006, 0.062, 0.117, 0.000, 0.006, 0.004 for gender, age, CD4 baseline 3, CD4 baseline 4, CD4 baseline 5, and development TB (DTB1) respectively. This shows that there is some age effect on HIV progression at the 10% level of significance. There are also significant effects for gender, CD4 baseline 4, CD4 baseline 5, and DTB at a 5% level of significance.
For Aalen’s additive regression model, the coefficient associated with gender is -0.086 with a standard error of 0.050 and hence a z-statistic of 2.56, giving a significant p-value of 0.011. The associated relative risk (RR) is 1.090, implying that males have a 2.56 times greater risk of immune deterioration than their female counterparts. The coefficient associated with CD4BL3 is -0.117 with a standard error of 0.098 and hence a z-statistic of -1.86, giving a significant p-value of 0.063. The associated relative risk (RR) is 0.890, implying that patients starting ART with a CD4 baseline between 350 and 500 cells/mm3 have 0.890 times less risk of immune deterioration than patients who start ART with a CD4 baseline between 500 and 750 cells/mm3.
The coefficient associated with CD4BL4 is -0.147 with a standard error of 0.091 and hence a z-statistic of -2.80, giving a significant p-value of 0.005. The associated relative risk (RR) is 0.863, implying that patients starting ART with a CD4 baseline between 200 and 350 cells/mm3 have 0.863 times less risk of immune deterioration than patients who start ART with a CD4 baseline between 500 and 750 cells/mm3. The coefficient associated with CD4BL5 is -0.231 with a standard error of 0.089 and hence a z-statistic of -4.60, giving a significant p value of 0.000. The associated relative risk (RR) is 0.794, implying that patients starting ART with a CD4 baseline below 200 cells/mm3 have 0.794 times less risk of immune deterioration than patients who start ART with a CD4 baseline between 500 and 750 cells/mm3. The coefficient associated with DTB1 is -0.097 with a standard error of 0.040 and hence a z-statistic of -2.35, giving a significant p-value of 0.019. The associated relative risk (RR) is 0.908, implying that patients who develop TB and get treated have 0.908 times less risk of immune deterioration than patients who did not develop TB.
The results for the Cox proportional hazards model are almost similar with the results from Aalen’s additive model with some differences in the magnitudes. The other difference is that whereas CD4BL3 contributes significantly when Aalen’s model is used, it does not have any significant effect when the Cox proportional model is used.
Assessment of the fitted model
Goodness-of-fit tests were performed for the final models for both the Cox regression model and Aalen's additive model. The results are presented in the next sections.
Test for significance of Cox regression model
Test for the significance of the final Cox regression model is performed using the test statistics; likelihood ratio test, Wald test, and Score (logrank) test. The results are shown in Table 5 below. A further test is also performed using the Schoenfeld residual plots for the variables in the final model.
Table 5
Test for the significance of the final Cox regression model
Statistic | Test statistic | Degrees of freedom | p-value |
Likelihood ratio test | 30.48 | 6 | 0.00003185 |
Wald test | 32.79 | 6 | 0.00001152 |
Score (logrank) test | 36.55 | 6 | 0.000002159 |
Concordance = 0.686 (standard error = 0.038 ); R-square = 0.216 (max possible = 1 ) |
The overall fit for the final Cox regression model is statistically significant with p-values 0.00003184, 0.00001152, and 0.000002159 for the Likelihood ratio test, Wald test, and the score test, respectively. The likelihood ratio test for the final model is 30.48 (p-value = 0.00003185) which is higher than the likelihood ratio test = 26.52 (p-value = 0.003104) for the first model with a full set of variables. This indicates that the last model is preferable compared to the other models.
Tests for the proportional hazards assumption are done for each of the covariates together with a global test for the whole model. This test is based on the scaled Schoenfeld residuals. The scaled Schoenfeld residuals is the difference between the covariate at the failure time and the expected value of the covariate at this time. The results are shown in Table 6 below:
Table 6
Test for proportional hazards assumption
covariates | rho | chisq | p-value |
factor(Age) | 0.15211 | 3.93477 | 0.0473 |
factor(Gender)1 | -0.00624 | 0.00623 | 0.9371 |
factor(CD4BL)3 | -0.08836 | 1.12351 | 0.2892 |
factor(CD4BL)4 | 0.08034 | 0.85007 | 0.3565 |
factor(CD4BL)5 | 0.06781 | 0.59293 | 0.4413 |
factor(DvpTB)1 | 0.10341 | 1.86523 | 0.1720 |
Global | NA | 8.84077 | 0.1827 |
Key: rho = correlation coefficient, chisq = chi-square statistic |
The results in Table 6 give strong evidence that the only variable displaying a significant deviation from the proportional hazards assumption is the age variable which has a p-value = 0.0473.
Figure 1a-f below shows the plots of the Schoenfeld residuals for the variables in the final Cox regression model. The plot of the Schoenfeld residuals is a useful diagnostic tool. A non-zero slope (\(\:\beta\:\left(t\right)\) will be a horizontal line) is an indication of the violation of the proportional hazards assumption.
The plots in Fig. 1b, 1d, 1e and 1f are relatively horizontal. This indicates that the proportional hazard assumption is satisfied for gender (Fig. 1b), CD4 baseline category 4 (starting ART when CD4cell count is between 200 and 350 (Fig. 1d)) and 5 (CD4cell count is below 200 (Fig. 1e)), and for patients who developed TB during the course of treatment (Fig. 1f). Figure 1a and 1c show a slightly increasing and decreasing trend, respectively. Thus, it seems there is not exactly satisfaction of the proportional hazard assumption for age (Fig. 1a) and patient in the CD4 cell count baseline category 3 (Cd4 baseline between 350 and 500 (Fig. 1c)).
Test for significance of Aalen’s additive model
Test for non-significance in Table 7 and the time-invariant effects were also performed for the final Aalen’s additive model and the results are presented below:
Table 7
Test for the significance of the final Aalen’s additive regression model
Test for non-significant effects: | Test for time-invariant effects: |
Supremum-test of sig | p-value H0: \(\:{\beta\:}_{j}\left(t\right)=0\) | Kolmogorov-Smirnov test | p-value H0: constant effect | Cramer von Mises test | p-value H0: constant effect |
\(\:{\beta\:}_{0}\left(t\right)\)=8.27 | 0.000 | \(\:{\beta\:}_{0}\left(t\right)\)=3.2 | 0.000 | \(\:{\beta\:}_{0}\left(t\right)\)=58.2 | 0.000 |
The results still maintain that the baseline rate is significant and time-varying as was shown by the first output containing all the variables.
Figure 2a-f below shows the cumulative regression functions for the final Aalen's additive model with variables, Age, WHO stage baseline, Gender, CD4 baseline, and development of TB during treatment as well as for the intercept.
confidence intervals based on Aalen’s additive model
Figure 2 indicates that the estimates of cumulative regression function for patient age (Fig. 2b), WHOSBL (Fig. 2f) are constant at a level of zero, hence the removal of these covariates except for age which was left because of its epidemiological importance. The estimated cumulative regression function plot for gender (Fig. 2c) increased rapidly after t = 4. The estimated cumulative regression function plot for DTB (Fig. 2e) and CD4BL (Fig. 2d) decreased slowly after t = 4. This could be an indication that the effectiveness of anti-retroviral therapy is notable after 4 six month periods (2 years) of treatment.