Survival Analysis - Notes and Technical Information
| Var_1: | Var_2: | Var_3: |
| Interval Start | No. of Obs. Censored | No. of Obs. Failed |
| 0 | 10 | 185 |
| 1 | 10 | 88 |
| 2 | 10 | 55 |
| 3 | 10 | 43 |
| 4 | 14 | 32 |
| 5 | 52 | 31 |
| 6 | 38 | 20 |
| 7 | 24 | 7 |
| 8 | 25 | 6 |
| 9 | 24 | 6 |
On the Life Table & Distribution of Survival Times - Table of survival times tab, specify the three variables in exactly this order (1-3) in the variable selection window (click the Variables button to select the variables). Then, enter the number of observations at the beginning of the study (that entered the first interval) by the Number entering first interval check box.
Fitting Theoretical Survival Distribution. The regression procedure for fitting the four theoretical distributions to the life table is based on algorithms proposed by Kennedy and Gehan (1971), and discussed in detail in Lee (1980). Basically, the hazard functions (specifically, the logarithmic transforms of the hazard functions) of all four theoretical distributions are linear functions of the survival times (or log-survival times). Thus, the hazard functions [h(t)] may be expressed in terms of linear regression functions as:
- h(t) = L Exponential
- h(t) = L'g*tg-1 Weibull (where L' = Lg)
- h(t) = exp(L+g*t) Gompertz
- h(t) = L+g*t Linear exponential
If we set y=h(t) or y=log h(t), then all four models can be stated in the general form:
y = a + b*x
WSS = S(wi(yi-a-b*xi)2)
Three different weights are used in the estimation:
wi = 1 (unweighted least squares)
wi = 1/vi
wi = ni*hi
where vi is the variance of the hazard estimate, and hi and ni are the interval width and number of observations exposed to risk in the i'th interval, respectively.
h(t,z) = h0(t)*exp(b'z)
In the above expression h(t,z) is the hazard rate, contingent on a particular covariate vector z; h0(t) is referred to as the baseline hazard, that is, it is the hazard rate when the values for all independent variables (that is, in z) are equal to zero; b is the vector of regression coefficients.
The program uses the Newton-Raphson method to maximize the simplified partial likelihood (proposed by Breslow, 1974):
L = Π exp(b'si)/[Σexp(b'zj)]d(i)
In this formula, d(i) stands for the number of cases observed to fail at time ti, si stands for the sum (over the d(i) cases observed to fail at time ti) of the covariates, zj stands for the covariate vector for a case j in the risk set at time ti. (The geometric sum Π is over all k distinct failure times, the regular sum Σ is over all cases jÎRi in the respective risk set Ri, that is, cases that are observed to fail at or after the respective failure time ti.)
To estimate the survival function S contingent on a particular covariate vector z, the algorithm uses the relationship:
S(t,z) = S0(t)exp(b'z)
In this formula, S0 is the baseline survival function which is independent of the covariates. Breslow (1974) proposed the following estimator of the baseline survival function:
S0(ti) = Π[1-(d/åexp(b'z))]
The Chi-square value is computed as a function of the log-likelihood for the model with all covariates (l1), and the log-likelihood of the model in which all covariates are forced to 0 (L0); specifically:
Chi-square = -2*(L0- L1)
Estimating the Parameters for Time-Dependent Covariates. To accommodate time-dependent covariates, the following modification to the simplified partial likelihood (Breslow, 1974) is introduced (see Lawless, 1982, page 393):
L = Π exp(b'si(ti))/Σ[exp(b'zj(ti))]d(i)
In this formula, si(ti) stands for the sum of the transformed (time-dependent) covariates observed to fail at a particular time ti, and zi(ti) stands for the transformed (time-dependent) covariate vector for a case j in the risk set at time ti, and d(i) stands for the number of cases observed to fail at time ti. (The geometric sum P is over all cases, the regular S is over all cases in the respective risk set, that is, cases that are observed to fail at or after the respective failure time ti.)
This partial likelihood function is minimized using the Newton-Raphson method. Note that in order to compute the likelihood for a given set of parameters, for each case i, all cases with survival times greater than or equal to that of case i have to be processed. Thus, fitting models with time-dependent covariates may require extensive computations, particularly when there are many cases in the datafile.
Wald statistic. The results spreadsheet with the parameter estimates for the Cox proportional hazard regression model includes the so-called Wald statistic, and the p-value for that statistic. This statistic is based on the asymptotic normality property of maximum likelihood estimates, and is computed as:
W = b * 1/Var(b) * b
In this formula , b stands for the parameter estimates, and Var(b) stands for the asymptotic variance of the parameter estimates. The Wald statistic is tested against the Chi-square distribution.
L(z) = exp(a + b*z)
The Chi-square value is computed as usual, that is, as a function of the log-likelihood for the model with all covariates (L1), and the log-likelihood of the model in which all covariates are forced to 0 (zero; L0).
ai= S(N-j+1) - 1
If the exponentiality assumption is met, all points in this plot should be arranged roughly in a straight line.
ti = a + zi * b
where zi is the covariate vector for subject i, and a is a constant. If lognormal regression is requested, ti is replaced by its natural logarithm.
The normal regression model is particularly useful because many data sets can be transformed to yield approximations of the normal distribution. Thus, in a sense this is the most general fully parametric model (as opposed to Cox's proportional hazard model which is non-parametric), and estimates can be obtained for a variety of different underlying survival distributions (see Schneider, 1986, for a discussion of the normal distribution and its transforms).