Cox Proportional Hazards Model Overview
The main characteristic of survival analysis that differentiates itself from other statistical or data-mining domains is that, methods in survival analysis are specifically designed to handle censored data.
A data point is considered censored, if the end point of interest is not observed for a particular individual. For this type of data, many modeling techniques are inappropriate, e.g., normal regression models.
The British statistician David Cox introduced the proportional hazards model in the 1972 paper, Regression Models and Life Tables, Journal of the Royal Statistical Society Series B 34 (2): 187-220. This statistical model, the Cox proportional hazards model, does not impose any specific form of the survivor function, allowing censored data to be modeled flexibly.
Specifically, Cox's proportional hazards model is a distribution-free model in which predictors are related to lifetime multiplicatively.
The form of the Cox proportional hazards model is as follows:
h(t|x) = h0(t) exp(xß)
This model has become popular in various domains whenever the dependent variable of interest represents the time to a terminal event, and the duration of study is limited in time.
Examples include: