Definition
Kaplan-Meier Estimator

Consider a Random Censoring case . Assume that , and the distinct failure times are where . Let be the number of deaths at , and be the number of alive at where the set is called the risk set at . We only observe
The Kaplan-Meier estimator is derived from the expression where
General Case
As an estimator of , consider
The Kaplan-Meier estimator is defined with the estimated ‘s The cumulative hazard function is estimated by Nelson-Aalen Estimator in the same logic
No Ties Case
When there’s no tie in the observation, , then the failure times are equal to the observation , death is equal to the censoring indicator , and . Thus, the Kaplan-Meier estimator is defined as
Properties
Self-Consistency
An estimator is self-consistent if where
The Kaplan-Meier estimator is the unique self-consistent estimator for where is the largest observation.
Generalized MLE
The Kaplan-Meier estimator gives the Generalized Maximum Likelihood Estimation of the Survival Function .
Strong Consistency
The Kaplan-Meier estimator uniformly Almost Surely converges to
Proof
Consider a function and decompose it to the sum of the subsurvival functions and . where is the uncensored case and is the censored case.
Then, the survival function can be expressed as a function of the subsurvival functions.
Define the empirical subsurvival functions and . The Kaplan-Meier estimator also can be expressed as a function of the empirical subsurvival functions.
By Glivenko-Cantelli theorem, and for all . Since is a continuous function of and ,
Asymptotic Normality
Kaplan-Meier estimator has asymptotic normality. where , , and .
The variance of the estimator is estimated by Greenwood’s formula For the no ties case, the formula is
Examples
case where
Facts
Kaplan-Meier estimator has Self-Consistency and Asymptotic Normality, and it is generalized MLE
If no censoring, Kaplan-Meier estimator is just the Empirical Survival Function.