
Survival analysis in Python, including Kaplan-Meier, Nelson-Aalen and regression

Project description

What is survival analysis and why should I learn it? Survival analysis was originally developed and applied heavily by the actuarial and medical communities. Its purpose was to answer why events occur now versus later under uncertainty (where events might refer to deaths, disease remission, etc.). This is great for researchers who are interested in measuring lifetimes: they can answer questions like "what factors might influence deaths?"

But outside of medicine and actuarial science, there are many other interesting and exciting applications of this lesser-known technique, for example:

- SaaS providers are interested in measuring customer lifetimes, or time to first behaviours.
- Sociologists are interested in measuring the lifetimes of political parties, relationships, or marriages.
- Businesses are interested in what variables affect lifetime value.

lifelines is a pure Python implementation of the best parts of survival analysis. We’d love to hear if you are using lifelines; please ping me at [@cmrn_dp](https://twitter.com/Cmrn_DP) and let me know your thoughts on the library.

Installation:

Dependencies:

The usual Python data stack: Numpy, Scipy, Pandas (a modern version please), Matplotlib

Installing

You can install lifelines using

pip install lifelines

Or get the bleeding-edge version with:

pip install git+https://github.com/CamDavidsonPilon/lifelines.git

or upgrade with

pip install --upgrade git+https://github.com/CamDavidsonPilon/lifelines.git

from the command line.

Intro to lifelines and survival analysis

Situation: 500 random individuals are born at time 0. It is currently time 12, so we may not have observed every death event yet.

import numpy as np

# Create lifetimes, but censor all lifetimes after time 12
censor_after = 12
actual_lifetimes = np.random.exponential(10, size=500)
observed_lifetimes = np.minimum(actual_lifetimes, censor_after)
C = (actual_lifetimes < censor_after)  # boolean array: True if the death was observed
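To see why the censoring flag matters, the following plain-Python sketch (standard library only, not part of lifelines) repeats the simulation with a larger sample. The helper name `simulate_censored`, the seed and the sample size are assumptions made for illustration. It compares the naive average of the observed lifetimes with the standard censoring-aware estimate for exponential data: total observed time divided by the number of observed deaths.

```python
import math
import random


def simulate_censored(n, mean_lifetime, censor_after, seed=0):
    """Return (observed_lifetimes, death_observed) for n simulated individuals."""
    rng = random.Random(seed)
    observed, death_observed = [], []
    for _ in range(n):
        actual = rng.expovariate(1.0 / mean_lifetime)
        # We only see the lifetime up to the censoring time.
        observed.append(min(actual, censor_after))
        death_observed.append(actual < censor_after)
    return observed, death_observed


observed, death_observed = simulate_censored(100000, 10.0, 12.0)

# Fraction still alive at time 12; theory says exp(-12/10), about 0.30.
censored_fraction = 1.0 - sum(death_observed) / float(len(death_observed))

# Ignoring censoring underestimates the true mean lifetime of 10.
naive_mean = sum(observed) / len(observed)

# Censoring-aware estimate for exponential lifetimes:
# total time at risk divided by the number of observed deaths.
adjusted_mean = sum(observed) / sum(death_observed)

print(censored_fraction, math.exp(-12.0 / 10.0))
print(naive_mean, adjusted_mean)
```

The naive mean should land well below 10 because every censored individual contributes only 12 time units, while the adjusted estimate should sit close to the true mean.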

Non-parametrically fit the survival curve:

from lifelines import KaplanMeierFitter

kmf = KaplanMeierFitter()
kmf.fit(observed_lifetimes, event_observed=C)

# fitters have a built-in plotting method:
# plot the curve with the confidence intervals
kmf.plot()

It looks like 50% of all individuals are dead before time 7. This agrees with the theoretical median of the simulated exponential lifetimes, 10 ln 2 ≈ 6.93.

print(kmf.survival_function_.head())

time            KM-estimate
0.000000        1.000
0.038912        0.998
0.120667        0.996
0.125719        0.994
0.133778        0.992
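To make the numbers in this table concrete, here is a minimal sketch of the textbook Kaplan-Meier (product-limit) calculation in plain Python. It is not the lifelines implementation, and the function name `kaplan_meier` is hypothetical. At each death time the running survival estimate is multiplied by one minus the fraction of at-risk individuals who died.

```python
def kaplan_meier(durations, event_observed):
    """Return [(time, survival_estimate)] at each distinct observed death time."""
    # Sort subjects by duration so the risk set shrinks as we move forward.
    data = sorted(zip(durations, event_observed))
    at_risk = len(data)
    survival = 1.0
    curve = [(0.0, 1.0)]
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group every subject whose duration equals t.
        while i < len(data) and data[i][0] == t:
            deaths += 1 if data[i][1] else 0
            leaving += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / float(at_risk)
            curve.append((t, survival))
        # Both deaths and censored subjects leave the risk set after t.
        at_risk -= leaving
    return curve


# Five subjects; the second subject at time 2 and the one at time 4 are censored.
for time, estimate in kaplan_meier([1, 2, 2, 3, 4], [True, True, False, True, False]):
    print(time, estimate)
```

With 500 individuals at risk, the first observed death multiplies the estimate by 1 - 1/500 = 0.998, which is the second row of the table above.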

Non-parametrically fit the cumulative hazard curve:

from lifelines import NelsonAalenFitter

naf = NelsonAalenFitter()
naf.fit(observed_lifetimes, event_observed=C)

# plot the curve with the confidence intervals
naf.plot()

print(naf.cumulative_hazard_.head())

time       NA-estimate
0.000000     0.000000
0.038912     0.002000
0.120667     0.004004
0.125719     0.006012
0.133778     0.008024
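The same bookkeeping gives the textbook Nelson-Aalen estimate: instead of multiplying survival fractions, it adds up deaths divided by the number at risk. The sketch below is plain Python under that assumption, with a hypothetical function name `nelson_aalen`; it does not reproduce any tie handling or smoothing that lifelines may apply.

```python
def nelson_aalen(durations, event_observed):
    """Return [(time, cumulative_hazard)] at each distinct observed death time."""
    data = sorted(zip(durations, event_observed))
    at_risk = len(data)
    cumulative_hazard = 0.0
    curve = [(0.0, 0.0)]
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Group every subject whose duration equals t.
        while i < len(data) and data[i][0] == t:
            deaths += 1 if data[i][1] else 0
            leaving += 1
            i += 1
        if deaths > 0:
            # Increment by the observed death rate among those still at risk.
            cumulative_hazard += deaths / float(at_risk)
            curve.append((t, cumulative_hazard))
        at_risk -= leaving
    return curve


# 500 subjects with distinct, fully observed death times 1, 2, ..., 500.
curve = nelson_aalen(list(range(1, 501)), [True] * 500)
print(curve[1:3])
```

The first two increments are 1/500 and 1/499, so the running totals are 0.002 and roughly 0.004004, the same values as the first rows after time 0 in the table above.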

Compare two populations using the logrank test:

from lifelines.statistics import logrank_test
other_lifetimes = np.random.exponential(3, size=500)

summary, p_value, results = logrank_test(observed_lifetimes, other_lifetimes, alpha=0.95)
print(summary)


Results
   df: 1
   alpha: 0.95
   t 0: -1
   test: logrank
   null distribution: chi squared

   __ p-value ___|__ test statistic __|__ test results __
         0.00000 |              268.465 |     True
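For readers curious where the test statistic comes from, here is a sketch of the standard two-sample logrank calculation in plain Python. It is an illustration, not the lifelines code: the function name `logrank_statistic` and its arguments are hypothetical, and it assumes every duration is an observed death when no event flags are passed. At each death time it compares the deaths observed in the first group with the number expected if both groups shared one survival curve, then refers the standardized difference to a chi-squared distribution with 1 degree of freedom.

```python
import math


def logrank_statistic(durations_a, durations_b, events_a=None, events_b=None):
    """Two-sample logrank test; returns (test_statistic, p_value)."""
    # Assumption of this sketch: no flags means every death was observed.
    if events_a is None:
        events_a = [True] * len(durations_a)
    if events_b is None:
        events_b = [True] * len(durations_b)

    death_times = sorted(
        set(t for t, e in zip(durations_a, events_a) if e)
        | set(t for t, e in zip(durations_b, events_b) if e)
    )

    observed_a = expected_a = variance = 0.0
    for t in death_times:
        # Subjects still under observation just before time t.
        n_a = sum(1 for d in durations_a if d >= t)
        n_b = sum(1 for d in durations_b if d >= t)
        n = n_a + n_b
        d_a = sum(1 for d, e in zip(durations_a, events_a) if e and d == t)
        d_b = sum(1 for d, e in zip(durations_b, events_b) if e and d == t)
        d = d_a + d_b

        observed_a += d_a
        expected_a += d * n_a / float(n)
        if n > 1:
            # Hypergeometric variance of the deaths in group A at time t.
            variance += n_a * n_b * d * (n - d) / float(n * n * (n - 1))

    if variance == 0:
        raise ValueError("not enough overlapping data to compute the test")
    statistic = (observed_a - expected_a) ** 2 / variance
    # Survival function of a chi-squared variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


statistic, p_value = logrank_statistic([1, 2, 3], [4, 5, 6])
print(statistic, p_value)
```

The risk sets are recounted at every death time, which keeps the arithmetic visible but makes the sketch quadratic in the sample size; that is acceptable for samples of a few hundred lifetimes like the ones above.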

(Less Quick) Intro to lifelines and survival analysis

If you are new to survival analysis, wondering why it is useful, or are interested in lifelines examples and syntax, please check out the Documentation and Tutorials page.

Alternatively, you can use the IPython notebook tutorials, located in the main directory of the repo:

  1. Introduction to survival analysis

  2. Using lifelines on real data

More examples

There are some IPython notebook files in the repo, and you can view them online here.


License

The Feedback MIT License (FMIT)

Copyright (c) 2013, Cameron Davidson-Pilon

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

  1. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

  2. Person obtaining a copy must return feedback to the authors.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

The lifelines logo is "Pulse", designed by TNS from the Noun Project.



