Skip to main content

Survival analysis in Python, including Kaplan Meier, Nelson Aalen and regression

Project description

What is survival analysis and why should I learn it? Survival analysis was originally developed and applied heavily by the actuarial and medical community. Its purpose was to answer why do events occur now versus later under uncertainity (where events might refer to deaths, disease remission, etc.). This is great for researchers who are interested in measuring lifetimes: they can answer questions like what factors might influence deaths?

But outside of medicine and actuarial science, there are many other interesting and exciting applications of this lesser-known technique, for example: - SaaS providers are interested in measuring customer lifetimes, or time to first behaviours. - sociologists are interested in measure political parties lifetimes, or relationships, or marriages - Businesses are interested in what variables affect lifetime value

lifelines is a pure Python implementation of the best parts of survival analysis. We’d love to hear if you are using lifelines, please ping me at [@cmrn_dp](https://twitter.com/Cmrn_DP) and let me know your thoughts on the library.

Installation:

Dependencies:

The usual Python data stack: Numpy, Scipy, Pandas (a modern version please), Matplotlib

Installing

You can install lifelines using

pip install lifelines

Or getting the bleeding edge version with:

pip install git+https://github.com/CamDavidsonPilon/lifelines.git

or upgrade with

pip install --upgrade git+https://github.com/CamDavidsonPilon/lifelines.git

from the command line.

Intro to lifelines and survival analysis

Situation: 500 random individuals are born at time 0, currently it is time 12, so we have possibly not observed all death events yet.

# Create lifetimes, but censor all lifetimes after time 12
censor_after = 12
actual_lifetimes = np.random.exponential(10, size=500)
observed_lifetimes = np.minimum( actual_lifetimes, censor_after*np.ones(500) )
C = (actual_lifetimes < censor_after) #boolean array

Non-parametrically fit the survival curve:

from lifelines import KaplanMeierFitter

kmf = KaplanMeierFitter()
kmf.fit(observed_lifetimes, event_observed=C)

# fitter methods have an internal plotting method.
# plot the curve with the confidence intervals
kmf.plot()
kmf

It looks like 50% of all individuals are dead before time 7.

print kmf.survival_function_.head()

time            KM-estimate
0.000000        1.000
0.038912        0.998
0.120667        0.996
0.125719        0.994
0.133778        0.992

Non-parametrically fit the cumulative hazard curve:

from lifelines import NelsonAalenFitter

naf = NelsonAalenFitter()
naf.fit(observed_lifetimes, event_observed=C)

#plot the curve with the confidence intervals
naf.plot()
naf
print naf.cumulative_hazard_.head()

time       NA-estimate
0.000000     0.000000
0.038912     0.002000
0.120667     0.004004
0.125719     0.006012
0.133778     0.008024

Compare two populations using the logrank test:

from lifelines.statistics import logrank_test
other_lifetimes = np.random.exponential(3, size=500)

summary, p_value, results = logrank_test(observed_lifetimes, other_lifetimes, alpha=0.95)
print summary


Results
   df: 1
   alpha: 0.95
   t 0: -1
   test: logrank
   null distribution: chi squared

   __ p-value ___|__ test statistic __|__ test results __
         0.00000 |              268.465 |     True

(Less Quick) Intro to lifelines and survival analysis

If you are new to survival analysis, wondering why it is useful, or are interested in lifelines examples and syntax, please check out the Documentation and Tutorials page

Alternatively, you can use the IPython notebooks tutorials, located in the main directory of the repo:

  1. Introduction to survival analysis

  2. Using lifelines on real data

More examples

There are some IPython notebook files in the repo, and you can view them online here.

lifelines

License

The Feedback MIT License (FMIT)

Copyright (c) 2013, Cameron Davidson-Pilon

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

  1. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

  2. Person obtaining a copy must return feedback to the authors.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

lifelines logo designed by Pulse designed by TNS from the Noun Project

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lifelines-0.4.3.tar.gz (628.8 kB view details)

Uploaded Source

Built Distributions

lifelines-0.4.3-cp34-none-win_amd64.whl (398.9 kB view details)

Uploaded CPython 3.4 Windows x86-64

lifelines-0.4.3-cp34-none-win32.whl (399.5 kB view details)

Uploaded CPython 3.4 Windows x86

lifelines-0.4.3-cp33-none-win_amd64.whl (398.9 kB view details)

Uploaded CPython 3.3 Windows x86-64

lifelines-0.4.3-cp33-none-win32.whl (399.5 kB view details)

Uploaded CPython 3.3 Windows x86

lifelines-0.4.3-cp27-none-win_amd64.whl (399.0 kB view details)

Uploaded CPython 2.7 Windows x86-64

lifelines-0.4.3-cp27-none-win32.whl (399.5 kB view details)

Uploaded CPython 2.7 Windows x86

File details

Details for the file lifelines-0.4.3.tar.gz.

File metadata

  • Download URL: lifelines-0.4.3.tar.gz
  • Upload date:
  • Size: 628.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for lifelines-0.4.3.tar.gz
Algorithm Hash digest
SHA256 7928853536edb2624977704e80d206f61f5b52ef9ed4a4cd1137dbeb8b919b68
MD5 85c10f17e6da3d0448a44bbe1c86dcee
BLAKE2b-256 cbbbf2cedda9ba438828af730c7c29c8a9ac3c99ede6e38f4bac1ed87b6008d4

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp34-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp34-none-win_amd64.whl
Algorithm Hash digest
SHA256 f7d5d7b74863486f0f2331d98a6da4db82f4f2ad575b3d146eca95ea4ef4a2ff
MD5 650eb86abc3d6ffcd3c265ea76c5ed5f
BLAKE2b-256 26c58208c45e368d5ad38f3fe4baac36dd8ada4c3935a9962f6dff0c3d68036c

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp34-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp34-none-win32.whl
Algorithm Hash digest
SHA256 31f7f02a7af9ee4e60667251954e33bb18a90540d73fdd5bb0cfa6f31610fade
MD5 cf239a60ebade11f0bf9c45950778eea
BLAKE2b-256 f7589fa315c7153104005c99c307fe20e8abf7ef514527a33895541be9e3cea2

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp33-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp33-none-win_amd64.whl
Algorithm Hash digest
SHA256 5e74b7840083dd65b63cc160e7e1fafc2b6b13bf93ba42b304dfeadb19eefc83
MD5 ece26f42843b3a0238241dcdecb01b48
BLAKE2b-256 2212267d3caeaad722c04f8ebad5b00033cc88f58f7890e17ad136ee949bf82d

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp33-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp33-none-win32.whl
Algorithm Hash digest
SHA256 164d90a547bf53ae0c6650559557ac0fce3577db2299f4b94503bafe0253acde
MD5 825923c8954c96a0379c004c524fbb4e
BLAKE2b-256 539dbf7e2a188aeb74cdcea384377fe512b2118cf249d75b932b773f9bbd5313

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp27-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp27-none-win_amd64.whl
Algorithm Hash digest
SHA256 493d775f8aab9cef5a49773b9e228b36f67db6d70bf8af86b69fd823c3fdcda3
MD5 99b01bd7018c7aaf99dbfb9889ed33f2
BLAKE2b-256 012781ff8e08c8bb21f227080aceae026f97a5cd7e6db22cef63e1c191c3ac52

See more details on using hashes here.

File details

Details for the file lifelines-0.4.3-cp27-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.3-cp27-none-win32.whl
Algorithm Hash digest
SHA256 856de90feffc035a32a17db0918b8a9a839282d49ae8ec3e5955ec1d3f3ec50f
MD5 85a04520c03827438c718aefdb1b3123
BLAKE2b-256 3761e254ef22046f8dc78c64b969cd43caf88d63b0fe9531c4b642fefc75af47

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page