Skip to main content

Survival analysis in Python, including Kaplan Meier, Nelson Aalen and regression

Project description

What is survival analysis and why should I learn it? Survival analysis was originally developed and applied heavily by the actuarial and medical community. Its purpose was to answer why do events occur now versus later under uncertainity (where events might refer to deaths, disease remission, etc.). This is great for researchers who are interested in measuring lifetimes: they can answer questions like what factors might influence deaths?

But outside of medicine and actuarial science, there are many other interesting and exciting applications of this lesser-known technique, for example: - SaaS providers are interested in measuring customer lifetimes, or time to first behaviours. - sociologists are interested in measure political parties lifetimes, or relationships, or marriages - Businesses are interested in what variables affect lifetime value

lifelines is a pure Python implementation of the best parts of survival analysis. We’d love to hear if you are using lifelines, please ping me at [@cmrn_dp](https://twitter.com/Cmrn_DP) and let me know your thoughts on the library.

Installation:

Dependencies:

The usual Python data stack: Numpy, Scipy, Pandas (a modern version please), Matplotlib

Installing

You can install lifelines using

pip install lifelines

Or getting the bleeding edge version with:

pip install git+https://github.com/CamDavidsonPilon/lifelines.git

or upgrade with

pip install --upgrade git+https://github.com/CamDavidsonPilon/lifelines.git

from the command line.

Intro to lifelines and survival analysis

Situation: 500 random individuals are born at time 0, currently it is time 12, so we have possibly not observed all death events yet.

# Create lifetimes, but censor all lifetimes after time 12
censor_after = 12
actual_lifetimes = np.random.exponential(10, size=500)
observed_lifetimes = np.minimum( actual_lifetimes, censor_after*np.ones(500) )
C = (actual_lifetimes < censor_after) #boolean array

Non-parametrically fit the survival curve:

from lifelines import KaplanMeierFitter

kmf = KaplanMeierFitter()
kmf.fit(observed_lifetimes, event_observed=C)

# fitter methods have an internal plotting method.
# plot the curve with the confidence intervals
kmf.plot()
kmf

It looks like 50% of all individuals are dead before time 7.

print kmf.survival_function_.head()

time            KM-estimate
0.000000        1.000
0.038912        0.998
0.120667        0.996
0.125719        0.994
0.133778        0.992

Non-parametrically fit the cumulative hazard curve:

from lifelines import NelsonAalenFitter

naf = NelsonAalenFitter()
naf.fit(observed_lifetimes, event_observed=C)

#plot the curve with the confidence intervals
naf.plot()
naf
print naf.cumulative_hazard_.head()

time       NA-estimate
0.000000     0.000000
0.038912     0.002000
0.120667     0.004004
0.125719     0.006012
0.133778     0.008024

Compare two populations using the logrank test:

from lifelines.statistics import logrank_test
other_lifetimes = np.random.exponential(3, size=500)

summary, p_value, results = logrank_test(observed_lifetimes, other_lifetimes, alpha=0.95)
print summary


Results
   df: 1
   alpha: 0.95
   t 0: -1
   test: logrank
   null distribution: chi squared

   __ p-value ___|__ test statistic __|__ test results __
         0.00000 |              268.465 |     True

(Less Quick) Intro to lifelines and survival analysis

If you are new to survival analysis, wondering why it is useful, or are interested in lifelines examples and syntax, please check out the Documentation and Tutorials page

Alternatively, you can use the IPython notebooks tutorials, located in the main directory of the repo:

  1. Introduction to survival analysis

  2. Using lifelines on real data

More examples

There are some IPython notebook files in the repo, and you can view them online here.

lifelines

License

The Feedback MIT License (FMIT)

Copyright (c) 2013, Cameron Davidson-Pilon

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

  1. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

  2. Person obtaining a copy must return feedback to the authors.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

lifelines logo designed by Pulse designed by TNS from the Noun Project

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lifelines-0.4.1.1.tar.gz (391.5 kB view details)

Uploaded Source

Built Distributions

lifelines-0.4.1.1-cp34-none-win_amd64.whl (399.0 kB view details)

Uploaded CPython 3.4 Windows x86-64

lifelines-0.4.1.1-cp34-none-win32.whl (399.6 kB view details)

Uploaded CPython 3.4 Windows x86

lifelines-0.4.1.1-cp33-none-win_amd64.whl (399.0 kB view details)

Uploaded CPython 3.3 Windows x86-64

lifelines-0.4.1.1-cp33-none-win32.whl (399.6 kB view details)

Uploaded CPython 3.3 Windows x86

lifelines-0.4.1.1-cp27-none-win_amd64.whl (399.0 kB view details)

Uploaded CPython 2.7 Windows x86-64

lifelines-0.4.1.1-cp27-none-win32.whl (399.5 kB view details)

Uploaded CPython 2.7 Windows x86

File details

Details for the file lifelines-0.4.1.1.tar.gz.

File metadata

  • Download URL: lifelines-0.4.1.1.tar.gz
  • Upload date:
  • Size: 391.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for lifelines-0.4.1.1.tar.gz
Algorithm Hash digest
SHA256 2e63a5eed0003d9f8515d2f5aa5d5440aae4033664c19135372072b8a0a00fbb
MD5 c2cfe429a1496dfba60a179d59d337c6
BLAKE2b-256 0dfb8dd7781dcd31a972908b2ef8d09b5e01b31cf526ca7949c8c0e2f3fbf8f9

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp34-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp34-none-win_amd64.whl
Algorithm Hash digest
SHA256 75df7914f3afac969bd90f29340c3cc2b04cdc75402dee63252fab9f8deaff14
MD5 df60124584f121cd93c676fa43d6bfcb
BLAKE2b-256 bc1acd12ace8a0769f5f7686d8e1cb969bf609a1e94af48450d885b12aafe8bb

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp34-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp34-none-win32.whl
Algorithm Hash digest
SHA256 cd1ebc14a911311ae148ec2e0eadb02922b104530be1bba8a33a88f62276ef8d
MD5 24bcd824752d60ab7f9b3dbb07b6de51
BLAKE2b-256 59e988324209f65d8d0bc50ed212ff508dd69892dc0f499010c34d62daf2d33e

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp33-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp33-none-win_amd64.whl
Algorithm Hash digest
SHA256 26d3c2bd120aceb31218a866516feaa90739aacd65956a55441390dd66fe5269
MD5 537894287c59b1f7a4d6e85aaef3ba3c
BLAKE2b-256 c2bcd5bb69e1431ce60f4d79c0122bf169dde0bb1ff6e64d730d8d42ac275ca4

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp33-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp33-none-win32.whl
Algorithm Hash digest
SHA256 61246d8a208fe9669a5002768b0fc8c760b6502c8747bf6a26d58fc26c71757d
MD5 f2e3c9c4cfddbbb840a40de8adfee42c
BLAKE2b-256 f7c3d3f80c3d025d0564bc1d4d4d3feea7a1d64a40fc9328021820c2fe4093a3

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp27-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp27-none-win_amd64.whl
Algorithm Hash digest
SHA256 936f3cfeac9418e3daeca1b5ab21f8618982b783a9a3365d60c2dd476ef2b448
MD5 12029029c0b852611c4188de3b82ee5d
BLAKE2b-256 9933a1fe03804ec1b4bd6d823c29ed3686752407c1904eebbfab15b8183b46e5

See more details on using hashes here.

File details

Details for the file lifelines-0.4.1.1-cp27-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.1.1-cp27-none-win32.whl
Algorithm Hash digest
SHA256 7d683c0342edd4d60b8828a494b9e1d68d29487dafaf208c8742b71e3fa3fbb0
MD5 852b29524ce9bf8340e39fc88e42ca42
BLAKE2b-256 ace87c40266890392dc45f2509cca61c506f37685c3001ef9053cb2fe21f88a3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page