Skip to main content

Survival analysis in Python, including Kaplan Meier, Nelson Aalen and regression

Project description

What is survival analysis and why should I learn it? Survival analysis was originally developed and applied heavily by the actuarial and medical community. Its purpose was to answer why do events occur now versus later under uncertainity (where events might refer to deaths, disease remission, etc.). This is great for researchers who are interested in measuring lifetimes: they can answer questions like what factors might influence deaths?

But outside of medicine and actuarial science, there are many other interesting and exciting applications of this lesser-known technique, for example: - SaaS providers are interested in measuring customer lifetimes, or time to first behaviours. - sociologists are interested in measure political parties lifetimes, or relationships, or marriages - Businesses are interested in what variables affect lifetime value

lifelines is a pure Python implementation of the best parts of survival analysis. We’d love to hear if you are using lifelines, please ping me at [@cmrn_dp](https://twitter.com/Cmrn_DP) and let me know your thoughts on the library.

Installation:

Dependencies:

The usual Python data stack: Numpy, Scipy, Pandas (a modern version please), Matplotlib

Installing

You can install lifelines using

pip install lifelines

Or getting the bleeding edge version with:

pip install git+https://github.com/CamDavidsonPilon/lifelines.git

or upgrade with

pip install --upgrade git+https://github.com/CamDavidsonPilon/lifelines.git

from the command line.

Intro to lifelines and survival analysis

Situation: 500 random individuals are born at time 0, currently it is time 12, so we have possibly not observed all death events yet.

# Create lifetimes, but censor all lifetimes after time 12
censor_after = 12
actual_lifetimes = np.random.exponential(10, size=500)
observed_lifetimes = np.minimum( actual_lifetimes, censor_after*np.ones(500) )
C = (actual_lifetimes < censor_after) #boolean array

Non-parametrically fit the survival curve:

from lifelines import KaplanMeierFitter

kmf = KaplanMeierFitter()
kmf.fit(observed_lifetimes, event_observed=C)

# fitter methods have an internal plotting method.
# plot the curve with the confidence intervals
kmf.plot()
kmf

It looks like 50% of all individuals are dead before time 7.

print kmf.survival_function_.head()

time            KM-estimate
0.000000        1.000
0.038912        0.998
0.120667        0.996
0.125719        0.994
0.133778        0.992

Non-parametrically fit the cumulative hazard curve:

from lifelines import NelsonAalenFitter

naf = NelsonAalenFitter()
naf.fit(observed_lifetimes, event_observed=C)

#plot the curve with the confidence intervals
naf.plot()
naf
print naf.cumulative_hazard_.head()

time       NA-estimate
0.000000     0.000000
0.038912     0.002000
0.120667     0.004004
0.125719     0.006012
0.133778     0.008024

Compare two populations using the logrank test:

from lifelines.statistics import logrank_test
other_lifetimes = np.random.exponential(3, size=500)

summary, p_value, results = logrank_test(observed_lifetimes, other_lifetimes, alpha=0.95)
print summary


Results
   df: 1
   alpha: 0.95
   t 0: -1
   test: logrank
   null distribution: chi squared

   __ p-value ___|__ test statistic __|__ test results __
         0.00000 |              268.465 |     True

(Less Quick) Intro to lifelines and survival analysis

If you are new to survival analysis, wondering why it is useful, or are interested in lifelines examples and syntax, please check out the Documentation and Tutorials page

Alternatively, you can use the IPython notebooks tutorials, located in the main directory of the repo:

  1. Introduction to survival analysis

  2. Using lifelines on real data

More examples

There are some IPython notebook files in the repo, and you can view them online here.

lifelines

License

The Feedback MIT License (FMIT)

Copyright (c) 2013, Cameron Davidson-Pilon

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

  1. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

  2. Person obtaining a copy must return feedback to the authors.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

lifelines logo designed by Pulse designed by TNS from the Noun Project

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lifelines-0.4.2.tar.gz (391.9 kB view details)

Uploaded Source

Built Distributions

lifelines-0.4.2-cp34-none-win_amd64.whl (398.5 kB view details)

Uploaded CPython 3.4 Windows x86-64

lifelines-0.4.2-cp34-none-win32.whl (399.1 kB view details)

Uploaded CPython 3.4 Windows x86

lifelines-0.4.2-cp33-none-win_amd64.whl (398.5 kB view details)

Uploaded CPython 3.3 Windows x86-64

lifelines-0.4.2-cp33-none-win32.whl (399.0 kB view details)

Uploaded CPython 3.3 Windows x86

lifelines-0.4.2-cp27-none-win_amd64.whl (398.5 kB view details)

Uploaded CPython 2.7 Windows x86-64

lifelines-0.4.2-cp27-none-win32.whl (399.0 kB view details)

Uploaded CPython 2.7 Windows x86

File details

Details for the file lifelines-0.4.2.tar.gz.

File metadata

  • Download URL: lifelines-0.4.2.tar.gz
  • Upload date:
  • Size: 391.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for lifelines-0.4.2.tar.gz
Algorithm Hash digest
SHA256 ea2e49829eb8ffb5b810a77836e68ecdbfd7a072cdd5038a55d3c3d994db6d65
MD5 ea699d1ac556b6360226fc595318f6c8
BLAKE2b-256 bf85c3e77db763c30ecf236ac215d5899c4b0a43c00b191094c4d746c5f18aa9

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp34-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp34-none-win_amd64.whl
Algorithm Hash digest
SHA256 5c1f776db377ea9b67a6437eca661a4c9350dfd37372e538786d9e8c359a8776
MD5 f3f9f9043b39b411a9e8e81752e3bafe
BLAKE2b-256 b9cfd0dd9a02d8613700a5a507fcbd411f1064a0e55a00e790039978a986742a

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp34-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp34-none-win32.whl
Algorithm Hash digest
SHA256 3a1e075547500efcd8ec4569d953373239f5533acd24556b0b6cc961b29bc338
MD5 565c6de01f9c3f02d81765892d7c0f08
BLAKE2b-256 67cbcb4b0d3803b825e06ce5757e61d7e6b0440a1f32e4c09fbae545fb15e061

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp33-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp33-none-win_amd64.whl
Algorithm Hash digest
SHA256 5c6afcd65b7a79058811d798b02a19a2351dde8564db2435204fe9a4e196bbdd
MD5 952530228d27f89c16367189a05aae24
BLAKE2b-256 1f6b1242c331dbff60cd5f89efcc512369b62e8cae4182b2b1b58e940c8d80f1

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp33-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp33-none-win32.whl
Algorithm Hash digest
SHA256 eded68f03d4d59efcd9bb9b33d9d7de1ed0097c9030d38d84b484a884936bb3c
MD5 ef2cb09c9dbf8b8936802943e48b7461
BLAKE2b-256 0dbe399def9afad3349acd0e82116f883f353b9eb96c9ddd61193d594053ce07

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp27-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp27-none-win_amd64.whl
Algorithm Hash digest
SHA256 5e9dbe6bf2fb15a996ef0ff5ea246281e4a73c824489a5a42084ba653478b083
MD5 5a2d7c0e3c1d98790c60cd09079722ad
BLAKE2b-256 d024c52603a82c3e79c66d750dec6565ea9a5693f4a03269934c4583606b9e68

See more details on using hashes here.

File details

Details for the file lifelines-0.4.2-cp27-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.4.2-cp27-none-win32.whl
Algorithm Hash digest
SHA256 aa16e4dbbfaf41a67eed7457f589bfb0d3d7ba586191791ec1ac4a2094953eb4
MD5 038870470ad2aa200eeb20858ef3ed5a
BLAKE2b-256 3c35870e158b15428f72f33137b18d8b56c8d7fc907f3993ef89cadb2da8b03f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page