Skip to main content

Survival analysis in Python, including Kaplan Meier, Nelson Aalen and regression

Project description

What is survival analysis and why should I learn it? Survival analysis was originally developed and applied heavily by the actuarial and medical community. Its purpose was to answer why do events occur now versus later under uncertainity (where events might refer to deaths, disease remission, etc.). This is great for researchers who are interested in measuring lifetimes: they can answer questions like what factors might influence deaths?

But outside of medicine and actuarial science, there are many other interesting and exciting applications of this lesser-known technique, for example: - SaaS providers are interested in measuring customer lifetimes, or time to first behaviours. - sociologists are interested in measure political parties lifetimes, or relationships, or marriages - Businesses are interested in what variables affect lifetime value

lifelines is a pure Python implementation of the best parts of survival analysis. We’d love to hear if you are using lifelines, please ping me at [@cmrn_dp](https://twitter.com/Cmrn_DP) and let me know your thoughts on the library.

Installation:

Dependencies:

The usual Python data stack: Numpy, Scipy, Pandas (a modern version please), Matplotlib

Installing

You can install lifelines using

pip install lifelines

Or getting the bleeding edge version with:

pip install git+https://github.com/CamDavidsonPilon/lifelines.git

or upgrade with

pip install --upgrade git+https://github.com/CamDavidsonPilon/lifelines.git

from the command line.

Intro to lifelines and survival analysis

Situation: 500 random individuals are born at time 0, currently it is time 12, so we have possibly not observed all death events yet.

# Create lifetimes, but censor all lifetimes after time 12
censor_after = 12
actual_lifetimes = np.random.exponential(10, size=500)
observed_lifetimes = np.minimum( actual_lifetimes, censor_after*np.ones(500) )
C = (actual_lifetimes < censor_after) #boolean array

Non-parametrically fit the survival curve:

from lifelines import KaplanMeierFitter

kmf = KaplanMeierFitter()
kmf.fit(observed_lifetimes, event_observed=C)

# fitter methods have an internal plotting method.
# plot the curve with the confidence intervals
kmf.plot()
kmf

It looks like 50% of all individuals are dead before time 7.

print kmf.survival_function_.head()

time            KM-estimate
0.000000        1.000
0.038912        0.998
0.120667        0.996
0.125719        0.994
0.133778        0.992

Non-parametrically fit the cumulative hazard curve:

from lifelines import NelsonAalenFitter

naf = NelsonAalenFitter()
naf.fit(observed_lifetimes, event_observed=C)

#plot the curve with the confidence intervals
naf.plot()
naf
print naf.cumulative_hazard_.head()

time       NA-estimate
0.000000     0.000000
0.038912     0.002000
0.120667     0.004004
0.125719     0.006012
0.133778     0.008024

Compare two populations using the logrank test:

from lifelines.statistics import logrank_test
other_lifetimes = np.random.exponential(3, size=500)

summary, p_value, results = logrank_test(observed_lifetimes, other_lifetimes, alpha=0.95)
print summary


Results
   df: 1
   alpha: 0.95
   t 0: -1
   test: logrank
   null distribution: chi squared

   __ p-value ___|__ test statistic __|__ test results __
         0.00000 |              268.465 |     True

(Less Quick) Intro to lifelines and survival analysis

If you are new to survival analysis, wondering why it is useful, or are interested in lifelines examples and syntax, please check out the Documentation and Tutorials page

Alternatively, you can use the IPython notebooks tutorials, located in the main directory of the repo:

  1. Introduction to survival analysis

  2. Using lifelines on real data

More examples

There are some IPython notebook files in the repo, and you can view them online here.

lifelines

License

The Feedback MIT License (FMIT)

Copyright (c) 2013, Cameron Davidson-Pilon

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

  1. The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

  2. Person obtaining a copy must return feedback to the authors.

THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

lifelines logo designed by Pulse designed by TNS from the Noun Project

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lifelines-0.5.0.0.tar.gz (519.5 kB view details)

Uploaded Source

Built Distributions

lifelines-0.5.0.0-cp34-none-win_amd64.whl (365.4 kB view details)

Uploaded CPython 3.4 Windows x86-64

lifelines-0.5.0.0-cp34-none-win32.whl (367.9 kB view details)

Uploaded CPython 3.4 Windows x86

lifelines-0.5.0.0-cp33-none-win_amd64.whl (365.4 kB view details)

Uploaded CPython 3.3 Windows x86-64

lifelines-0.5.0.0-cp33-none-win32.whl (367.9 kB view details)

Uploaded CPython 3.3 Windows x86

lifelines-0.5.0.0-cp27-none-win_amd64.whl (365.4 kB view details)

Uploaded CPython 2.7 Windows x86-64

lifelines-0.5.0.0-cp27-none-win32.whl (367.9 kB view details)

Uploaded CPython 2.7 Windows x86

File details

Details for the file lifelines-0.5.0.0.tar.gz.

File metadata

  • Download URL: lifelines-0.5.0.0.tar.gz
  • Upload date:
  • Size: 519.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for lifelines-0.5.0.0.tar.gz
Algorithm Hash digest
SHA256 0f63ff1ecfb08e6422db742b263f7c1f60aeeeea6aa8814fbe1b9a6597691a4b
MD5 44a61eefa0c13681ae25246a415b3ec8
BLAKE2b-256 b9110e99d8dd928860c8067b4ddd7d5fa93f7b1fd1c555455f00c327c242d308

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp34-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp34-none-win_amd64.whl
Algorithm Hash digest
SHA256 b791f57756f8b6d6763e18af004bb8f5ff995f5f24e4586656f82ce3a16cec10
MD5 9fef121ef0f27be5456b013a7489a7e9
BLAKE2b-256 3416d753d5079d351b63027840eef73eb9e360c3bf92831a3276a300dda91fde

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp34-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp34-none-win32.whl
Algorithm Hash digest
SHA256 ec443be5ce06b7c5f3b3d154b88e358242ce4b78549cf69869b80634f2778a3a
MD5 7dc0c8f0005aba71e3c660ee92c77bcc
BLAKE2b-256 5fea4006f6be1a5edf627a9e2852184a90176a4041faee076cd05a69cb1c7a1a

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp33-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp33-none-win_amd64.whl
Algorithm Hash digest
SHA256 1aa9d596215452eb35d87230321f9474344d12526553ad64f544bff95817e33e
MD5 15aa4b1e94501fb544d20376084d78ab
BLAKE2b-256 dd943896989e39643e1665a3bec3f613c275ff2b89118383b88dc7112d108217

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp33-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp33-none-win32.whl
Algorithm Hash digest
SHA256 7c0d8c514b0c6bd8946a0cadf10e96c7e1545ac0c56fd777830b3b2225e515a5
MD5 f4406e2b45388ed1f18eeccdf40f2120
BLAKE2b-256 35bcdb397a119aaa222d7561270af93f597b5c2acf5c73928cf370d28351c77c

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp27-none-win_amd64.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp27-none-win_amd64.whl
Algorithm Hash digest
SHA256 726df2dd955467d5e71884cc59a4da06cf953c6ecf0e711ec7c67219e10a4714
MD5 c9d17f9973225e17e25d4ae45433ae88
BLAKE2b-256 b346536f8d86dab14041042101a235036ee3dc4c52330f9d5bb79888bcac59ae

See more details on using hashes here.

File details

Details for the file lifelines-0.5.0.0-cp27-none-win32.whl.

File metadata

File hashes

Hashes for lifelines-0.5.0.0-cp27-none-win32.whl
Algorithm Hash digest
SHA256 ffc385f2f80541774fb1fd4ddaa0786eeb1f3527b6e3a2798aca2faed0f7d5dc
MD5 d4ddc3f8d585b63c7261db2ef74e59f8
BLAKE2b-256 d1521986e8b63b2f20c2725918d291ae7e1c1699f54bf1c80dc3da4e47b3d963

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page