Generalized additive models in Python.
Project description
generalized-additive-models
Generalized Additive Models (GAMs) in Python.
About
GAMs are uniquely placed on the interpretability vs. precitive power continuum. In many applications they perform almost as well as more complex models, but are extremely interpretable.
GAMs extend linear regression by allowing non-linear relationships between features and the target.
The model is still additive, but link functions and multivariate splines facilitate a broad class of models.
While GAMs are likely outperformed by non-additive models (e.g. boosted trees), GAMs are extremely interpretable.
Read more about GAMs:
A GAM is a statistical model in which the target variable depends on unknown smooth functions of the features, and interest focuses on inference about these smooth functions.
An exponential family distribution is specified for the target Y (.e.g Normal, Binomial or Poisson) along with a link function g (for example the identity or log functions) relating the expected value of Y to the predictor variables.
Installation
Install using pip:
pip install generalized-additive-models
Example
from sklearn.datasets import load_diabetes
from sklearn.model_selection import cross_val_score
from generalized_additive_models import GAM, Spline, Categorical
# Load data
data = load_diabetes(as_frame=True)
df, y = data.data, data.target
# Create model
terms = Spline("bp") + Spline("bmi", constraint="increasing") + Categorical("sex")
gam = GAM(terms)
# Cross validate
scores = cross_val_score(gam, df, y, scoring="r2")
print(scores) # array([0.26, 0.4 , 0.41, 0.35, 0.42])
Go to Read the Docs to see full documentation.
Contributing and development
Contributions are very welcome. You can correct spelling mistakes, write documentation, clean up code, implement new features, etc.
Some guidelines for development:
Code must comply with the standard. See the GitHub action pipeline for more information.
If possible, use existing algorithms from numpy, scipy and scikit-learn.
Write tests, especically regression tests if a bug is fixed.
Take backward compatibility seriously. API changes require good reason.
Installation for local development:
pip install -e '.[dev,lint,doc]'
Create documentation locally:
sudo apt install pandoc sphinx-build docs _built_docs/html -W -a -E --keep-going sphinx-autobuild docs _built_docs/html -v -j "auto" --watch generalized_additive_models
Once the version has been incremented, the commit must be tagged and pushed in order to publish to PyPi:
git tag -a v0.1.0 -m "Version 0.1.0" b22724c git push origin v0.1.0
Citing
TODO
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file generalized_additive_models-0.4.0.tar.gz
.
File metadata
- Download URL: generalized_additive_models-0.4.0.tar.gz
- Upload date:
- Size: 83.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2cae9882a5b12ee3e76f5dd3f6f1acaf3c0fb7ad67657ff42f54fa4fb5c4ec4a |
|
MD5 | 4c15b54efbd719f494734a9bb3bb4ec2 |
|
BLAKE2b-256 | f5cbf30da1d381493744b0471442403bbadddc24feb4cfe8a785caaae297aa72 |
File details
Details for the file generalized_additive_models-0.4.0-py3-none-any.whl
.
File metadata
- Download URL: generalized_additive_models-0.4.0-py3-none-any.whl
- Upload date:
- Size: 95.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f18c79163b42978177d589794b4dc643e24ebb9ba2147e999b565092ed553cfd |
|
MD5 | ad85450a0548a1723ee1c3cdda263782 |
|
BLAKE2b-256 | 3821e2a7c6afea6491270314ac6f9e7a00398e17300d9190c3413697e0f4caf5 |