Skip to main content

Debug machine learning classifiers and explain their predictions

Project description

PyPI Version Build Status Code Coverage Documentation

ELI5 is a Python package which helps to debug machine learning classifiers and explain their predictions.

Currently it allows to:

  • explain weights and predictions of scikit-learn linear classifiers and regressors;

  • explain weights of scikit-learn decision trees and tree-based ensemble classifiers (via feature importances);

  • debug scikit-learn pipelines which contain HashingVectorizer, by undoing hashing;

  • explain predictions of any black-box classifier using LIME ( http://arxiv.org/abs/1602.04938 ) algorithm.

TODO:

License is MIT.

Check docs for more (sorry, also TODO).

Changelog

0.0.5 (2016-09-27)

  • HashingVectorizer support in explain_prediction;

  • add an option to pass coefficient scaling array; it is useful if you want to compare coefficients for features which scale or sign is different in the input;

  • bug fix: classifier weights are no longer changed by eli5 functions.

0.0.4 (2016-09-24)

  • eli5.sklearn.InvertableHashingVectorizer and eli5.sklearn.FeatureUnhasher allow to recover feature names for pipelines which use HashingVectorizer or FeatureHasher;

  • added support for scikit-learn linear regression models (ElasticNet, Lars, Lasso, LinearRegression, LinearSVR, Ridge, SGDRegressor);

  • doc and vec arguments are swapped in explain_prediction function; vec can now be omitted if an example is already vectorized;

  • fixed issue with dense feature vectors;

  • all class_names arguments are renamed to target_names;

  • feature name guessing is fixed for scikit-learn ensemble estimators;

  • testing improvements.

0.0.3 (2016-09-21)

  • support any black-box classifier using LIME (http://arxiv.org/abs/1602.04938) algorithm; text data support is built-in;

  • “vectorized” argument for sklearn.explain_prediction; it allows to pass example which is already vectorized;

  • allow to pass feature_names explicitly;

  • support classifiers without get_feature_names method using auto-generated feature names.

0.0.2 (2016-09-19)

  • ‘top’ argument of explain_prediction can be a tuple (num_positive, num_negative);

  • classifier name is no longer printed by default;

  • added eli5.sklearn.explain_prediction to explain individual examples;

  • fixed numpy warning.

0.0.1 (2016-09-15)

Pre-release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eli5-0.0.5.tar.gz (15.3 kB view details)

Uploaded Source

Built Distribution

eli5-0.0.5-py2.py3-none-any.whl (20.4 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file eli5-0.0.5.tar.gz.

File metadata

  • Download URL: eli5-0.0.5.tar.gz
  • Upload date:
  • Size: 15.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for eli5-0.0.5.tar.gz
Algorithm Hash digest
SHA256 cfb03426fd7022aa9d1b4402389895c9e220f4655003841b2bd5da2320ef19b9
MD5 fade357ed8911898596dc3d4fa468eea
BLAKE2b-256 fad7f96aad853511ed82160ed2821ef4295faa78747b7d53751d4b2e1de9d5ef

See more details on using hashes here.

Provenance

File details

Details for the file eli5-0.0.5-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for eli5-0.0.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 8ef2131f06a6661301693782c1daf85d16249f856d536732122d811fc2b7cd9e
MD5 78eb075ae62b764529260ab5f994e046
BLAKE2b-256 eef7024974e1b212a2f9cfdec65f6e78c3c1c3ade9ab1dd5fa28f5193095c869

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page