Skip to main content

Data Analysis Baseline Library

Project description

dabl

The data analysis baseline library.

  • "Mr Sanchez, are you a data scientist?"
  • "I dabl, Mr president."

Find more information on the website.

Try it out

pip install dabl

or Binder

Current scope and upcoming features

This library is very much still under development. Current code focuses mostly on exploratory visualization and preprocessing. There are also drop-in replacements for GridSearchCV and RandomizedSearchCV using successive halfing. There are preliminary portfolios in the style of POSH auto-sklearn to find strong models quickly. In essence that boils down to a quick search over different gradient boosting models and other tree ensembles and potentially kernel methods.

Check out the the website and example gallery to get an idea of the visualizations that are available.

Stay Tuned!

Related packages

Pandas Profiling

Pandas Profiling can provide a thorough summary of the data in only a single line of code. Using the ProfileReport() method, you are able to access a HTML report of your data that can help you find correlations and identify missing data.

dabl focuses less on statistical measures of individual columns, and more on providing a quick overview via visualizations, as well as convienient preprocessing and model search for machine learning.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dabl-0.2.2.tar.gz (554.0 kB view details)

Uploaded Source

Built Distribution

dabl-0.2.2-py3-none-any.whl (559.0 kB view details)

Uploaded Python 3

File details

Details for the file dabl-0.2.2.tar.gz.

File metadata

  • Download URL: dabl-0.2.2.tar.gz
  • Upload date:
  • Size: 554.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.9.1

File hashes

Hashes for dabl-0.2.2.tar.gz
Algorithm Hash digest
SHA256 68e2eeea699fb72a5c2fdd00589eed8cfc2e5f1e826136f38938723aaa80f73f
MD5 821f6324d8d64c14333c7ce4f457459a
BLAKE2b-256 b12c92b72e48d6ce8742e4ee5f686d52bb5e5ed97ac90d5f7efd6df8764fde56

See more details on using hashes here.

Provenance

File details

Details for the file dabl-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: dabl-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 559.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.9.1

File hashes

Hashes for dabl-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a5f729ff42b75c1e8528cb4b3ded16e090eca9dd7ee38a7d51b475ea5f45b871
MD5 ab71e3fe90b34912ee139399f13e5e7c
BLAKE2b-256 55db4aa72145226c34d9c25a0cdaa5161586f6468b44a6b49627588087ce04b4

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page