Data Analysis Baseline Library

Project description

dabl

The data analysis baseline library.

  • "Mr Sanchez, are you a data scientist?"
  • "I dabl, Mr president."

Find more information on the website.

State of the library

Right now, this library is still a prototype. The API might change, and you shouldn't rely on it in any critical setting.

Try it out

pip install dabl

or Binder

Current scope and upcoming features

This library is still under active development. The current code focuses mostly on exploratory visualization and preprocessing. There are also drop-in replacements for GridSearchCV and RandomizedSearchCV that use successive halving. There are preliminary portfolios in the style of POSH auto-sklearn to find strong models quickly. In essence, that boils down to a quick search over different gradient boosting models and other tree ensembles, and potentially kernel methods.
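To give a feel for what successive halving does, here is a minimal pure-Python sketch of the general technique. It is not dabl's implementation: the function names, the toy scoring function, and the candidate values are all hypothetical and only stand in for a real cross-validated model score.

```python
import random


def successive_halving(candidates, evaluate, min_budget=1, factor=2):
    """Repeatedly keep the best 1/factor of the candidates while growing the budget.

    `evaluate(candidate, budget)` must return a score where higher is better.
    """
    survivors = list(candidates)
    budget = min_budget
    while len(survivors) > 1:
        # Score every surviving candidate using the current (cheap) budget.
        ranked = sorted(survivors, key=lambda c: evaluate(c, budget), reverse=True)
        # Keep the top fraction (at least one) and spend more on the next round.
        survivors = ranked[: max(1, len(ranked) // factor)]
        budget *= factor
    return survivors[0]


def toy_score(learning_rate, budget):
    """Hypothetical stand-in for a model score: best at 0.1, less noisy with more budget."""
    rng = random.Random(f"{learning_rate}-{budget}")  # deterministic noise per call
    noise = rng.uniform(-0.02, 0.02) / budget
    return -abs(learning_rate - 0.1) + noise


learning_rates = [0.001, 0.01, 0.03, 0.1, 0.3, 1.0, 3.0, 10.0]
best = successive_halving(learning_rates, toy_score)
print(best)
```

In a real search the budget would typically be the number of training samples or boosting iterations, so poor candidates are discarded after only a cheap evaluation.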

Stay Tuned!

Pandas Profiling package

The Pandas Profiling package is useful for initial data analysis. It can produce a thorough summary of a dataset in a single line of code. With the ProfileReport class, you can generate an HTML report of your data that helps you find correlations and identify missing data.
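A rough sketch of that workflow is shown below, assuming pandas and pandas-profiling are installed. The helper name write_profile_report, the default file name, and the default title are made up for illustration; only ProfileReport and its to_file method come from the package.

```python
def write_profile_report(df, output_path="report.html", title="Data profile"):
    """Write an HTML profiling report for a pandas DataFrame and return its path.

    The import lives inside the function so this module can still be loaded
    on machines where pandas-profiling is not installed.
    """
    from pandas_profiling import ProfileReport

    # Building the report is the "single line"; to_file renders it as HTML.
    report = ProfileReport(df, title=title)
    report.to_file(output_path)
    return output_path
```

Calling write_profile_report(df) on a DataFrame loaded with, for example, pandas.read_csv should leave a report.html file in the working directory that can be opened in a browser.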

Try it out

pip install pandas-profiling

or see https://github.com/pandas-profiling/pandas-profiling

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dabl-0.1.9.tar.gz (566.4 kB view details)

Uploaded Source

File details

Details for the file dabl-0.1.9.tar.gz.

File metadata

  • Download URL: dabl-0.1.9.tar.gz
  • Upload date:
  • Size: 566.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0.post20200814 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.6

File hashes

Hashes for dabl-0.1.9.tar.gz

Algorithm    Hash digest
SHA256       21f3fe8b39b6503441e3a856d7d976c7914def1fc0419a17723cd4af9839b7ee
MD5          5d5d967f413a7cecff3cb8cdcbfa12ce
BLAKE2b-256  032b44cda7092e17e477bba5145ec2b10db9969eff38218bc2f256ec976b9820
