Skip to main content

A library to build and test machine learning features

Project description

This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).

Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:

  • Defining and documenting features

  • Testing your features against specified cases and against randomly generated cases (stress-testing). This helps you making your application more robust against invalid/misformatted input data. This also helps you checking that low-relevance results when doing feature analysis is actually because the feature is bad, and not because there’s a slight bug in your feature code.

  • Evaluating your features on a data set, producing a feature evaluation matrix. The evaluator has a robust mode that allows you some tolerance both for invalid data and buggy features.

Installation

Just pip install featureforge.

Documentation

Documentation is available at http://feature-forge.readthedocs.org/en/latest/

Contact information

Feature Forge is © 2014 Machinalis (http://www.machinalis.com/). Its primary authors are:

Any contributions or suggestions are welcome, the official channel for this is submitting github pull requests or issues.

Changelog

0.1.1:
  • Added support for python 3

  • Added support for bag-of-words features

0.1:
  • Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

featureforge-0.1.1.tar.gz (32.4 kB view details)

Uploaded Source

File details

Details for the file featureforge-0.1.1.tar.gz.

File metadata

File hashes

Hashes for featureforge-0.1.1.tar.gz
Algorithm Hash digest
SHA256 bdb3229dff7562eddcf863266f6b54dd5ad15c0544abbf54804a25b1c3455fda
MD5 951b444f7b84f5b66ec1dec138ec58e4
BLAKE2b-256 52ae38cc83f4100c6ea91304a13fd62a8ccb6c918efdacdde64976bc8654becb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page