A library to build and test machine learning features
Project description
This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).
Most machine learning problems involve an step of feature definition and preprocessing. Feature Forge helps you with:
Defining and documenting features
Testing your features against specified cases and against randomly generated cases (stress-testing). This helps you making your application more robust against invalid/misformatted input data. This also helps you checking that low-relevance results when doing feature analysis is actually because the feature is bad, and not because there’s a slight bug in your feature code.
Evaluating your features on a data set, producing a feature evaluation matrix. The evaluator has a robust mode that allows you some tolerance both for invalid data and buggy features.
Installation
Just pip install featureforge.
Documentation
Documentation is available at http://feature-forge.readthedocs.org/en/latest/
Contact information
Feature Forge is © 2014 Machinalis (http://www.machinalis.com/). Its primary authors are:
Javier Mansilla <jmansilla@machinalis.com> (jmansilla at github)
Daniel Moisset <dmoisset@machinalis.com> (dmoisset at github)
Rafael Carrascosa <rcarrascosa@machinalis.com> (rafacarrascosa at github)
Any contributions or suggestions are welcome, the official channel for this is submitting github pull requests or issues.
Changelog
- 0.1.1:
Added support for python 3
Added support for bag-of-words features
- 0.1:
Initial release
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file featureforge-0.1.1.tar.gz
.
File metadata
- Download URL: featureforge-0.1.1.tar.gz
- Upload date:
- Size: 32.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bdb3229dff7562eddcf863266f6b54dd5ad15c0544abbf54804a25b1c3455fda |
|
MD5 | 951b444f7b84f5b66ec1dec138ec58e4 |
|
BLAKE2b-256 | 52ae38cc83f4100c6ea91304a13fd62a8ccb6c918efdacdde64976bc8654becb |