
SciKit-Learn Laboratory makes it easier to run machine learning experiments with scikit-learn.

Project description


This Python package provides command-line utilities to make it easier to run machine learning experiments with scikit-learn. One of the primary goals of our project is to make it so that you can run scikit-learn experiments without actually needing to write any code other than what you used to generate/extract the features.

Command-line Interface

The main utility we provide is called run_experiment, and it makes it easy to run a series of learners on datasets specified in a configuration file like this:

[General]
experiment_name = Titanic_Evaluate_Tuned
# valid tasks: cross_validate, evaluate, predict, train
task = evaluate

[Input]
# these directories could also be absolute paths
# (and must be if you're not running things in local mode)
train_directory = train
test_directory = dev
# Can specify multiple sets of feature files that are merged together automatically
# (even across formats)
featuresets = [["family.ndj", "misc.csv", "socioeconomic.arff", "vitals.csv"]]
# List of scikit-learn learners to use
learners = ["RandomForestClassifier", "DecisionTreeClassifier", "SVC", "MultinomialNB"]
# Column in CSV containing labels to predict
label_col = Survived
# Column in CSV containing instance IDs (if any)
id_col = PassengerId

[Tuning]
# Should we tune parameters of all learners by searching provided parameter grids?
grid_search = true
# Function to maximize when performing grid search
objective = accuracy

[Output]
# again, these can/should be absolute paths
log = output
results = output
predictions = output
models = output
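The configuration file is standard INI syntax, so before handing it to run_experiment you can sanity-check it with Python's built-in configparser; note that list-valued options such as featuresets and learners are written as JSON lists. This is only an illustration of the file format (SKLL does its own parsing), and the inlined config below just repeats the example above:

```python
# Sketch: inspect a SKLL-style config with the stdlib only.
# The sections and options mirror the example configuration above.
import configparser
import json

config = configparser.ConfigParser()
config.read_string("""
[General]
experiment_name = Titanic_Evaluate_Tuned
task = evaluate

[Input]
train_directory = train
test_directory = dev
featuresets = [["family.ndj", "misc.csv", "socioeconomic.arff", "vitals.csv"]]
learners = ["RandomForestClassifier", "DecisionTreeClassifier", "SVC", "MultinomialNB"]
label_col = Survived
id_col = PassengerId

[Tuning]
grid_search = true
objective = accuracy
""")

# List-valued options use JSON list syntax, so json.loads recovers them.
learners = json.loads(config["Input"]["learners"])
featuresets = json.loads(config["Input"]["featuresets"])
print(learners[0])          # RandomForestClassifier
print(len(featuresets[0]))  # 4 feature files merged into one set
```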

For more information about getting started with run_experiment, please check out our tutorial, or our config file specs.

We also provide utilities for common tasks such as converting between feature file formats (skll_convert), filtering and joining feature files (filter_features, join_features), and generating predictions from trained models (generate_predictions).

Python API

If you just want to avoid writing a lot of boilerplate learning code, you can also use our simple Python API. The main way you’ll want to use the API is through the Learner and Reader classes. For more details on our API, see the documentation.
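A minimal sketch of that workflow, assuming SKLL is installed (pip install skll): the Learner and Reader classes are the ones named above, while the file paths and the label column are hypothetical examples, not part of the library. The import is guarded so the sketch still loads without SKLL present.

```python
# Hedged sketch of the SKLL Python API; paths and label_col are examples.
try:
    from skll.data import Reader
    from skll.learner import Learner
    HAVE_SKLL = True
except ImportError:  # keep the sketch importable without SKLL installed
    HAVE_SKLL = False

def evaluate_random_forest(train_path, test_path):
    """Train a RandomForestClassifier on one feature file and evaluate it."""
    # Reader.for_path chooses the appropriate reader from the file extension
    train_set = Reader.for_path(train_path, label_col="Survived").read()
    test_set = Reader.for_path(test_path, label_col="Survived").read()
    learner = Learner("RandomForestClassifier")
    learner.train(train_set)
    # evaluate() reports the confusion matrix, accuracy, and per-label scores
    return learner.evaluate(test_set)
```

For example, `evaluate_random_forest("train/family.ndj", "dev/family.ndj")` would mirror one train/test pair from the configuration example above.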

While our API can be broadly useful, it should be noted that the command-line utilities are intended as the primary way of using SKLL. The API is just a nice side-effect of our developing the utilities.

A Note on Pronunciation


SciKit-Learn Laboratory (SKLL) is pronounced “skull”: that’s where the learning happens.

Requirements

Talks

  • Simpler Machine Learning with SKLL 1.0, Dan Blanchard, PyData NYC 2014 (video | slides)

  • Simpler Machine Learning with SKLL, Dan Blanchard, PyData NYC 2013 (video | slides)

Books

SKLL is featured in Data Science at the Command Line by Jeroen Janssens.

Changelog

See GitHub releases.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skll-1.1.0.tar.gz (2.7 MB)

Uploaded: Source

Built Distribution

skll-1.1.0-py2.py3-none-any.whl (76.0 kB)

Uploaded: Python 2, Python 3

File details

Details for the file skll-1.1.0.tar.gz.

File metadata

  • Download URL: skll-1.1.0.tar.gz
  • Upload date:
  • Size: 2.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for skll-1.1.0.tar.gz:

  • SHA256: 9afb07a3bffbbdde693da74fb11c21a6c7f656416111ee8d912b3d75e70b9e9b
  • MD5: 764963f22ac62a9f109ef685511af6b4
  • BLAKE2b-256: e67e4d00f648233835b2f1cd091cc16bd600754d579c745d80298b6f4c13b25a

See more details on using hashes here.
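One common use of these digests is verifying a download locally. A small stdlib sketch: after downloading skll-1.1.0.tar.gz, compare its SHA256 against the value published above (the helper name sha256_of is ours, not part of any library):

```python
# Sketch: compute a file's SHA256 digest with the stdlib, streaming in
# chunks so large archives don't need to fit in memory.
import hashlib

def sha256_of(path, chunk_size=1 << 16):
    """Return the hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

# Usage: compare against the published digest for the sdist, e.g.
# sha256_of("skll-1.1.0.tar.gz") == "9afb07a3bffbbdde693da74fb11c21a6c7f656416111ee8d912b3d75e70b9e9b"
```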

File details

Details for the file skll-1.1.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for skll-1.1.0-py2.py3-none-any.whl:

  • SHA256: f7e7b67bbd10925c8a8ef63c97afd25fd2bccb6a3db999ef1717f52d137cf9fa
  • MD5: 0243fd91adc6ff7f679d9412a6c22fe1
  • BLAKE2b-256: 480995c6dae730d9a1e05a4b06f88f4561abc317d55f2ab65cb672e6866e8620

See more details on using hashes here.
