Common loaders for MIR datasets.
mirdata
Common loaders for Music Information Retrieval (MIR) datasets. Find the API documentation here.
This library provides tools for working with common MIR datasets, including tools for:
- downloading datasets to a common location and format
- validating that the files for a dataset are all present
- loading annotation files into a common format, consistent with the format required by mir_eval
- parsing track-level metadata for detailed evaluations
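As a rough illustration of what "a common format" can mean for annotations, the sketch below parses a two-column melody annotation (time in seconds, frequency in Hz) into two parallel sequences, which is the general shape mir_eval's melody functions expect. This is not mirdata's own parsing code: the function name `parse_melody_annotation` and the example annotation text are hypothetical, and plain Python lists stand in for the NumPy arrays that mir_eval works with.

```python
import csv
import io


def parse_melody_annotation(text):
    """Parse 'time,frequency' rows into two parallel lists of floats.

    Hypothetical helper for illustration; not part of mirdata.
    """
    times, freqs = [], []
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip completely empty lines
            continue
        times.append(float(row[0]))
        freqs.append(float(row[1]))
    return times, freqs


# Made-up annotation content; 0.0 Hz marks an unvoiced frame.
example = "0.00,0.0\n0.01,440.0\n0.02,442.5\n"
times, freqs = parse_melody_annotation(example)
print(times)
print(freqs)
```

The two lists can be converted with `numpy.asarray` before being passed to an evaluation routine.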
Installation
To install, simply run:
pip install mirdata
Quick example
import mirdata
orchset = mirdata.initialize('orchset')
orchset.download() # download the dataset
orchset.validate() # validate that all the expected files are there
example_track = orchset.choice_track() # choose a random example track
print(example_track) # see the available data
See the documentation for more examples and the API reference.
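To make the validation step more concrete, here is a minimal sketch of the general idea: compare the files on disk against an index of expected checksums, and report which files are missing and which differ. It assumes an index that maps relative paths to MD5 digests; the function names, the index layout, and the file names are hypothetical, and mirdata's actual index format and return values may differ.

```python
import hashlib
import os
import tempfile


def md5_of(path):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def validate_files(data_home, index):
    """Return (missing, invalid) lists of relative paths.

    `index` maps relative path -> expected MD5 digest (assumed layout).
    """
    missing, invalid = [], []
    for rel_path, expected in index.items():
        full_path = os.path.join(data_home, rel_path)
        if not os.path.exists(full_path):
            missing.append(rel_path)
        elif md5_of(full_path) != expected:
            invalid.append(rel_path)
    return missing, invalid


# Build a tiny throwaway "dataset" to exercise the check.
with tempfile.TemporaryDirectory() as data_home:
    with open(os.path.join(data_home, "a.txt"), "wb") as f:
        f.write(b"hello")
    with open(os.path.join(data_home, "b.txt"), "wb") as f:
        f.write(b"changed")
    index = {
        "a.txt": hashlib.md5(b"hello").hexdigest(),          # present, matches
        "b.txt": hashlib.md5(b"original").hexdigest(),       # present, differs
        "c.txt": hashlib.md5(b"never written").hexdigest(),  # absent
    }
    missing, invalid = validate_files(data_home, index)
    print("missing:", missing)
    print("invalid:", invalid)
```

Separating missing files from checksum mismatches makes it easier to tell an incomplete download from a corrupted or modified one.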
Currently supported datasets
Supported datasets include AcousticBrainz, DALI, GuitarSet, MAESTRO, and TinySOL, among many others.
For the complete list of supported datasets, see the documentation.
Citing
There are two ways of citing mirdata:
If you are using the library for your work, please cite the version you used, as indexed on Zenodo by its DOI.
If you refer to mirdata's design principles, motivation, etc., please cite the following paper:
"mirdata: Software for Reproducible Usage of Datasets"
Rachel M. Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, and Thor Kell
in International Society for Music Information Retrieval (ISMIR) Conference, 2019
@inproceedings{
bittner_fuentes_2019,
title={mirdata: Software for Reproducible Usage of Datasets},
author={Bittner, Rachel M and Fuentes, Magdalena and Rubinstein, David and Jansson, Andreas and Choi, Keunwoo and Kell, Thor},
booktitle={International Society for Music Information Retrieval (ISMIR) Conference},
year={2019}
}
Contributing a new dataset loader
We welcome contributions to this library, especially new datasets. Please see contributing for guidelines.