Python library for loading and working with sound datasets.
Project description
soundata
Common loaders for sound datasets. Find the API documentation here. Inspired by and based on mirdata. (https://github.com/soundata/soundata)
This library provides tools for working with common sound datasets, including tools for:
- Downloading datasets to a common location and format
- Validating that the files for a dataset are all present
- Loading annotation files to a common format
- Parsing clip-level metadata for detailed evaluations
Installation
To install, simply run:
pip install soundata
Quick example
import soundata
urbansound8k = soundata.initialize('urbansound8k')
urbansound8k.download() # download the dataset
urbansound8k.validate() # validate that all the expected files are there
example_clip = urbansound8k.choice_clip() # choose a random example clip
print(example_clip) # see the available data
See the documentation for more examples and the API reference.
Currently supported datasets
- ESC-50
- TAU Urban Acoustic Scenes 2019
- TAU Urban Acoustic Scenes 2020 Mobile
- TUT Sound events 2017
- URBAN-SED
- UrbanSound8K
- More added soon!
For the complete list of supported datasets, see the documentation
Citing
TODO
paper
bibtex
When working with datasets, please cite the version of soundata
that you are using (given by the DOI
above) AND include the reference of the dataset,
which can be found in the respective dataset loader using the cite()
method.
Contributing a new dataset loader
We welcome contributions to this library, especially new datasets. Please see contributing for guidelines.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for soundata-0.1.0rc7-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d934054a38ae2866bbc09596f1624bb356ffd6e84278677a4b6c200ee3397a5d |
|
MD5 | 772f5db10ac05bfe4ab3ccb866de0c2f |
|
BLAKE2b-256 | 4f8d8df46eea1c98f88061de888200cff1f35fd32aacd01b54b09fa025a5d3f4 |