Skip to main content

Python library for loading and working with sound datasets.

Project description

soundata

Common loaders for sound datasets. Find the API documentation here. Inspired by and based on mirdata. (https://github.com/soundata/soundata)

CircleCI codecov Documentation Status GitHub

This library provides tools for working with common sound datasets, including tools for:

  • Downloading datasets to a common location and format
  • Validating that the files for a dataset are all present
  • Loading annotation files to a common format
  • Parsing clip-level metadata for detailed evaluations

Installation

To install, simply run:

pip install soundata

Quick example

import soundata

urbansound8k = soundata.initialize('urbansound8k')
urbansound8k.download()  # download the dataset
urbansound8k.validate()  # validate that all the expected files are there

example_clip = urbansound8k.choice_clip()  # choose a random example clip
print(example_clip)  # see the available data

See the documentation for more examples and the API reference.

Currently supported datasets

  • ESC-50
  • FSD50K
  • FSDnoisy18K
  • TAU Urban Acoustic Scenes 2019
  • TAU Urban Acoustic Scenes 2020 Mobile
  • TUT Sound events 2017
  • URBAN-SED
  • UrbanSound8K
  • More added soon!

For the complete list of supported datasets, see the documentation

Citing

TODO

paper
bibtex

When working with datasets, please cite the version of soundata that you are using (given by the DOI above) AND include the reference of the dataset, which can be found in the respective dataset loader using the cite() method.

Contributing a new dataset loader

We welcome contributions to this library, especially new datasets. Please see contributing for guidelines.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soundata-0.1.0rc10.tar.gz (5.0 MB view details)

Uploaded Source

Built Distribution

soundata-0.1.0rc10-py3-none-any.whl (5.1 MB view details)

Uploaded Python 3

File details

Details for the file soundata-0.1.0rc10.tar.gz.

File metadata

  • Download URL: soundata-0.1.0rc10.tar.gz
  • Upload date:
  • Size: 5.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.10

File hashes

Hashes for soundata-0.1.0rc10.tar.gz
Algorithm Hash digest
SHA256 4372fefbb55d0d2944d55337249a624ed18442ddf004f0857b17ea9d7aa267bf
MD5 2a42d2dd9a41cf36ab0ece52c956a5f9
BLAKE2b-256 f63a9c14b9ccee9cc417e79926de88a2b9c11ff24ec1fbeca00f187b9aafe2f3

See more details on using hashes here.

Provenance

File details

Details for the file soundata-0.1.0rc10-py3-none-any.whl.

File metadata

  • Download URL: soundata-0.1.0rc10-py3-none-any.whl
  • Upload date:
  • Size: 5.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.10

File hashes

Hashes for soundata-0.1.0rc10-py3-none-any.whl
Algorithm Hash digest
SHA256 a8ec7a5583d499cdeebeff7a510795786585407776b2b580c0ebdd7df8732a31
MD5 d68fa440699b4b546cda8cc426936790
BLAKE2b-256 4db1ca15e68fc0dfe87ea58d95684e0e685a86a9d6995b95345a8e67b051e4e0

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page