Deep learning with PyTorch and audio

Documentation: https://audtorch.readthedocs.io

If you are interested in PyTorch and audio, you should also check out the efforts to integrate more audio directly into PyTorch:

Installation

audtorch supports Python 3.5 or higher. To install it, run (preferably in a virtual environment):

pip install audtorch

Usage

audtorch automates the data iteration process for deep neural network training using PyTorch. It provides a set of feature extraction transforms that can be applied on the fly on the CPU.

The following example creates a data set of speech samples that are cut to a fixed length of 10240 samples. In addition, they are augmented on the fly during data loading by a transform that adds samples from another data set:

>>> import sounddevice as sd
>>> from audtorch import datasets, transforms
>>> noise = datasets.WhiteNoise(duration=10240, sampling_rate=16000)
>>> augment = transforms.Compose([transforms.RandomCrop(10240),
...                               transforms.RandomAdditiveMix(noise)])
>>> data = datasets.LibriSpeech(root='~/LibriSpeech', sets='dev-clean',
...                             download=True, transform=augment)
>>> signal, label = data[8]
>>> sd.play(signal.transpose(), data.sampling_rate)
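
To make the two augmentation steps above more concrete, here is a minimal, library-independent sketch of what a random crop followed by an additive noise mix does to a signal. It uses only the Python standard library and plain lists. The function names `random_crop` and `additive_mix` and the `gain` parameter are hypothetical, chosen for illustration; this is not how audtorch implements `RandomCrop` or `RandomAdditiveMix`.

```python
import random


def random_crop(signal, size, rng=random):
    """Return `size` consecutive samples starting at a random offset."""
    if len(signal) < size:
        raise ValueError("signal is shorter than the requested crop size")
    start = rng.randint(0, len(signal) - size)  # inclusive on both ends
    return signal[start:start + size]


def additive_mix(signal, noise, gain=0.1):
    """Add a scaled noise signal, sample by sample, to `signal`.

    `gain` is a hypothetical mixing factor used only in this sketch.
    """
    if len(noise) < len(signal):
        raise ValueError("noise must be at least as long as the signal")
    return [s + gain * n for s, n in zip(signal, noise)]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
speech = [rng.uniform(-1.0, 1.0) for _ in range(16000)]  # stand-in for speech
noise = [rng.gauss(0.0, 1.0) for _ in range(10240)]      # stand-in for white noise

cropped = random_crop(speech, 10240, rng)
augmented = additive_mix(cropped, noise)
print(len(augmented))  # 10240
```

Because the crop offset is drawn anew on every call, the same recording can yield a different training example each time it is loaded, which is the point of applying augmentation during data loading rather than ahead of time.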

Besides data sets and transforms, the package provides standard evaluation metrics, samplers, and the collate functions needed to train deep neural networks for audio tasks.
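
Collate functions matter for audio because recordings in one batch usually differ in length. As a rough illustration of the idea, the following sketch pads every signal in a batch to the length of the longest one and keeps the original lengths. The name `pad_collate` and its `pad_value` parameter are hypothetical and written with plain Python lists; this is not the collate API that audtorch ships.

```python
def pad_collate(batch, pad_value=0.0):
    """Pad every signal in the batch to the length of the longest one.

    `batch` is a list of (signal, label) pairs, where each signal is a
    list of samples. Returns (padded_signals, lengths, labels).
    """
    lengths = [len(signal) for signal, _ in batch]
    max_length = max(lengths)
    # Append pad_value to the end of each shorter signal
    padded = [signal + [pad_value] * (max_length - len(signal))
              for signal, _ in batch]
    labels = [label for _, label in batch]
    return padded, lengths, labels


batch = [([0.1, 0.2, 0.3], "a"), ([0.5], "b")]
signals, lengths, labels = pad_collate(batch)
print(signals)  # [[0.1, 0.2, 0.3], [0.5, 0.0, 0.0]]
print(lengths)  # [3, 1]
print(labels)   # ['a', 'b']
```

In a PyTorch training loop, a function of this shape would typically be passed as the `collate_fn` argument of `torch.utils.data.DataLoader`, with the padded lists converted to tensors before they are returned.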
