Skip to main content

Deep learning with PyTorch and audio

Project description

audtorch on TravisCI audtorch's documentation on Read the Docs audtorch's MIT license

Deep learning with PyTorch and audio.

Documentation: https://audtorch.readthedocs.io

If you are interested in PyTorch and audio you should also check out the efforts to integrate more audio directly into PyTorch:

Installation

audtorch is supported by Python 3.5 or higher. To install it run (preferable in a virtual environment):

pip install audtorch

Usage

audtorch automates the data iteration process for deep neural network training using PyTorch. It provides a set of feature extraction transforms that can be implemented on-the-fly on the CPU.

The following example creates a data set of speech samples that are cut to a fixed length of 10240 samples. In addition they are augmented on the fly during data loading by a transform that adds samples from another data set:

>>> import sounddevice as sd
>>> from audtorch import datasets, transforms
>>> noise = datasets.WhiteNoise(duration=10240, sampling_rate=16000)
>>> augment = transforms.Compose([transforms.RandomCrop(10240),
...                               transforms.RandomAdditiveMix(noise)])
>>> data = datasets.LibriSpeech(root='~/LibriSpeech', sets='dev-clean',
...                             download=True, transform=augment)
>>> signal, label = data[8]
>>> sd.play(signal.transpose(), data.sampling_rate)

Besides data sets and transforms the package provides standard evaluation metrics, samplers, and necessary collate functions for training deep neural networks for audio tasks.

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.

Version 0.4.2 (2019-11-04)

  • Fixed: critical bug of missing files in wheel package (#60)

Version 0.4.1 (2019-10-25)

  • Fixed: default axis values for Masking transforms (#59)

Version 0.4.0 (2019-10-21)

  • Added: masking transforms in time and frequency domain

Version 0.3.2 (2019-10-04)

  • Fixed: long description in setup.cfg

Version 0.3.1 (2019-10-04)

  • Changed: define package in setup.cfg

Version 0.3.0 (2019-09-13)

  • Added: datasets.SpeechCommands (#49)

  • Removed: LogSpectrogram (#52)

Version 0.2.1 (2019-08-01)

  • Changed: Remove os.system call for moving files (#43)

  • Fixed: Remove broken logos from issue templates (#31)

  • Fixed: Wrong Spectrogram output shape in documentation (#40)

  • Fixed: Broken data set loading for relative paths (#33)

Version 0.2.0 (2019-06-28)

  • Added: Standardize, Log (#29)

  • Changed: Switch to Keep a Changelog format (#34)

  • Deprecated: LogSpectrogram (#29)

  • Fixed: normalize axis (#28)

Version 0.1.1 (2019-05-23)

  • Fixed: Broken API documentation on RTD

Version 0.1.0 (2019-05-22)

  • Added: Public release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audtorch-0.4.2.tar.gz (56.2 kB view details)

Uploaded Source

Built Distribution

audtorch-0.4.2-py3-none-any.whl (52.9 kB view details)

Uploaded Python 3

File details

Details for the file audtorch-0.4.2.tar.gz.

File metadata

  • Download URL: audtorch-0.4.2.tar.gz
  • Upload date:
  • Size: 56.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.8

File hashes

Hashes for audtorch-0.4.2.tar.gz
Algorithm Hash digest
SHA256 c084f1cf80dc73129a681569bc0a2142f20e091e479d2eb68c7b1bec0a8570f6
MD5 4ab3f742be2e9f94d6b234a3dbaeb399
BLAKE2b-256 c6a1c51f4d0587cda0677814f0bf58143cdec62df85912e0400b1646804e332c

See more details on using hashes here.

File details

Details for the file audtorch-0.4.2-py3-none-any.whl.

File metadata

  • Download URL: audtorch-0.4.2-py3-none-any.whl
  • Upload date:
  • Size: 52.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.8

File hashes

Hashes for audtorch-0.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 9d836a7b8d2d21e3c0ca7cfb4830edc5ceb2a7e3654cac595fde6b97ef3b1018
MD5 130acd33f0e7335d7fef157a936b850c
BLAKE2b-256 7f4a6923eb2a6e3696f47ab332f80d73d3a35945a2b36259a662aceca9645bcb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page