Deep learning with PyTorch and audio
Project description
Deep learning with PyTorch and audio.
Documentation: https://audtorch.readthedocs.io
If you are interested in PyTorch and audio you should also check out the efforts to integrate more audio directly into PyTorch:
Installation
audtorch is supported by Python 3.5 or higher. To install it run (preferable in a virtual environment):
pip install audtorch
Usage
audtorch automates the data iteration process for deep neural network training using PyTorch. It provides a set of feature extraction transforms that can be implemented on-the-fly on the CPU.
The following example creates a data set of speech samples that are cut to a fixed length of 10240 samples. In addition they are augmented on the fly during data loading by a transform that adds samples from another data set:
>>> import sounddevice as sd
>>> from audtorch import datasets, transforms
>>> noise = datasets.WhiteNoise(duration=10240, sampling_rate=16000)
>>> augment = transforms.Compose([transforms.RandomCrop(10240),
... transforms.RandomAdditiveMix(noise)])
>>> data = datasets.LibriSpeech(root='~/LibriSpeech', sets='dev-clean',
... download=True, transform=augment)
>>> signal, label = data[8]
>>> sd.play(signal.transpose(), data.sampling_rate)
Besides data sets and transforms the package provides standard evaluation metrics, samplers, and necessary collate functions for training deep neural networks for audio tasks.
Changelog
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
Version 0.4.1 (2019-10-25)
Fixed: default axis values for Masking transforms (#59)
Version 0.4.0 (2019-10-21)
Added: masking transforms in time and frequency domain
Version 0.3.2 (2019-10-04)
Fixed: long description in setup.cfg
Version 0.3.1 (2019-10-04)
Changed: define package in setup.cfg
Version 0.3.0 (2019-09-13)
Added: datasets.SpeechCommands (#49)
Removed: LogSpectrogram (#52)
Version 0.2.1 (2019-08-01)
Changed: Remove os.system call for moving files (#43)
Fixed: Remove broken logos from issue templates (#31)
Fixed: Wrong Spectrogram output shape in documentation (#40)
Fixed: Broken data set loading for relative paths (#33)
Version 0.2.0 (2019-06-28)
Added: Standardize, Log (#29)
Changed: Switch to Keep a Changelog format (#34)
Deprecated: LogSpectrogram (#29)
Fixed: normalize axis (#28)
Version 0.1.1 (2019-05-23)
Fixed: Broken API documentation on RTD
Version 0.1.0 (2019-05-22)
Added: Public release
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file audtorch-0.4.1.tar.gz
.
File metadata
- Download URL: audtorch-0.4.1.tar.gz
- Upload date:
- Size: 56.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 59c8d33c64eef23f6bd7acb4dcc6a7ec7f5c96180e675e9e475a049935adf01a |
|
MD5 | 95a1aedc344e4946e975fd5dc1a2e7be |
|
BLAKE2b-256 | 1387a4aa2495f801eb91e955d1f104a5b5c3498b084d7adde171941d3d73aae7 |
File details
Details for the file audtorch-0.4.1-py3-none-any.whl
.
File metadata
- Download URL: audtorch-0.4.1-py3-none-any.whl
- Upload date:
- Size: 12.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eca5d6cf438419fbde9a24e25693f34f9f54bede9a566580f03985ab2a9c3e4e |
|
MD5 | d7b65548177248b1df2de0f77fe461b9 |
|
BLAKE2b-256 | 2719a08a89eb1f2499589adde8aabaa8959b8c8b9deac60383a495590ad2f279 |