Skip to main content

Bob's audio processing utilities

Project description

http://img.shields.io/badge/docs-v2.1.10-yellow.svg http://img.shields.io/badge/docs-latest-orange.svg https://gitlab.idiap.ch/bob/bob.ap/badges/v2.1.10/build.svg https://gitlab.idiap.ch/bob/bob.ap/badges/v2.1.10/coverage.svg https://img.shields.io/badge/gitlab-project-0000c0.svg http://img.shields.io/pypi/v/bob.ap.svg

Audio Processing for Bob

This package is part of the signal-processing and machine learning toolbox Bob. It contains basic audio processing utilities. Currently, the following cepstral-based features are available: using rectangular (RFCC), mel-scaled triangular (MFCC) [Davis1980], inverted mel-scaled triangular (IMFCC), and linear triangular (LFCC) filters [Furui1981], spectral flux-based features (SSFC) [Scheirer1997], subband centroid frequency (SCFC) [Le2011]. We are planning to update and add more features in the near future.

Please note that the implementation of MFCC and LFCC features has changed compared to an earlier version of the package, as we corrected pre-emphasis and DCT computations. Delta and delta-delta computations were slightly changed too.

Installation

Complete Bob’s installation instructions. Then, to install this package, run:

$ conda install bob.ap

Contact

For questions or reporting issues to this software package, contact our development mailing list.

[Davis1980]

S. Davis and P. Mermelstein, “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, in IEEE Transactions on Acoustics, Speech, and Signal Processing, 1980, num 4, vol. 28, pages 357-366.

[Furui1981]

S. Furui, Cepstral analysis technique for automatic speaker verification, in IEEE Transactions on Acoustics, Speech, and Signal Processing, 1981, num 2 vol 29, pages 254-272.

[Scheirer1997]

E. Scheirer and M. Slaney, Construction and evaluation of a robust multifeature speech/music discriminator, in IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 1997, vol 2, pages 1331-1334.

[Le2011]

P. N. Le, E. Ambikairajah, J. Epps, V. Sethu, E. H. C. Choi, Investigation of Spectral Centroid Features for Cognitive Load Classification, in Speech Commun., April, 2011, num 4, vol 53, pages 540–551.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bob.ap-2.1.10.zip (88.8 kB view details)

Uploaded Source

File details

Details for the file bob.ap-2.1.10.zip.

File metadata

  • Download URL: bob.ap-2.1.10.zip
  • Upload date:
  • Size: 88.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for bob.ap-2.1.10.zip
Algorithm Hash digest
SHA256 77eb4b0a8bdc0faa5cd8acc159df78a0075df5ca6080535f303c020326b62f8b
MD5 bdd5c29c2359b4af289a0d37e61bda36
BLAKE2b-256 22e2068ee8d9f13451f8ca0c34c9c03cecffa6f577a14d6e9b24e65f2b752d9c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page