Deep audio and image embeddings, based on Look, Listen, and Learn approach
Project description
OpenL3
OpenL3 is an open-source Python library for computing deep audio and (eventually) image embeddings.
Please refer to the documentation for detailed instructions and examples.
The audio and image embedding models provided here are published as part of [1], and are based on the Look, Listen and Learn approach [2]. For details about the embedding models and how they were trained, please see:
Look, Listen, and Learn More: Design Choices for Deep Audio Embeddings
Jason Cramer, Ho-Hsiang Wu, Justin Salamon and Juan Pablo Bello
Under review, 2018.
Installing OpenL3
Dependencies
Tensorflow
Because Tensorflow comes in CPU-only and GPU variants, we leave it up to the user to install the version that best fits their usecase.
On most platforms, either of the following commands should properly install Tensorflow:
pip install tensorflow # CPU-only version
pip install tensorflow-gpu # GPU version
For more detailed information, please consult the Tensorflow installation documentation.
libsndfile
OpenL3 depends on the pysoundfile
module to load audio files, which depends on the non-Python library
libsndfile
. On Windows and macOS, these will be installed via pip
and you can therefore skip this step.
However, on Linux this must be installed manually via your platform's package manager.
For Debian-based distributions (such as Ubuntu), this can be done by simply running
apt-get install libsndfile1
For more detailed information, please consult the
pysoundfile
installation documentation.
Installing OpenL3
The simplest way to install OpenL3 is by using pip
, which will also install the additional required dependencies
if needed. To install OpenL3 using pip
, simply run
pip install openl3
To install the latest version of OpenL3 from source:
-
Clone or pull the lastest version:
git clone git@github.com:marl/openl3.git
-
Install using pip to handle python dependencies:
cd openl3 pip install -e .
Using OpenL3
To help you get started with OpenL3 please see the tutorial.
Acknowledging OpenL3
Please cite the following papers when using OpenL3 in your work:
[1] Look, Listen, and Learn More: Design Choices for Deep Audio Embeddings
Jason Cramer, Ho-Hsiang Wu, Justin Salamon and Juan Pablo Bello
Under review, 2018.
[2] Look, Listen and Learn
Relja Arandjelović and Andrew Zisserman
IEEE International Conference on Computer Vision (ICCV), Venice, Italy, Oct. 2017.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.