Skip to main content

Voice commands for JupyterLab

Project description

jupyterlab-voice-control

Github Actions Status Binder PyPI version

⚠️ This extension is experimental ⚠️

Voice commands for JupyterLab extension relies on experimental Speech Recognition API and is not suitable for use in production environments. You can check the current support on caniuse.com/speech-recognition.

Please be aware that the Speech Recognition method is an implementation detail of the browser, and some browsers, including Google Chrome, may send recordings of your voice to cloud for recognition.

Text edition capabilities are severely limited (e.g. it is only possible to delete words typed using the voice commands), but many issues are expected to be resolved with upgrade to CodeMirror 6, anticipated for JupyterLab 4.

While it is currently just proof of concept, please do get in touch by opening a new issue if you found this extension useful/promising and would like to leave any feedback which would be greatly appreciated.

Usage

To use this extension look for a microphone (microphone icon) icon on the status bar.

Initial icon location in the status bar

Press the disabled microphone icon (disabled microphone icon) to start voice recognition (or use Alt + v shortcut). You will need to agree for the browser to use your microphone:

Microphone access dialog in Chrome

You can use any of the commands available in the command palette, for example try "Download" or "Run selected cells".

Sometimes the speech recognition may be inaccurate, or you may misremember the name of the command. In those cases the voice control assistant will prompt you with suggestions with words "Did you mean..." and one or more suggestions on the status bar:

Suggestion saying "Did you mean New Launcher?"

  • You can accept the first suggestion by saying "yes".
  • To choose the second suggestion say "second suggestion", and so on.
  • Speech synthesis option is available (in the settings) if you prefer it over checking the status bar continuously.
  • The threshold for inclusion of a similar-sounding phrase as a suggestion can be adjusted in the settings.

If there are no errors or suggestions, the recognised speech will be shown on the status bar together with a confidence estimate:

Recognised phrase "close all tabs" with 87% confidence

By default commands recognised with low confidence (<50%) will not be executed (as it is deemed safer in case of any potentially destructive actions), but you can change the threshold in the settings.

Creating custom commands

In the settings editor you can assign custom phrases to commands in the same way as you can configure shortcuts. We refer to the words you need to say in order to execute a command as trigger phrase.

The voice control extension provides additional commands for interaction with the interface such as typing text (vc:insert-text), accepting suggestions (vc:accept-suggestion), stopping recognition (vc:stop-listening), etc.

The trigger phrase can be a regular expression and include named capturing groups. The captured groups will be passed as arguments to the command (/merged with pre-specified arguments if any, with groups overriding pre-specified arguments in case of a collision). For example, to create a command for opening files we could specify:

{
  "phrase": "^open file (?<path>.*)",
  "command": "filebrowser:open-path"
}

^ ensures that only phrases starting with "open file" will be matched.

We provide command vc:open-notebook (by default under open notebook (?<path>.*)) which automatically appends notebook file extension (.ipynb) as it can be problematic for speech recognition software.

Requirements

  • JupyterLab >= 3.4 (may work with older versions of 3.x, but not tested)

Install

To install the extension, execute:

pip install jupyterlab-voice-control

Uninstall

To remove the extension, execute:

pip uninstall jupyterlab-voice-control

Contributing

Development install

Note: You will need NodeJS to build the extension package.

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

# Clone the repo to your local environment
# Change directory to the jupyterlab-voice-control directory
# Install package in development mode
pip install -e .
# Link your development version of the extension with JupyterLab
jupyter labextension develop . --overwrite
# Rebuild extension Typescript source after making changes
jlpm build

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

# Watch the source directory in one terminal, automatically rebuilding when needed
jlpm watch
# Run JupyterLab in another terminal
jupyter lab

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

jupyter lab build --minimize=False

Development uninstall

pip uninstall jupyterlab-voice-control

In development mode, you will also need to remove the symlink created by jupyter labextension develop command. To find its location, you can run jupyter labextension list to figure out where the labextensions folder is located. Then you can remove the symlink named jupyterlab-voice-control within that folder.

Testing the extension

Frontend tests

This extension is using Jest for JavaScript code testing.

To execute them, execute:

jlpm
jlpm test

Integration tests

This extension uses Playwright for the integration tests (aka user level tests). More precisely, the JupyterLab helper Galata is used to handle testing the extension in JupyterLab.

More information are provided within the ui-tests README.

Packaging the extension

See RELEASE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jupyterlab-voice-control-0.2.1.tar.gz (345.1 kB view details)

Uploaded Source

Built Distribution

jupyterlab_voice_control-0.2.1-py3-none-any.whl (453.8 kB view details)

Uploaded Python 3

File details

Details for the file jupyterlab-voice-control-0.2.1.tar.gz.

File metadata

File hashes

Hashes for jupyterlab-voice-control-0.2.1.tar.gz
Algorithm Hash digest
SHA256 c1796e96ef4fd4c5c792ba845bac23f732542856fbd93b3432d1f5871058810c
MD5 89248de3f9f870774253e9fad962d3c1
BLAKE2b-256 56e3173ff9d6f5ea6cad0d9bde6c0f5468e587aabc7a3c3d3290ee6f1ea92028

See more details on using hashes here.

File details

Details for the file jupyterlab_voice_control-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for jupyterlab_voice_control-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 96b2fe5e0e320bea3e1f0933677bbc77eda3fef60a978fd77021aea4bfc8a726
MD5 9dac364d2982d7239c76f8ad736fa86c
BLAKE2b-256 7ef395b9318385b16426b116ac5022268b72a253edfcf64e9622874209ba9b46

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page