Skip to main content

Static image export for web-based visualization libraries with zero dependencies

Project description

Overview

Kaleido is a cross-platform library for generating static images (e.g. png, svg, pdf, etc.) for web-based visualization libraries, with a particular focus on eliminating external dependencies. The project's initial focus is on the export of plotly.js images from Python for use by plotly.py, but it is designed to be relatively straight-forward to extend to other web-based visualization libraries, and other programming languages. The primary focus of Kaleido (at least initially) is to serve as a dependency of web-based visualization libraries like plotly.py. As such, the focus is on providing a programmatic-friendly, rather than user-friendly, API.

Try it out from Python

Install the kaleido wheel.

$ pip install kaleido

Install plotly.

$ pip install plotly

Export a plotly figure as a png image.

from kaleido.scopes.plotly import PlotlyScope
import plotly.graph_objects as go
scope = PlotlyScope()

fig = go.Figure(data=[go.Scatter(y=[1, 3, 2])])
with open("figure.png", "wb") as f:
    f.write(scope.transform(fig, format="png"))

Then, open figure.png in the current working directory.

figure

Background

As simple as it sounds, programmatically generating static images from web-based visualization libraries is a difficult problem. The core difficulty is that these libraries don't actually render plots (i.e. color the pixels) on their own, instead they delegate this work to web technologies like SVG, Canvas, WebGL, etc. This means that they are entirely dependent on the presence of a complete web browser to operate.

When the figure is displayed in a browser window, it's relatively straight-forward for a visualization library to provide an export-image button because it has full access to the browser for rendering. The difficulty arises when trying to export an image programmatically (e.g. from Python) without displaying it in a browser and without user interaction. To accomplish this, the Python portion of the visualization library needs programmatic access to a full web browser.

There are three main approaches that are currently in use among Python web-based visualization libraries.

  1. bokeh, altair, bqplot, and ipyvolume rely on the Selenium Python library to control a system web browser such as Firefox or Chrome/Chromium to perform image rendering.
  2. plotly.py relies on Orca, which is a custom headless Electron application that uses the Chromium browser engine built into Electron to perform image rendering. Orca runs as a local web server and plotly.py sends requests to it using a local port.
  3. When operating in the Jupyter notebook or JupyterLab, ipyvolume also supports programmatic image export by sending export requests to the browser using the ipywidgets protocol.

While options 1 and 2 can both be installed using conda, they still rely on the presence of some components that must be installed externally. For example, on Linux both require the installation of system libraries like libXss that are not typically included in headless Linux installations like you find in JupyterHub installations, Binder, Colab, Azure notebooks, SageMaker, etc. Also, conda is not as universally available as the pip package manager and neither approach is installable using pip packages.

Additionally, both 1 and 2 communicate between the Python process and the web browser over a local network port. While not typically a problem, certain firewall and container configurations can interfere with this local network connection.

The advantage of options 3 is that it introduces no additional system dependencies. The disadvantage is that it only works when running in a notebook, not in standalone Python scripts.

The end result is that all of these libraries have in-depth documentation pages on how to get image export working, and how to troubleshoot the inevitable failures and edge cases. While this is a great improvement over the state of affairs just a couple of years ago, and a lot of excellent work has gone into making these approaches work as seamlessly as possible, the fundamental limitations detailed above still result in sub-optimal user experiences. This is especially true when comparing web-based plotting libraries to traditional plotting libraries like matplotlib and ggplot2 where there's never a question of whether image export will work

The goal of Kaleido is to make static image export of web-based visualization libraries as universally available and reliable as that of matplotlib and ggplot2.

Approach

To accomplish this goal, Kaleido introduces a new approach. The core of Kaleido is a standalone C++ application that embeds Chromium as a library. This architecture allows Kaleido to communicate with the browser engine using the C++ API rather than requiring a local network connection. A thin Python wrapper runs the Kaledo C++ application as a subprocess and communicates with it by writing image export JSON requests to standard-in and retrieving results by reading from standard-out. Other language wrappers (e.g. R, Julia, Scala, Rust, etc.) can fairly easily be written in the future because the interface relies only on standard-in / standard-out communication using JSON requests.

By compiling Chromium as a library, we have a degree of control over what is included in the Chromium build. In particular, on Linux we can build Chromium in headless mode, which eliminates a large number of runtime dependencies (e.g. the libXss library mentioned above). The remaining dependencies are small enough to bundle with the library, making it possible to run Kaleido in the most minimal Linux environments with no additional dependencies required. So for example, the C++ Kaleido executable can run inside an ubuntu:16.04 docker container without anything be installed using apt.

The Python wrapper and the Kaleido executable can then be packaged as operating system dependent Python wheels that can be distributed on PyPI.

Disadvantages

While this approach has many advantages, the main disadvantage is that building Chromium is not for the faint of heart. Even on powerful workstations, downloading and building the Chromium code base takes 50+ GB and several hours. On Linux this work can be done once and distributed as a docker container, but we don't have a similar shortcut for Windows and MacOS. Because of this, we're still working on finding a CI solution for MacOS and Windows.

Scope (Plugin) architecture

While motivated by the needs of plotly.py, we made the decision early on to design Kaleido to make it fairly straightforward to add support for additional libraries. Plugins in Kaleido are called "scopes". For more information, see https://github.com/plotly/Kaleido/wiki/Scope-(Plugin)-Architecture.

Language wrapper architecture

While Python is the initial target language for Kaleido, it has been designed to make it fairly straightforward to add support for additional languages. For more information, see https://github.com/plotly/Kaleido/wiki/Language-wrapper-architecture.

Building Kaledo

Instructions for building Kaleido differ slightly across operating systems. All of these approaches assume that the Kaleido repository has been cloned and that the working directory is set to the repository root.

$ git clone git@github.com:plotly/Kaleido.git
$ cd Kaleido

Update version

Before building, generate the version string based on the git commit log with

$ python repos\version\build_pep440_version.py

Linux

There are two approaches to building Kaleido on Linux, both of which rely on Docker.

Method 1

This approach relies on the jonmmease/kaleido-builder docker image, and the scripts in repos/linux_full_scripts, to compile Kaleido. This docker image is over 30GB, but in includes a precompiled instance of the Chromium source tree making it possible to compile Kaleido in just a few 10s of seconds. The downside of this approach is that the chromium source tree is not visible outside of the docker image so it may be difficult for development environments to index it. This is the approach used for Continuous Integration on Linux.

Download docker image

$ docker pull jonmmease/kaleido-builder:0.7

Build Kaleido

$ /repos/linux_full_scripts/build_kaleido

Method 2

This approach relies on the jonmmease/chromium-builder docker image, and the scripts in repos/linux_scripts, to download the chromium source to a local folder and then build it. This takes longer to get set up than method 2 because the Chromium source tree must be compiled from scratch, but it downloads a copy of the chromium source tree to repos/src which makes it possible for development environments like CLion to index the Chromium codebase to provide code completion.

Download docker image

$ docker pull jonmmease/chromium-builder:0.7

Fetch the Chromium codebase and checkout the specific tag, then sync all dependencies

$ /repos/linux_scripts/fetch_chromium

Then build the kaleido application to repos/build/kaleido, and bundle shared libraries and fonts. The input source for this application is stored under repos/kaleido/cc/. The build step will also create the Python wheel under repos/kaleido/py/dist/

$ /repos/linux_scripts/build_kaleido

MacOS

To build on MacOS, first install XCode version 11.0+, nodejs 12, and Python 3. See https://chromium.googlesource.com/chromium/src/+/master/docs/mac_build_instructions.md for more information on build requirements.

Then fetch the chromium codebase

$ /repos/mac_scripts/fetch_chromium

Then build Kaleido to repos/build/kaleido. The build step will also create the Python wheel under repos/kaleido/py/dist/

$ /repos/mac_scripts/build_kaleido

Windows

To build on Windows, first install Visual Studio 2019 (community edition is fine), nodejs 12, and Python 3. See https://chromium.googlesource.com/chromium/src/+/master/docs/windows_build_instructions.md for more information on build requirements.

Then fetch the chromium codebase from a Power Shell command prompt

$ /repos/win_scripts/fetch_chromium.ps1

Then build Kaleido to repos/build/kaleido. The build step will also create the Python wheel under repos/kaleido/py/dist/

$ /repos/mac_scripts/build_kaleido.ps1

Building Docker containers

chromium-builder

The chromium-builder container mostly follows the instructions at https://chromium.googlesource.com/chromium/src/+/master/docs/linux/build_instructions.md to install depot_tools and run install-build-deps.sh to install the required build dependencies the appropriate stable version of Chromium. The image is based on ubuntu 16.04, which is the recommended OS for building Chromium on Linux.

Build container with:

$ docker build -t jonmmease/chromium-builder:0.7 -f repos/linux_scripts/Dockerfile .

kaleido-builder

This container contains a pre-compiled version of chromium source tree. Takes several hours to build!

$ docker build -t jonmmease/kaleido-builder:0.7 -f repos/linux_full_scripts/Dockerfile .

Updating chromium version

To update the version of Chromium in the future, the docker images will need to be updated. Follow the instructions for the DEPOT_TOOLS_COMMIT and CHROMIUM_TAG environment variables in linux_scripts/Dockerfile.

Find a stable chromium version tag from https://chromereleases.googleblog.com/search/label/Desktop%20Update. Look up date of associated tag in GitHub at https://github.com/chromium/chromium/ E.g. Stable chrome version tag on 05/19/2020: 83.0.4103.61, set CHROMIUM_TAG="83.0.4103.61"

Search through depot_tools commitlog (https://chromium.googlesource.com/chromium/tools/depot_tools/+log) for commit hash of commit from the same day. E.g. depot_tools commit hash from 05/19/2020: e67e41a, set DEPOT_TOOLS_COMMIT=e67e41a

The environment variable must also be updated in the repos/linux_scripts/checkout_revision, repos/mac_scripts/fetch_chromium, and repos/win_scripts/fetch_chromium.ps1 scripts.

CMakeLists.txt

The CMakeLists.txt file in repos/ is only there to help IDE's like CLion/KDevelop figure out how to index the chromium source tree. It can't be used to actually build chromium. Using this approach, it's possible to get full completion and code navigation from repos/kaleido/cc/kaleido.cc in CLion.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

kaleido-0.0.1rc5-py2.py3-none-win_amd64.whl (55.3 MB view details)

Uploaded Python 2 Python 3 Windows x86-64

kaleido-0.0.1rc5-py2.py3-none-manylinux2014_x86_64.whl (73.9 MB view details)

Uploaded Python 2 Python 3

kaleido-0.0.1rc5-py2.py3-none-macosx_10_9_x86_64.whl (67.6 MB view details)

Uploaded Python 2 Python 3 macOS 10.9+ x86-64

File details

Details for the file kaleido-0.0.1rc5-py2.py3-none-win_amd64.whl.

File metadata

  • Download URL: kaleido-0.0.1rc5-py2.py3-none-win_amd64.whl
  • Upload date:
  • Size: 55.3 MB
  • Tags: Python 2, Python 3, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.3.1.post20200622 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.10

File hashes

Hashes for kaleido-0.0.1rc5-py2.py3-none-win_amd64.whl
Algorithm Hash digest
SHA256 7d782dea7b8b6e1c2e609ac5a1b57654c09f464cf51917d6cdd52d03e95b8331
MD5 00f64142826f6b03cb724278554fa09b
BLAKE2b-256 6fe1dd3c2a60ba0f995323b980af7b162cadebb4d1db54d9b889dd9e8304ad3c

See more details on using hashes here.

Provenance

File details

Details for the file kaleido-0.0.1rc5-py2.py3-none-manylinux2014_x86_64.whl.

File metadata

  • Download URL: kaleido-0.0.1rc5-py2.py3-none-manylinux2014_x86_64.whl
  • Upload date:
  • Size: 73.9 MB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.3.1.post20200622 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.10

File hashes

Hashes for kaleido-0.0.1rc5-py2.py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5f4486a6512db542c1684dcf02fd7f07619db9862098b85e6706d5e965893b55
MD5 225ea56059df6d15e71702335d2ace9f
BLAKE2b-256 f35f6c2243990cc4fd3c00411d1d77a7a772d5f351b37dc31f763d7e89d620f1

See more details on using hashes here.

Provenance

File details

Details for the file kaleido-0.0.1rc5-py2.py3-none-macosx_10_9_x86_64.whl.

File metadata

  • Download URL: kaleido-0.0.1rc5-py2.py3-none-macosx_10_9_x86_64.whl
  • Upload date:
  • Size: 67.6 MB
  • Tags: Python 2, Python 3, macOS 10.9+ x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.3.1.post20200622 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/3.6.10

File hashes

Hashes for kaleido-0.0.1rc5-py2.py3-none-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 f3031b66892551c6847651c1f0a4bcb18b813506b5d2e2e27672cbd5e9523ed6
MD5 4d7726a1111af4e9ab2d06b7b58c997e
BLAKE2b-256 a35ee93b3a7075a8eb92b540a1f7d58573a4bea85e47566ec6701feec67bda6f

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page