Library to convert rich text from Draft.js raw ContentState to HTML
Project description
Draft.js exporter
Library to convert rich text from Draft.js raw ContentState to HTML.
It is developed alongside the Draftail rich text editor, for Wagtail. Check out the online demo, and our introductory blog post.
Why
Draft.js is a rich text editor framework for React. Its approach is different from most rich text editors because it does not store data as HTML, but rather in its own representation called ContentState. This exporter is useful when the ContentState to HTML conversion has to be done in a Python ecosystem.
The initial use case was to gain more control over the content managed by rich text editors in a Wagtail/Django site. If you want to read the full story, have a look at our blog post: Rethinking rich text pipelines with Draft.js.
Features
This project adheres to Semantic Versioning, and measures performance and code coverage. Code is checked with mypy.
- Extensive configuration of the generated HTML.
- Default, extensible block & inline style maps for common HTML elements.
- Convert line breaks to
<br>
elements. - Define any attribute in the block map – custom class names for elements.
- React-like API to create custom components.
- Automatic conversion of entity data to HTML attributes (int & boolean to string, style object to style string).
- Nested lists (
<li>
elements go inside<ul>
or<ol>
, with multiple levels). - Output inline styles as inline elements (
<em>
,<strong>
, pick and choose, with any attribute). - Overlapping inline style and entity ranges.
- Python 3.5+ type annotations.
Usage
Draft.js stores data in a JSON representation based on blocks, representing lines of content in the editor, annotated with entities and styles to represent rich text. For more information, this article covers the concepts further.
Getting started
This exporter takes the Draft.js ContentState data as input, and outputs HTML based on its configuration. To get started, install the package:
pip install draftjs_exporter
We support the following Python versions: 3.5, 3.6, 3.7, 3.8. For Python 2.7 and 3.4 support, please use v2.1.7 of the exporter.
In your code, create an exporter and use the render
method to create HTML:
from draftjs_exporter.dom import DOM
from draftjs_exporter.html import HTML
# Configuration options are detailed below.
config = {}
# Initialise the exporter.
exporter = HTML(config)
# Render a Draft.js `contentState`
html = exporter.render({
'entityMap': {},
'blocks': [{
'key': '6mgfh',
'text': 'Hello, world!',
'type': 'unstyled',
'depth': 0,
'inlineStyleRanges': [],
'entityRanges': []
}]
})
print(html)
You can also run an example by downloading this repository and then using python example.py
, or by using our online Draft.js demo.
Configuration
The exporter output is extensively configurable to cater for varied rich text requirements.
# draftjs_exporter provides default configurations and predefined constants for reuse.
from draftjs_exporter.constants import BLOCK_TYPES, ENTITY_TYPES
from draftjs_exporter.defaults import BLOCK_MAP, STYLE_MAP
from draftjs_exporter.dom import DOM
config = {
# `block_map` is a mapping from Draft.js block types to a definition of their HTML representation.
# Extend BLOCK_MAP to start with sane defaults, or make your own from scratch.
'block_map': dict(BLOCK_MAP, **{
# The most basic mapping format, block type to tag name.
BLOCK_TYPES.HEADER_TWO: 'h2',
# Use a dict to define props on the block.
BLOCK_TYPES.HEADER_THREE: {'element': 'h3', 'props': {'class': 'u-text-center'}},
# Add a wrapper (and wrapper_props) to wrap adjacent blocks.
BLOCK_TYPES.UNORDERED_LIST_ITEM: {
'element': 'li',
'wrapper': 'ul',
'wrapper_props': {'class': 'bullet-list'},
},
# Use a custom component for more flexibility (reading block data or depth).
BLOCK_TYPES.BLOCKQUOTE: blockquote,
BLOCK_TYPES.ORDERED_LIST_ITEM: {
'element': list_item,
'wrapper': ordered_list,
},
# Provide a fallback component (advanced).
BLOCK_TYPES.FALLBACK: block_fallback
}),
# `style_map` defines the HTML representation of inline elements.
# Extend STYLE_MAP to start with sane defaults, or make your own from scratch.
'style_map': dict(STYLE_MAP, **{
# Use the same mapping format as in the `block_map`.
'KBD': 'kbd',
# The `style` prop can be defined as a dict, that will automatically be converted to a string.
'HIGHLIGHT': {'element': 'strong', 'props': {'style': {'textDecoration': 'underline'}}},
# Provide a fallback component (advanced).
INLINE_STYLES.FALLBACK: style_fallback,
}),
'entity_decorators': {
# Map entities to components so they can be rendered with their data.
ENTITY_TYPES.IMAGE: image,
ENTITY_TYPES.LINK: link
# Lambdas work too.
ENTITY_TYPES.HORIZONTAL_RULE: lambda props: DOM.create_element('hr'),
# Discard those entities.
ENTITY_TYPES.EMBED: None,
# Provide a fallback component (advanced).
ENTITY_TYPES.FALLBACK: entity_fallback,
},
'composite_decorators': [
# Use composite decorators to replace text based on a regular expression.
{
'strategy': re.compile(r'\n'),
'component': br,
},
{
'strategy': re.compile(r'#\w+'),
'component': hashtag,
},
{
'strategy': LINKIFY_RE,
'component': linkify,
},
],
}
See examples.py for more details.
Advanced usage
Custom components
To generate arbitrary markup with dynamic data, the exporter comes with an API to create rendering components. This API mirrors React’s createElement API (what JSX compiles to).
# All of the API is available from a single `DOM` namespace
from draftjs_exporter.dom import DOM
# Components are simple functions that take `props` as parameter and return DOM elements.
def image(props):
# This component creates an image element, with the relevant attributes.
return DOM.create_element('img', {
'src': props.get('src'),
'width': props.get('width'),
'height': props.get('height'),
'alt': props.get('alt'),
})
def blockquote(props):
# This component uses block data to render a blockquote.
block_data = props['block']['data']
# Here, we want to display the block's content so we pass the `children` prop as the last parameter.
return DOM.create_element('blockquote', {
'cite': block_data.get('cite')
}, props['children'])
def button(props):
href = props.get('href', '#')
icon_name = props.get('icon', None)
text = props.get('text', '')
return DOM.create_element('a', {
'class': 'icon-text' if icon_name else None,
'href': href,
},
# There can be as many `children` as required.
# It is also possible to reuse other components and render them instead of HTML tags.
DOM.create_element(icon, {'name': icon_name}) if icon_name else None,
DOM.create_element('span', {'class': 'icon-text'}, text) if icon_name else text
)
Apart from create_element
, a parse_html
method is also available. Use it to interface with other HTML generators, like template engines.
See examples.py
in the repository for more details.
Fallback components
When dealing with changes in the content schema, as part of ongoing development or migrations, some content can go stale. To solve this, the exporter allows the definition of fallback components for blocks, styles, and entities. This feature is only used for development at the moment, if you have a use case for this in production we would love to hear from you. Please get in touch!
Add the following to the exporter config,
config = {
'block_map': dict(BLOCK_MAP, **{
# Provide a fallback for block types.
BLOCK_TYPES.FALLBACK: block_fallback
}),
}
This fallback component can now control the exporter behavior when normal components are not found. Here is an example:
def block_fallback(props):
type_ = props['block']['type']
if type_ == 'example-discard':
logging.warn('Missing config for "%s". Discarding block, keeping content.' % type_)
# Directly return the block's children to keep its content.
return props['children']
elif type_ == 'example-delete':
logging.error('Missing config for "%s". Deleting block.' % type_)
# Return None to not render anything, removing the whole block.
return None
else:
logging.warn('Missing config for "%s". Using div instead.' % type_)
# Provide a fallback.
return DOM.create_element('div', {}, props['children'])
See examples.py
in the repository for more details.
Alternative backing engines
By default, the exporter uses a dependency-free engine called string
to build the DOM tree. There are two alternative backing engines: html5lib
(via BeautifulSoup) and lxml
.
The string
engine is the fastest, and does not have any dependencies. Its only drawback is that the parse_html
method does not escape/sanitise HTML like that of other engines.
- For
html5lib
, dopip install draftjs_exporter[html5lib]
. - For
lxml
, dopip install draftjs_exporter[lxml]
. It also requireslibxml2
andlibxslt
to be available on your system.
Then, use the engine
attribute of the exporter config:
config = {
# Specify which DOM backing engine to use.
'engine': DOM.HTML5LIB,
# Or for lxml:
'engine': DOM.LXML,
}
Custom backing engines
The exporter supports using custom engines to generate its output via the DOM
API. This can be useful to implement custom export formats, e.g. to Markdown (experimental).
Here is an example implementation:
from draftjs_exporter import DOMEngine
class DOMListTree(DOMEngine):
"""
Element tree using nested lists.
"""
@staticmethod
def create_tag(t, attr=None):
return [t, attr, []]
@staticmethod
def append_child(elt, child):
elt[2].append(child)
@staticmethod
def render(elt):
return elt
exporter = HTML({
# Use the dotted module syntax to point to the DOMEngine implementation.
'engine': 'myproject.example.DOMListTree'
})
Type annotations
The exporter’s codebase is typed with annotations from the Python 3.5+ standard library, checked with mypy. Reusable types are made available:
from draftjs_exporter.dom import DOM
from draftjs_exporter.types import Element, Props
# Components are simple functions that take `props` as parameter and return DOM elements.
def image(props: Props) -> Element:
# This component creates an image element, with the relevant attributes.
return DOM.create_element('img', {
'src': props.get('src'),
'width': props.get('width'),
'height': props.get('height'),
'alt': props.get('alt'),
})
Contributing
See anything you like in here? Anything missing? We welcome all support, whether on bug reports, feature requests, code, design, reviews, tests, documentation, and more. Please have a look at our contribution guidelines.
If you just want to set up the project on your own computer, the contribution guidelines also contain all of the setup commands.
Credits
This project is made possible by the work of Springload, a New Zealand digital agency, and. The beautiful demo site is the work of @thibaudcolas.
View the full list of contributors. MIT licensed.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file draftjs_exporter-3.0.0.tar.gz
.
File metadata
- Download URL: draftjs_exporter-3.0.0.tar.gz
- Upload date:
- Size: 30.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e98d3ff0af0974c3f3640145e194e5fb0ae2aa8a9ddc403e5fd9c9c0200bc176 |
|
MD5 | c35c0d63b890c2d2eca15a8a3f7d4196 |
|
BLAKE2b-256 | cb891102c688c0783bee5882e68a227abde7374e839bbd37f1f5ac259cbf7b23 |