A JIT implementation for Marshmallow to speed up dumping and loading objects.
Project description
*************************************************************
:fire:toastedmarshmallow:fire:: Makes Marshmallow Toasty Fast
*************************************************************
Toasted Marshmallow implements a JIT for marshmallow that speeds up dumping
objects 10-25X (depending on your schema). Toasted Marshmallow allows you to
have the great API that
`Marshmallow <https://github.com/marshmallow-code/marshmallow>`_ provides
without having to sacrifice performance!
::
Benchmark Result:
Original Time: 2682.61 usec/dump
Optimized Time: 176.38 usec/dump
Optimized (Cython) Time: 125.77 usec/dump
Speed up: 15.21x
Cython Speed up: 21.33x
Even ``PyPy`` benefits from ``toastedmarshmallow``!
::
Benchmark Result:
Original Time: 189.78 usec/dump
Optimized Time: 20.03 usec/dump
Speed up: 9.48x
Installing toastedmarshmallow
-----------------------------
.. code-block:: bash
pip install toastedmarshmallow
This will *also* install a slightly-forked ``marshmallow`` that includes some
hooks Toastedmarshmallow needs enable the JIT to run before falling back
to the original marshmallow code. These changes are minimal making it easier
to track upstream. You can find the changes
`Here <https://github.com/marshmallow-code/marshmallow/pull/629/files>`_.
This means you should **remove** ``marshmallow`` from your requirements and
replace it with ``toastedmarshmallow``. By default there is no
difference unless you explicitly enable Toasted Marshmallow.
Enabling Toasted Marshmallow
----------------------------
Enabling Toasted Marshmallow on an existing Schema is just one line of code,
set the ``jit`` property on any ``Schema`` instance to
``toastedmarshmallow.Jit``. For example:
.. code-block:: python
from datetime import date
import toastedmarshmallow
from marshmallow import Schema, fields, pprint
class ArtistSchema(Schema):
name = fields.Str()
class AlbumSchema(Schema):
title = fields.Str()
release_date = fields.Date()
artist = fields.Nested(ArtistSchema())
schema = AlbumSchema()
# Specify the jit method as toastedmarshmallow's jit
schema.jit = toastedmarshmallow.Jit
# And that's it! Your dump methods are 15x faster!
It's also possible to use the ``Meta`` class on the ``Marshmallow`` schema
to specify all instances of a given ``Schema`` should be optimized:
.. code-block:: python
import toastedmarshmallow
from marshmallow import Schema, fields, pprint
class ArtistSchema(Schema):
class Meta:
jit = toastedMarshmallow.Jit
name = fields.Str()
You can also enable Toasted Marshmallow globally by setting the environment
variable ``MARSHMALLOW_SCHEMA_DEFAULT_JIT`` to ``toastedmarshmallow.Jit`` .
Future versions of Toasted Marshmallow may make this the default.
How it works
------------
Toasted Marshmallow works by generating code at runtime to optimize dumping
objects without going through layers and layers of reflection. The generated
code optimistically assumes the objects being passed in are schematically valid,
falling back to the original marshmallow code on failure.
For example, taking ``AlbumSchema`` from above, Toastedmarshmallow will
generate the following 3 methods:
.. code-block:: python
def InstanceSerializer(obj):
res = {}
value = obj.release_date; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
value = obj.artist; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
value = obj.title; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
def DictSerializer(obj):
res = {}
if "release_date" in obj:
value = obj["release_date"]; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
if "artist" in obj:
value = obj["artist"]; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
if "title" in obj:
value = obj["title"]; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
def HybridSerializer(obj):
res = {}
try:
value = obj["release_date"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.release_date
value = value; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
try:
value = obj["artist"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.artist
value = value; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
try:
value = obj["title"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.title
value = value; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
Toastedmarshmallow will invoke the proper serializer based upon the input.
Since Toastedmarshmallow is generating code at runtime, it's critical you
re-use Schema objects. If you're creating a new Schema object every time you
serialize/deserialize an object you'll likely have much worse performance.
:zap::microscope: Experimental :microscope::zap:
--------------------------------------------------
Toastedmarshmallow also has an experimental Cython based jit. It takes the
generated code above and runs it through Cython first, getting another 1.5x
win. Generally the generated Python code is fast enough, but this is a useful
option when you've got to squeeze out every last bit of performance.
To use the Cython jit, replace `Jit` with `CythonJit`:
.. code-block:: python
schema.jit = toastedmarshmallow.CythonJit
:fire:toastedmarshmallow:fire:: Makes Marshmallow Toasty Fast
*************************************************************
Toasted Marshmallow implements a JIT for marshmallow that speeds up dumping
objects 10-25X (depending on your schema). Toasted Marshmallow allows you to
have the great API that
`Marshmallow <https://github.com/marshmallow-code/marshmallow>`_ provides
without having to sacrifice performance!
::
Benchmark Result:
Original Time: 2682.61 usec/dump
Optimized Time: 176.38 usec/dump
Optimized (Cython) Time: 125.77 usec/dump
Speed up: 15.21x
Cython Speed up: 21.33x
Even ``PyPy`` benefits from ``toastedmarshmallow``!
::
Benchmark Result:
Original Time: 189.78 usec/dump
Optimized Time: 20.03 usec/dump
Speed up: 9.48x
Installing toastedmarshmallow
-----------------------------
.. code-block:: bash
pip install toastedmarshmallow
This will *also* install a slightly-forked ``marshmallow`` that includes some
hooks Toastedmarshmallow needs enable the JIT to run before falling back
to the original marshmallow code. These changes are minimal making it easier
to track upstream. You can find the changes
`Here <https://github.com/marshmallow-code/marshmallow/pull/629/files>`_.
This means you should **remove** ``marshmallow`` from your requirements and
replace it with ``toastedmarshmallow``. By default there is no
difference unless you explicitly enable Toasted Marshmallow.
Enabling Toasted Marshmallow
----------------------------
Enabling Toasted Marshmallow on an existing Schema is just one line of code,
set the ``jit`` property on any ``Schema`` instance to
``toastedmarshmallow.Jit``. For example:
.. code-block:: python
from datetime import date
import toastedmarshmallow
from marshmallow import Schema, fields, pprint
class ArtistSchema(Schema):
name = fields.Str()
class AlbumSchema(Schema):
title = fields.Str()
release_date = fields.Date()
artist = fields.Nested(ArtistSchema())
schema = AlbumSchema()
# Specify the jit method as toastedmarshmallow's jit
schema.jit = toastedmarshmallow.Jit
# And that's it! Your dump methods are 15x faster!
It's also possible to use the ``Meta`` class on the ``Marshmallow`` schema
to specify all instances of a given ``Schema`` should be optimized:
.. code-block:: python
import toastedmarshmallow
from marshmallow import Schema, fields, pprint
class ArtistSchema(Schema):
class Meta:
jit = toastedMarshmallow.Jit
name = fields.Str()
You can also enable Toasted Marshmallow globally by setting the environment
variable ``MARSHMALLOW_SCHEMA_DEFAULT_JIT`` to ``toastedmarshmallow.Jit`` .
Future versions of Toasted Marshmallow may make this the default.
How it works
------------
Toasted Marshmallow works by generating code at runtime to optimize dumping
objects without going through layers and layers of reflection. The generated
code optimistically assumes the objects being passed in are schematically valid,
falling back to the original marshmallow code on failure.
For example, taking ``AlbumSchema`` from above, Toastedmarshmallow will
generate the following 3 methods:
.. code-block:: python
def InstanceSerializer(obj):
res = {}
value = obj.release_date; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
value = obj.artist; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
value = obj.title; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
def DictSerializer(obj):
res = {}
if "release_date" in obj:
value = obj["release_date"]; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
if "artist" in obj:
value = obj["artist"]; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
if "title" in obj:
value = obj["title"]; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
def HybridSerializer(obj):
res = {}
try:
value = obj["release_date"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.release_date
value = value; value = value() if callable(value) else value; res["release_date"] = _field_release_date__serialize(value, "release_date", obj)
try:
value = obj["artist"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.artist
value = value; value = value() if callable(value) else value; res["artist"] = _field_artist__serialize(value, "artist", obj)
try:
value = obj["title"]
except (KeyError, AttributeError, IndexError, TypeError):
value = obj.title
value = value; value = value() if callable(value) else value; value = str(value) if value is not None else None; res["title"] = value
return res
Toastedmarshmallow will invoke the proper serializer based upon the input.
Since Toastedmarshmallow is generating code at runtime, it's critical you
re-use Schema objects. If you're creating a new Schema object every time you
serialize/deserialize an object you'll likely have much worse performance.
:zap::microscope: Experimental :microscope::zap:
--------------------------------------------------
Toastedmarshmallow also has an experimental Cython based jit. It takes the
generated code above and runs it through Cython first, getting another 1.5x
win. Generally the generated Python code is fast enough, but this is a useful
option when you've got to squeeze out every last bit of performance.
To use the Cython jit, replace `Jit` with `CythonJit`:
.. code-block:: python
schema.jit = toastedmarshmallow.CythonJit
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
toastedmarshmallow-0.2.0.tar.gz
(59.7 kB
view details)
Built Distribution
File details
Details for the file toastedmarshmallow-0.2.0.tar.gz
.
File metadata
- Download URL: toastedmarshmallow-0.2.0.tar.gz
- Upload date:
- Size: 59.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c844c57c89498114e6888543bf483018cafd183675193176b75eace63d505a6b |
|
MD5 | 228a8ec5d54ad76c23ceaab4cbe269a3 |
|
BLAKE2b-256 | 9422863fd4584dead948b4dc45f86b2727efdf01dbbdfdda0a2cd3c4f0601598 |
File details
Details for the file toastedmarshmallow-0.2.0-py2.py3-none-any.whl
.
File metadata
- Download URL: toastedmarshmallow-0.2.0-py2.py3-none-any.whl
- Upload date:
- Size: 65.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5f0f24792198eaf52619aaba27a06ff44d17ee41a054bc92d860e464eab52bf7 |
|
MD5 | 85312407587db39c50e886b13fa41a5b |
|
BLAKE2b-256 | 0bf20bd41df7a818111e300392e26844572300acf5b155bd9801f29b8dad04c9 |