PDF generation in python using wkhtmltopdf suitable for heroku

PDF generation in python using wkhtmltopdf.

The wkhtmltopdf binaries are precompiled and included in the package, which makes pydf easier to use. In particular, this means pydf works on Heroku.

pydf currently uses wkhtmltopdf 0.12.4 (with patched Qt) and requires Python 3.6+.

Install

pip install python-pdf

For python 2 use pip install python-pdf==0.30.0.

Basic Usage

import pydf
pdf = pydf.generate_pdf('<h1>this is html</h1>')
with open('test_doc.pdf', 'wb') as f:
    f.write(pdf)

Async Usage

Generating many documents with wkhtmltopdf can be slow because wkhtmltopdf can only generate one document per process. To get around this, pydf uses Python 3's asyncio create_subprocess_exec to generate multiple PDFs at the same time, so the time taken to spin up processes doesn't slow you down.

import asyncio
from pathlib import Path
from pydf import AsyncPydf

async def generate_async():
    apydf = AsyncPydf()

    async def gen(i):
        # each call runs wkhtmltopdf in its own subprocess
        pdf_content = await apydf.generate_pdf('<h1>this is html</h1>')
        Path(f'output_{i:03}.pdf').write_bytes(pdf_content)

    # run all 50 generations concurrently
    coros = [gen(i) for i in range(50)]
    await asyncio.gather(*coros)

loop = asyncio.get_event_loop()
loop.run_until_complete(generate_async())

See benchmarks/run.py for a full example.

Locally generating an entire invoice goes from 0.372s/pdf to 0.035s/pdf with the async model.
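
The concurrency model described in this section can be sketched with the standard library alone. The snippet below is only an illustration, not pydf's code: it uses the current Python interpreter (sys.executable) as a stand-in for the wkhtmltopdf binary, and run_one and run_many are hypothetical names.

```python
import asyncio
import sys


async def run_one(i):
    # Stand-in for a wkhtmltopdf call: a child Python process that prints its index.
    proc = await asyncio.create_subprocess_exec(
        sys.executable, '-c', f'print({i})',
        stdout=asyncio.subprocess.PIPE,
    )
    stdout, _ = await proc.communicate()
    return stdout.strip()


async def run_many(n):
    # gather() runs the coroutines concurrently, so the child processes overlap,
    # and returns results in the order the coroutines were passed in.
    return await asyncio.gather(*(run_one(i) for i in range(n)))


if __name__ == '__main__':
    print(asyncio.run(run_many(5)))
```

Because each child process is awaited rather than blocked on, process start-up for one document overlaps with the work of the others. Note that asyncio.run requires Python 3.7+.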

API

generate_pdf(source, [**kwargs])

Generate a PDF from either a URL or an HTML string.

After the source argument (an HTML string or a URL), all other arguments are passed straight to wkhtmltopdf.

For details on extra arguments, see the output of get_help() and get_extended_help().

All arguments, whether specified explicitly or caught with extra_kwargs, are converted to command line args with '--' + original_name.replace('_', '-').

Arguments which are True are passed with no value, e.g. just --quiet. False and None arguments are omitted. Everything else is passed with str(value).
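
The conversion rules above can be sketched as a small standalone function. This is an illustration of the described behaviour only, not pydf's actual implementation, and to_cli_args is a hypothetical name.

```python
def to_cli_args(**kwargs):
    """Turn keyword arguments into wkhtmltopdf-style command line args."""
    args = []
    for name, value in kwargs.items():
        # False and None arguments are omitted entirely.
        if value is None or value is False:
            continue
        flag = '--' + name.replace('_', '-')
        if value is True:
            # True arguments are passed as a bare flag with no value.
            args.append(flag)
        else:
            # Everything else is passed as the flag followed by str(value).
            args.extend([flag, str(value)])
    return args


if __name__ == '__main__':
    print(to_cli_args(quiet=True, grayscale=False, margin_top='10mm', image_dpi=600))
```

The identity checks (is True, is False) keep integer values such as 0 or 1 from being mistaken for booleans.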

Arguments:

  • source: html string to generate pdf from or url to get

  • quiet: bool

  • grayscale: bool

  • lowquality: bool

  • margin_bottom: string eg. 10mm

  • margin_left: string eg. 10mm

  • margin_right: string eg. 10mm

  • margin_top: string eg. 10mm

  • orientation: Portrait or Landscape

  • page_height: string eg. 10mm

  • page_width: string eg. 10mm

  • page_size: string: A4, Letter, etc.

  • image_dpi: int default 600

  • image_quality: int default 94

  • extra_kwargs: any exotic extra options for wkhtmltopdf

Returns the PDF content as bytes, ready to be written to a file opened in binary mode.
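
As a rough sketch of how the arguments listed above might be combined, the snippet below collects a few of them in a dict and passes them to generate_pdf. The option values are arbitrary examples, and PDF_OPTIONS and write_report are hypothetical names that are not part of pydf.

```python
# Options named as in the argument list above; the values are only examples.
PDF_OPTIONS = {
    'page_size': 'A4',
    'orientation': 'Landscape',
    'margin_top': '10mm',
    'margin_bottom': '10mm',
    'grayscale': True,
}


def write_report(html, path):
    """Hypothetical helper: render html with PDF_OPTIONS and save it to path."""
    # Imported here so the module can be loaded even where pydf is not installed.
    import pydf

    pdf = pydf.generate_pdf(html, **PDF_OPTIONS)
    with open(path, 'wb') as f:
        f.write(pdf)


if __name__ == '__main__':
    write_report('<h1>this is html</h1>', 'report.pdf')
```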

get_version()

Get version of pydf and wkhtmltopdf binary

get_help()

Get the help string from the wkhtmltopdf binary. Uses the -h command line option.

get_extended_help()

Get the extended help string from the wkhtmltopdf binary. Uses the -H command line option.

execute_wk(*args)

Low level function to call wkhtmltopdf. Arguments are appended to the wkhtmltopdf binary path and passed to subprocess with no processing.
