Skip to main content

PDF generation in python using wkhtmltopdf suitable for heroku

Project description

BuildStatus codecov PyPI license

PDF generation in python using wkhtmltopdf.

Wkhtmltopdf binaries are precompiled and included in the package making pydf easier to use, in particular this means pydf works on heroku.

Currently using wkhtmltopdf 0.12.4 (with patched qt), requires Python 3.6+.

Install

pip install python-pdf

For python 2 use pip install python-pdf==0.30.0.

Basic Usage

import pydf
pdf = pydf.generate_pdf('<h1>this is html</h1>')
with open('test_doc.pdf', 'wb') as f:
    f.write(pdf)

Async Usage

Generation of lots of documents with wkhtmltopdf can be slow as wkhtmltopdf can only generate one document per process. To get round this pydf uses python 3’s asyncio create_subprocess_exec to generate multiple pdfs at the same time. Thus the time taken to spin up processes doesn’t slow you down.

from pathlib import Path
from pydf import AsyncPydf

async def generate_async():
    apydf = AsyncPydf()

    async def gen(i):
        pdf_content = await apydf.generate_pdf('<h1>this is html</h1>')
        Path(f'output_{i:03}.pdf').write_bytes(pdf_content)

    coros = [gen(i) for i in range(50)]
    await asyncio.gather(*coros)

loop = asyncio.get_event_loop()
loop.run_until_complete(generate_async())

See benchmarks/run.py for a full example.

Locally generating an entire invoice goes from 0.372s/pdf to 0.035s/pdf with the async model.

API

generate_pdf(source, [**kwargs])

Generate a pdf from either a url or a html string.

After the html and url arguments all other arguments are passed straight to wkhtmltopdf

For details on extra arguments see the output of get_help() and get_extended_help()

All arguments whether specified or caught with extra_kwargs are converted to command line args with '--' + original_name.replace('_', '-').

Arguments which are True are passed with no value eg. just –quiet, False and None arguments are missed, everything else is passed with str(value).

Arguments:

  • source: html string to generate pdf from or url to get

  • quiet: bool

  • grayscale: bool

  • lowquality: bool

  • margin_bottom: string eg. 10mm

  • margin_left: string eg. 10mm

  • margin_right: string eg. 10mm

  • margin_top: string eg. 10mm

  • orientation: Portrait or Landscape

  • page_height: string eg. 10mm

  • page_width: string eg. 10mm

  • page_size: string: A4, Letter, etc.

  • image_dpi: int default 600

  • image_quality: int default 94

  • extra_kwargs: any exotic extra options for wkhtmltopdf

Returns string representing pdf

get_version()

Get version of pydf and wkhtmltopdf binary

get_help()

get help string from wkhtmltopdf binary uses -h command line option

get_extended_help()

get extended help string from wkhtmltopdf binary uses -H command line option

execute_wk(*args)

Low level function to call wkhtmltopdf, arguments are added to wkhtmltopdf binary and passed to subprocess with not processing.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python-pdf-0.33.tar.gz (15.3 MB view details)

Uploaded Source

Built Distribution

python_pdf-0.33-py36-none-any.whl (15.3 MB view details)

Uploaded Python 3.6

File details

Details for the file python-pdf-0.33.tar.gz.

File metadata

  • Download URL: python-pdf-0.33.tar.gz
  • Upload date:
  • Size: 15.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for python-pdf-0.33.tar.gz
Algorithm Hash digest
SHA256 54cc351cf11297ca698b72bfa558af41922052e99efea55b704ffb2c8b89a1e4
MD5 3b646015c54001bdc56c5c3ee23750c6
BLAKE2b-256 bbd4ba23747557ffc95e8b99fe599dd21e82422da263b5d69ffb6b8065a014b4

See more details on using hashes here.

Provenance

File details

Details for the file python_pdf-0.33-py36-none-any.whl.

File metadata

File hashes

Hashes for python_pdf-0.33-py36-none-any.whl
Algorithm Hash digest
SHA256 907ddd8e3baaee72492b4cc0a4df040039dfbc5c9f86339223af5128afc51f63
MD5 f4496c37972ff84f9d5eae49c6d2e4db
BLAKE2b-256 06e2bc3cdd001569d7c334a3d757c13f026a61ddade08f1d247130ba251e3aba

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page