Smart text extraction from PDF documents

These details have not been verified by PyPI

Project links

Project description

Tests

EDS-PDF

EDS-PDF provides modular framework to extract text from PDF documents.

You can use it out-of-the-box, or extend it to fit your use-case.

Getting started

Install the library with pip:

$ pip install edspdf

Visit the documentation for more information!

Citation

If you use EDS-PDF, please cite us as below.

@software{edspdf,
  author  = {Dura, Basile and Wajsburt, Perceval and Calliger, Alice and Gérardin, Christel and Bey, Romain},
  doi     = {10.5281/zenodo.6902977},
  license = {BSD-3-Clause},
  title   = {{EDS-PDF: Smart text extraction from PDF documents}},
  url     = {https://github.com/aphp/edspdf}
}

Acknowledgement

We would like to thank Assistance Publique – Hôpitaux de Paris and AP-HP Foundation for funding this project.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.9.1

Mar 19, 2024

0.9.0

Feb 26, 2024

0.8.1

Sep 26, 2023

0.8.0

Sep 7, 2023

This version

0.7.0

Jun 9, 2023

0.5.3

Aug 31, 2022

0.5.2

Aug 30, 2022

0.5.1

Jul 26, 2022

0.5.0

Jul 25, 2022

0.5.0b0 pre-release

Jul 25, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edspdf-0.7.0.tar.gz (66.9 kB view details)

Uploaded Jun 9, 2023 Source

File details

Details for the file edspdf-0.7.0.tar.gz.

File metadata

Download URL: edspdf-0.7.0.tar.gz
Upload date: Jun 9, 2023
Size: 66.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for edspdf-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`698329b5221dc17cc25428655f3e284bf85c2b2caaab8c74ae38e419a9c58133`
MD5	`5e47202698e5d1a1c8e3908fd0a337f8`
BLAKE2b-256	`8b7dff5369d21663d9e0e644bea890df2ea7c9ee8ededb2a17dd6dd644908f10`