Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.7+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web data extraction code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web-poet-0.12.0.tar.gz (65.6 kB view details)

Uploaded Source

Built Distribution

web_poet-0.12.0-py3-none-any.whl (44.0 kB view details)

Uploaded Python 3

File details

Details for the file web-poet-0.12.0.tar.gz.

File metadata

  • Download URL: web-poet-0.12.0.tar.gz
  • Upload date:
  • Size: 65.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web-poet-0.12.0.tar.gz
Algorithm Hash digest
SHA256 6dc0d1a4f081351268b9d45a0dddfb93bf4ba3363458e81c87660878788cc368
MD5 4fee142bc905f04fd8691c781e2d1bd2
BLAKE2b-256 5d09c75d0bb828cf8f45864bf8a9fc94a7763eb6410be18358f085c743488e15

See more details on using hashes here.

Provenance

File details

Details for the file web_poet-0.12.0-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.12.0-py3-none-any.whl
  • Upload date:
  • Size: 44.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web_poet-0.12.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d59a81d94ef7c51322c5847b3b4ece89316b0d7c33b4b6470d1e79931841c46a
MD5 55e627385bfac11e6ce17ff673b33aed
BLAKE2b-256 24e7fbf1f419d20fcf89e5be12b8e40bca15e537363ef45e3ea8796594f0ab14

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page