Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Build Status Coverage report Documentation Status

web-poet is a Python 3.7+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web data extraction code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web-poet-0.7.0.tar.gz (53.6 kB view details)

Uploaded Source

Built Distribution

web_poet-0.7.0-py3-none-any.whl (39.3 kB view details)

Uploaded Python 3

File details

Details for the file web-poet-0.7.0.tar.gz.

File metadata

  • Download URL: web-poet-0.7.0.tar.gz
  • Upload date:
  • Size: 53.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for web-poet-0.7.0.tar.gz
Algorithm Hash digest
SHA256 db3fef901a4cb3d29096e00c81e656816e07d182c4b314cbf2a866b274e90574
MD5 c6d9f393af0977d388f40c24705d667b
BLAKE2b-256 72d336fe82ac2d58b26abe36888754f74f6334a46cbb7c56227aa94bb7a9f0fd

See more details on using hashes here.

Provenance

File details

Details for the file web_poet-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 39.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for web_poet-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7d6bba33158a096c9d5ce1bb27e547635940ce20acacf93de49329149415e8dd
MD5 36ac7d53a8bb30e0116313c3763103ed
BLAKE2b-256 327edd072d0d5abf25c1b89c3ea4635073507b4daf9d8d22e42f5a63ef85719f

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page