Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.7+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web data extraction code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web-poet-0.11.0.tar.gz (64.8 kB view details)

Uploaded Source

Built Distribution

web_poet-0.11.0-py3-none-any.whl (43.3 kB view details)

Uploaded Python 3

File details

Details for the file web-poet-0.11.0.tar.gz.

File metadata

  • Download URL: web-poet-0.11.0.tar.gz
  • Upload date:
  • Size: 64.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web-poet-0.11.0.tar.gz
Algorithm Hash digest
SHA256 a23dd2fec0d0844dbe036b2eebe0ae366ca5482caf940270aa146f643fb0a47f
MD5 60ddaf85c6e00f18f5fd6932a162cdda
BLAKE2b-256 b558fdc429260237cc9896dcc84577b59815b532fbc1606423de810002964eba

See more details on using hashes here.

Provenance

File details

Details for the file web_poet-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 43.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web_poet-0.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cc54d1989186af2523a271bf8cea0b24e11ff7868988a756ebda1a70ef0fb4e7
MD5 2c33bda0ca6bf2312bcaa94459b41bfa
BLAKE2b-256 dc9ccbe2cf735909be609bf952e51bb0d95409bee19d81ce19370ac3f4408bbd

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page