Skip to main content

API to extract content from HTML & XML documents

Project description

Selection Documenation

Travis CI Test coverage

API to query DOM tree of HTML/XML document.

Usage Example

from selection import XpathSelector
from lxml.html import fromstring

html = '<div><h1>test</h1><ul id="items"><li>1</li><li>2</li></ul></div>'
sel = XpathSelector(fromstring(html))
print(sel.select('//h1')).text()
print(sel.select('//li').text_list()
print(sel.select('//ul').attr('id')

Installation

Run: pip install -U selection

Community

Telegram English chat: https://t.me/grablab

Telegram Russian chat: https://t.me/grablab_ru

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

selection-0.0.17.tar.gz (8.9 kB view details)

Uploaded Source

File details

Details for the file selection-0.0.17.tar.gz.

File metadata

  • Download URL: selection-0.0.17.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for selection-0.0.17.tar.gz
Algorithm Hash digest
SHA256 aaceebb623e4b329b96f361da58737ff054ddb74df1ca2351c8c8bf2bff554c5
MD5 502325d6e0d8d7df2e884dc92e30fc42
BLAKE2b-256 1b92fc5f454adc673a18c8b8ddaab43a779a04cdad7fcbbc63f19140ea150491

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page