Parsel is a library to extract data from HTML and XML using XPath and CSS selectors.

Project description

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul>
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re(r'\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
        print(e.xpath('.//a/@href').extract_first())
http://example.com
http://scrapy.org

History

1.0.0 (2015-08-22)

  • Documentation fixes

0.9.6 (2015-08-14)

  • Updated documentation

  • Extended test coverage

0.9.5 (2015-08-11)

  • Support for extending SelectorList

0.9.4 (2015-08-10)

  • Try workaround for travis-ci/dpl#253

0.9.3 (2015-08-07)

  • Add base_url argument

0.9.2 (2015-08-07)

  • Rename module unified -> selector and promote root attribute

  • Add create_root_node function

0.9.1 (2015-08-04)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Download files

Source Distribution

parsel-1.0.0.tar.gz (27.1 kB)

Uploaded Source

Built Distribution

parsel-1.0.0-py2.py3-none-any.whl (9.1 kB)

Uploaded Python 2 Python 3

File details

Details for the file parsel-1.0.0.tar.gz.

File metadata

  • Download URL: parsel-1.0.0.tar.gz
  • Size: 27.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-1.0.0.tar.gz
Algorithm    Hash digest
SHA256       1d4c48bf4b574d415a53b940f8e5414b11ff0b54552dc32a44a9b93992836597
MD5          50ff63853dba51a331f0df2803471a52
BLAKE2b-256  36597ef189655fb4f9d38eb7b24982eb0a7dcd4256c78a055bd803d24cd5ef06
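
A downloaded archive can be checked against the published digest before installing it. The sketch below uses only the standard library. The function names `sha256_of_file` and `matches_published_hash` and the local file path are assumptions made for illustration; they are not part of Parsel or of any packaging tool.

```python
import hashlib
import os


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # SHA256 for parsel-1.0.0.tar.gz, copied from the table above.
    expected = "1d4c48bf4b574d415a53b940f8e5414b11ff0b54552dc32a44a9b93992836597"
    # Assumed download location; adjust to wherever the file was saved.
    path = "parsel-1.0.0.tar.gz"
    if os.path.exists(path):
        print(matches_published_hash(path, expected))
```

SHA256 is used here because MD5 is not suitable for integrity checks against deliberate tampering.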

File details

Details for the file parsel-1.0.0-py2.py3-none-any.whl.

File hashes

Hashes for parsel-1.0.0-py2.py3-none-any.whl
Algorithm    Hash digest
SHA256       11e799636f6c45c3e0e3ced93e9412e30eea3c9aba57e0e088de9eef4fc8cdc9
MD5          a6581a5f38a84a8da0d5b15e248ee8f0
BLAKE2b-256  bc0a463e2578daa55919cb23975d2cfd6f6f6df5d1722be7751a1de11dd90a70
