Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Project description

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul>
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re(r'\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
...     print(e.xpath('.//a/@href').extract_first())
http://example.com
http://scrapy.org

History

0.9.4 (2015-08-10)

  • Try workaround for travis-ci/dpl#253

0.9.3 (2015-08-07)

  • Add base_url argument

0.9.2 (2015-08-07)

  • Rename module unified -> selector and promote root attribute

  • Add create_root_node function

0.9.1 (2015-08-04)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Download files

Source Distribution

parsel-0.9.4.tar.gz (26.8 kB)

Uploaded Source

Built Distribution

parsel-0.9.4-py2.py3-none-any.whl (8.0 kB)

Uploaded Python 2 Python 3

File details

Details for the file parsel-0.9.4.tar.gz.

File metadata

  • Download URL: parsel-0.9.4.tar.gz
  • Size: 26.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-0.9.4.tar.gz
Algorithm Hash digest
SHA256 4de1260362ad7c0774a2d2774259caf048ca9fa016e375b5d5621f57d9c48918
MD5 07c93ab0db67a6cb04c07e899e9d63ea
BLAKE2b-256 b7111f9fc79734f17fefc79fdd26c5ac36b4ad5455d41cf37deb993abc1a7685

File details

Details for the file parsel-0.9.4-py2.py3-none-any.whl.

File hashes

Hashes for parsel-0.9.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 cfadf751d3a34c1f29dcfd077d55cd89389e4017c655217421712d06d85b16dc
MD5 3c022ee8e1db7daba9cc4bb65b668ecc
BLAKE2b-256 a8395b125374fba3c36d1bc88618c5f65efc2451cef20c1f57903de38a88c38f
