Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Project description

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul>
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re(r'\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
        print(e.xpath('.//a/@href').extract_first())
http://example.com
http://scrapy.org

History

0.9.5 (2015-08-11)

  • Support for extending SelectorList

0.9.4 (2015-08-10)

  • Try workaround for travis-ci/dpl#253

0.9.3 (2015-08-07)

  • Add base_url argument

0.9.2 (2015-08-07)

  • Rename module unified -> selector and promote root attribute

  • Add create_root_node function

0.9.1 (2015-08-04)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Download files

Source Distribution

parsel-0.9.6.tar.gz (27.0 kB)

Uploaded Source

Built Distribution

parsel-0.9.6-py2.py3-none-any.whl (8.1 kB)

Uploaded Python 2 Python 3

File details

Details for the file parsel-0.9.6.tar.gz.

File metadata

  • Download URL: parsel-0.9.6.tar.gz
  • Size: 27.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-0.9.6.tar.gz
Algorithm Hash digest
SHA256 28e021d5dfcf8e4d733cd6faa6c2b494c3855bad0b369f5490558116d0a0d241
MD5 77695eea3a156cedb3076c7521d25aeb
BLAKE2b-256 ea39b0e8339ce306b63ad7326500f1302fc42473007680225ce1d4270bd7274e

File details

Details for the file parsel-0.9.6-py2.py3-none-any.whl.

File hashes

Hashes for parsel-0.9.6-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 83aabb38c5240fa38274de294717c16907d41b227a6a904c370bf6a17228a446
MD5 a90ec1a1f4d442292d5f7d65c0a3dc3d
BLAKE2b-256 0433c4186350b21627633565546c3b6262854bc2bb47110170d639dc040af0c5
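
A downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using only the standard library. The helper names sha256_of_file and matches_published_digest are hypothetical, and the sketch assumes the file has already been downloaded to a local path.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, 'rb') as f:
        # Read in fixed-size chunks so large files are not loaded at once.
        for chunk in iter(lambda: f.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a local file's digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, matches_published_digest('parsel-0.9.6.tar.gz', '28e021d5dfcf8e4d733cd6faa6c2b494c3855bad0b369f5490558116d0a0d241') should return True for an intact copy of the source distribution.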
