Skip to main content

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Project description

https://img.shields.io/travis/scrapy/parsel.svg https://img.shields.io/pypi/v/parsel.svg

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Features

  • Extract text using CSS or XPath selectors

  • Regular expression helper methods

Example:

>>> from parsel import Selector
>>> sel = Selector(text=u"""<html>
        <body>
            <h1>Hello, Parsel!</h1>
            <ul>
                <li><a href="http://example.com">Link 1</a></li>
                <li><a href="http://scrapy.org">Link 2</a></li>
            </ul
        </body>
        </html>""")
>>>
>>> sel.css('h1::text').extract_first()
u'Hello, Parsel!'
>>>
>>> sel.css('h1::text').re('\w+')
[u'Hello', u'Parsel']
>>>
>>> for e in sel.css('ul > li'):
        print(e.xpath('.//a/@href')).extract_first()
http://example.com
http://scrapy.org

History

0.9.3 (2015-08-07)

  • Add base_url argument

0.9.2 (2015-08-07)

  • Rename module unified -> selector and promoted root attribute

  • Add create_root_node function

0.9.1 (2015-08-04)

  • Setup Sphinx build and docs structure

  • Build universal wheels

  • Rename some leftovers from package extraction

0.9.0 (2015-07-30)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parsel-0.9.3.tar.gz (27.3 kB view details)

Uploaded Source

Built Distribution

parsel-0.9.3-py2.py3-none-any.whl (8.1 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file parsel-0.9.3.tar.gz.

File metadata

  • Download URL: parsel-0.9.3.tar.gz
  • Upload date:
  • Size: 27.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for parsel-0.9.3.tar.gz
Algorithm Hash digest
SHA256 11e9b31ff13acc10fdbc3942837048c71fbb490a8bb3162987ead3c78139c174
MD5 72c7a2d057154200c09a23c084a14d8c
BLAKE2b-256 9f8fe6de25f4fa7bb8c40a2c07ffa737964dd9345952920d25ed40d073282451

See more details on using hashes here.

File details

Details for the file parsel-0.9.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for parsel-0.9.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 d59353028f4bb71971e7d728cce858a0767ba7be8560680a593c02da7b13f29d
MD5 0578ff710fab0985167f24b687fc5acf
BLAKE2b-256 04079216c7cfaf29fb20d04f6e83fcfcdcba4e4a63ebf4d234df326204359d57

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page