Skip to main content

Extract text from HTML

Project description

HTML to Text

PyPI Version Build Status Code Coverage

Extract text from HTML

  • Free software: MIT license

Install

Install with pip:

pip install html-text

The package depends on lxml, so you might need to install some additional packages: http://lxml.de/installation.html

Usage

Extract text from HTML:

>>> import html_text
>>> text = html_text.extract_text(u'<h1>Hey</h1>')
u'Hey'

Credits

The code is extracted from utilities used in several projects, written by Mikhail Korobov.

History

0.1.1 (2017-01-16)

  • Travis-CI and codecov.io integrations added

0.1.0 (2016-09-27)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

html_text-0.1.1.tar.gz (10.4 kB view details)

Uploaded Source

File details

Details for the file html_text-0.1.1.tar.gz.

File metadata

  • Download URL: html_text-0.1.1.tar.gz
  • Upload date:
  • Size: 10.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for html_text-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c2ab75db89a3198fa54f4d833d9fbe24c8c5d92324656ca10d5a0d76a5136724
MD5 1e60cd771c4ce1b71985239f83b7e82a
BLAKE2b-256 dd9c2c189c3f54ca8bf83dfb3e1c0f1c44799784ba817eb2cdbd590859b8e2a8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page