Skip to main content

Library of web-related functions

Project description

https://secure.travis-ci.org/scrapy/w3lib.png?branch=master Coverage report

Overview

This is a Python library of web-related functions, such as:

  • remove comments, or tags from HTML snippets

  • extract base url from HTML snippets

  • translate entites on HTML strings

  • convert raw HTTP headers to dicts and vice-versa

  • construct HTTP auth header

  • converting HTML pages to unicode

  • sanitize urls (like browsers do)

  • extract arguments from urls

Requirements

Python 2.7 or Python 3.3+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

w3lib-1.14.2.tar.gz (41.2 kB view details)

Uploaded Source

Built Distribution

w3lib-1.14.2-py2.py3-none-any.whl (16.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file w3lib-1.14.2.tar.gz.

File metadata

  • Download URL: w3lib-1.14.2.tar.gz
  • Upload date:
  • Size: 41.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for w3lib-1.14.2.tar.gz
Algorithm Hash digest
SHA256 bd87eae62d208eef70869951abf05e96a8ee559714074a485168de4c5b190004
MD5 6d1df7b8998fcb68a516da83983b5b98
BLAKE2b-256 56a68d1250b5e799ddbc013810a33bbce06a38b67f83571f4a560b5d993032a8

See more details on using hashes here.

Provenance

File details

Details for the file w3lib-1.14.2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for w3lib-1.14.2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 cdc23cbc2c009a50b936f672009d596a84c86cb238fa44890064ae5706ffe1ec
MD5 f8c7c9233cf1ced184b1ada12c5316a6
BLAKE2b-256 0e620efbd05918ca20302ee07c01550b089ac83bd655426ba8c443588c7efd9b

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page