A high-level Web Crawling and Web Scraping framework
Project description
Scrapy
Overview
Scrapy is a BSD-licensed fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.
Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.
Requirements
Python 3.8+
Works on Linux, Windows, macOS, BSD
Install
The quick way:
pip install scrapy
See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.
Documentation
Documentation is available online at https://docs.scrapy.org/ and in the docs directory.
Releases
You can check https://docs.scrapy.org/en/latest/news.html for the release notes.
Community (blog, twitter, mail list, IRC)
See https://scrapy.org/community/ for details.
Contributing
See https://docs.scrapy.org/en/master/contributing.html for details.
Code of Conduct
Please note that this project is released with a Contributor Code of Conduct.
By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@zyte.com.
Companies using Scrapy
See https://scrapy.org/companies/ for a list.
Commercial Support
See https://scrapy.org/support/ for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for Scrapy-2.11.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4be353d6abbb942a9f7e7614ca8b5f3d9037381176ac8d8859c8cac676e74fa0 |
|
MD5 | 5229fef617c4729199496347b3f5e24b |
|
BLAKE2b-256 | 0d4e6c06a4cfbb9a1e243c975da6a09982f085b1a14939fb195ea9e3e2f2b206 |