A high-level Web Crawling and Web Scraping framework
Overview
Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
For more information, including a list of features, check the Scrapy homepage at: http://scrapy.org
Requirements
Python 2.7
Works on Linux, Windows, Mac OS X, BSD
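The Python requirement above can be checked before installing. The following is a minimal sketch; the helper name `is_supported_python` is hypothetical and is not part of Scrapy.

```python
import sys


def is_supported_python(version_info=None):
    """Return True for Python 2.7, the only version listed under Requirements.

    Hypothetical helper for illustration; Scrapy does not ship this function.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only the (major, minor) pair, ignoring the patch level.
    return tuple(version_info[:2]) == (2, 7)


if __name__ == "__main__":
    if is_supported_python():
        print("Interpreter matches the listed requirement (Python 2.7).")
    else:
        print("This release lists Python 2.7; found %d.%d." % sys.version_info[:2])
```

Passing an explicit tuple such as `(2, 7, 18)` makes the check easy to exercise without switching interpreters.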
Install
The quick way:
pip install scrapy
For more details see the install section in the documentation: http://doc.scrapy.org/en/latest/intro/install.html
Releases
You can download the latest stable and development releases from: http://scrapy.org/download/
Documentation
Documentation is available online at http://doc.scrapy.org/ and in the docs directory.
Community (blog, twitter, mail list, IRC)
Contributing
Companies using Scrapy
Commercial Support
Download files
Source Distribution
Built Distribution
File details
Details for the file Scrapy-1.0.6.tar.gz.
File metadata
- Download URL: Scrapy-1.0.6.tar.gz
- Upload date:
- Size: 951.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 548999910fcd18216626e569f6362bc390acfda4cfc1dfe2423d56d9f1ad91c5
MD5 | 92c6d393e5fbf14a90543400e2c620d7
BLAKE2b-256 | 9dbbf88948090988641b4b4be26d8865f9668b7d0223b7b27e0ada351d41faaa
File details
Details for the file Scrapy-1.0.6-py2-none-any.whl.
File metadata
- Download URL: Scrapy-1.0.6-py2-none-any.whl
- Upload date:
- Size: 291.7 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9a7a67110cd658aca3e0e4ea7aa91fe909604a808cc1840e52ca4254e0e2be9e
MD5 | c9d0f8643275b061668a67b8c8da8858
BLAKE2b-256 | 0c11b342f4a8bd2302023ab6ce8d5553c986ae297e6a3064c506cb7a21648265
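A downloaded file can be compared against the SHA256 digests listed above. This is a minimal sketch using only the standard library `hashlib` module; the function names and the `EXPECTED_SHA256` mapping are hypothetical, and it assumes the file has already been saved locally under its original name.

```python
import hashlib
import os

# SHA256 digests copied from the "File hashes" tables above.
EXPECTED_SHA256 = {
    "Scrapy-1.0.6.tar.gz":
        "548999910fcd18216626e569f6362bc390acfda4cfc1dfe2423d56d9f1ad91c5",
    "Scrapy-1.0.6-py2-none-any.whl":
        "9a7a67110cd658aca3e0e4ea7aa91fe909604a808cc1840e52ca4254e0e2be9e",
}


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True when the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


def verify_download(path):
    """Look up the digest by file name and compare; unknown names return False."""
    expected = EXPECTED_SHA256.get(os.path.basename(path))
    return expected is not None and matches_expected(path, expected)
```

For example, `verify_download("Scrapy-1.0.6.tar.gz")` should return `True` only if the local file's digest equals the value in the table. A mismatch suggests a corrupted or altered download.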