grab

Web Scraping Framework

These details have not been verified by PyPI

Project links

Homepage

Project description

https://pypip.in/download/grab/badge.svg?period=month

https://landscape.io/github/lorien/grab/master/landscape.png

https://readthedocs.org/projects/grab/badge/?version=latest

What is Grab?

Grab is a python web scraping framework. Grab provides tons of helpful methods to scrape web sites and to process the scraped content:

Automatic cookies (session) support
HTTP and SOCKS proxy with and without authorization
Keep-Alive support
IDN support
Tools to work with web forms
Easy multipart file uploading
Flexible customization of HTTP requests
Automatic charset detection
Powerful API of extracting info from HTML documents with XPATH queries
Asynchronous API to make thousands of simultaneous queries. This part of library called Spider and it is too big to even list its features in this README.
Python 3 ready

Grab Example

from grab import Grab
import logging

logging.basicConfig(level=logging.DEBUG)
g = Grab()
g.go('https://github.com/login')
g.set_input('login', '***')
g.set_input('password', '***')
g.submit()
g.doc.save('/tmp/x.html')

g.doc('//span[contains(@class, "octicon-sign-out")]').assert_exists()
home_url = g.doc('//a[contains(@class, "header-nav-link name")]/@href').text()
repo_url = home_url + '?tab=repositories'

g.go(repo_url)
for elem in g.doc.select('//h3[@class="repo-list-name"]/a'):
    print('%s: %s' % (elem.text(),
                      g.make_url_absolute(elem.attr('href'))))

Grab::Spider Example

from grab.spider import Spider, Task
import logging

class ExampleSpider(Spider):
    def task_generator(self):
        for lang in ('python', 'ruby', 'perl'):
            url = 'https://www.google.com/search?q=%s' % lang
            yield Task('search', url=url, lang=lang)

    def task_search(self, grab, task):
        print('%s: %s' % (task.lang,
                          grab.doc('//div[@class="s"]//cite').text()))


logging.basicConfig(level=logging.DEBUG)
bot = ExampleSpider()
bot.run()

Installation

Pip is recommended way to install Grab and its dependencies:

$ pip install -U grab

See details here http://docs.grablib.org/en/latest/usage/installation.html

Documentation and Help

Documentation: http://docs.grablib.org/en/latest/

English mailing list: http://groups.google.com/group/grab-users/

Russian mailing list: http://groups.google.com/group/python-grab/

Contribution

To report a bug please use github issue tracker: https://github.com/lorien/grab/issues

If you want to develop new feature in Grab please use issue tracker to describe what you want to do or contact me at lorien@lorien.name

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.6.41

Jun 24, 2018

0.6.40

May 14, 2018

0.6.39

May 10, 2018

0.6.38

May 17, 2017

0.6.37

May 13, 2017

0.6.36

May 13, 2017

0.6.35

Feb 6, 2017

0.6.34

Feb 4, 2017

0.6.33

Jan 27, 2017

0.6.32

Dec 31, 2016

0.6.31

Dec 31, 2016

0.6.30

Nov 22, 2015

0.6.29

Oct 15, 2015

0.6.28

Oct 13, 2015

0.6.27

Oct 13, 2015

0.6.26

Oct 9, 2015

0.6.25

Sep 20, 2015

0.6.24

Sep 9, 2015

0.6.23

Aug 27, 2015

0.6.22

Aug 14, 2015

0.6.21

Jun 20, 2015

This version

0.6.20

Jun 8, 2015

0.6.19

Jun 5, 2015

0.6.18

Jun 5, 2015

0.6.17

Jun 5, 2015

0.6.16

Jun 3, 2015

0.6.15

May 31, 2015

0.6.14

May 18, 2015

0.6.13

May 12, 2015

0.6.12

May 7, 2015

0.6.11

May 7, 2015

0.6.10

Apr 30, 2015

0.6.9

Apr 29, 2015

0.6.8

Apr 26, 2015

0.6.7

Apr 26, 2015

0.6.6

Apr 23, 2015

0.6.5

Apr 16, 2015

0.6.4

Apr 12, 2015

0.6.3

Apr 10, 2015

0.6.2

Apr 9, 2015

0.6.1

Apr 8, 2015

0.6.0

Apr 6, 2015

0.5.5

Mar 27, 2015

0.5.4

Mar 7, 2015

0.5.3

Mar 7, 2015

0.5.2

Feb 22, 2015

0.5.1

Feb 16, 2015

0.5.0

Feb 9, 2015

0.4.13

Sep 12, 2013

0.4.12

Jul 25, 2013

0.4.11

Jun 7, 2013

0.4.10

May 1, 2013

0.4.9

Apr 27, 2013

0.4.8

Nov 18, 2012

0.4.7

Aug 31, 2012

0.4.5

Jun 27, 2012

0.4.4

Jun 21, 2012

0.4.3

Jun 10, 2012

0.4.2

May 16, 2012

0.4.1

Apr 28, 2012

0.4.0

Apr 27, 2012

0.3.33

Apr 13, 2012

0.3.32

Apr 5, 2012

0.3.31

Mar 30, 2012

0.3.30

Mar 27, 2012

0.3.29

Mar 7, 2012

0.3.28

Mar 6, 2012

0.3.27

Mar 6, 2012

0.3.26

Mar 5, 2012

0.3.25

Mar 1, 2012

0.3.24

Feb 21, 2012

0.3.23

Jan 26, 2012

0.3.22

Jan 16, 2012

0.3.21

Jan 6, 2012

0.3.20

Dec 31, 2011

0.3.19

Dec 25, 2011

0.3.18

Dec 20, 2011

0.3.17

Dec 18, 2011

0.3.16

Dec 7, 2011

0.3.15

Dec 2, 2011

0.3.14

Nov 24, 2011

0.3.13

Nov 22, 2011

0.3.12

Nov 14, 2011

0.3.11

Nov 9, 2011

0.3.10

Nov 6, 2011

0.3.9

Nov 6, 2011

0.3.8

Nov 5, 2011

0.3.7

Nov 5, 2011

0.3.6

Nov 4, 2011

0.3.4

Oct 26, 2011

0.3.3

Oct 23, 2011

0.3.2

Oct 3, 2011

0.3.1

Sep 23, 2011

0.3

Sep 2, 2011

0.2.20

Aug 21, 2011

0.2.19

Aug 14, 2011

0.2.18

Jul 31, 2011

0.2.17

Jul 31, 2011

0.2.16

Jul 23, 2011

0.2.15

Jul 23, 2011

0.2.12

Jun 17, 2011

0.2.11

Jun 13, 2011

0.2.10

May 17, 2011

0.2.9

May 11, 2011

0.2.8

May 5, 2011

0.2.7

May 5, 2011

0.2.6

Mar 23, 2011

0.2.5

Dec 5, 2010

0.2.4

Dec 5, 2010

0.2.3

Nov 10, 2010

0.2.2

Nov 8, 2010

0.2.1

Nov 1, 2010

0.2.0

Nov 1, 2010

0.1.7

Sep 12, 2010

0.1.6

Sep 8, 2010

0.1.5

Sep 8, 2010

0.1.4

Sep 4, 2010

0.1.3

Sep 4, 2010

0.1.2

Sep 3, 2010

0.1.1

Aug 14, 2010

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grab-0.6.20.tar.gz (91.4 kB view details)

Uploaded Jun 8, 2015 Source

File details

Details for the file grab-0.6.20.tar.gz.

File metadata

Download URL: grab-0.6.20.tar.gz
Upload date: Jun 8, 2015
Size: 91.4 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for grab-0.6.20.tar.gz
Algorithm	Hash digest
SHA256	`4c40ca059d00df5e4b68ca52e5d0e6796ca8f9d00577568dcc25d857d8565bab`
MD5	`6b394f8881a10e7781b3c360f5eae9de`
BLAKE2b-256	`4d2559120663256795a57f834f76ba050477724802122e06de8bd59f54b1ebb3`