Nasy Crawler Framework -- Never had such a pure crawler.

Prologue

There has never been a crawler as pure as nacf.

Although I often write crawlers, I don't like heavyweight frameworks such as scrapy; I prefer plain requests+bs4 or the more general requests_html. However, both are inconvenient for a crawler: things like error retrying and parallel crawling have to be written by hand. None of it is difficult, but writing it over and over gets tedious. Hence I started nacf (Nasy Crawler Framework), hoping to simplify error retrying and parallel crawling.
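
To make the pain point concrete, here is a minimal sketch of the retry and parallel boilerplate that usually gets rewritten for every crawler. It uses only the Python standard library and is not nacf's API: the names `retry`, `crawl_parallel`, and `fake_fetch` are hypothetical, and `fake_fetch` merely stands in for a real HTTP call so the example runs without a network.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from functools import wraps


def retry(times=3, delay=0.0, exceptions=(Exception,)):
    """Retry the wrapped function, making at most `times` attempts in total."""
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, times + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    if attempt == times:
                        raise  # out of attempts: surface the last error
                    time.sleep(delay)
        return wrapper
    return decorator


def crawl_parallel(fetch, urls, workers=4):
    """Run `fetch` over `urls` in a thread pool, keeping the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, urls))


# Hypothetical stand-in for a real HTTP request:
# it fails the first time it sees a URL and succeeds afterwards.
_seen = set()


@retry(times=3)
def fake_fetch(url):
    if url not in _seen:
        _seen.add(url)
        raise ConnectionError(f"transient failure for {url}")
    return f"<html>{url}</html>"


if __name__ == "__main__":
    pages = crawl_parallel(
        fake_fetch, ["https://example.com/a", "https://example.com/b"]
    )
    print(pages)
```

In a real crawler, `fake_fetch` would be replaced by an actual request function; the decorator and the pool helper stay the same, which is exactly the repetitive part a small framework can absorb.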

Packages

Table 1: Packages

| Package       | Version | Description              |
|---------------+---------+--------------------------|
| requests-html | 0.9.0   | HTML Parsing for Humans. |

Development Process

TODO Http Functions

DONE Get

CLOSED: [2018-12-25 Tue 17:36]

NEXT Post

Epilogue

History

Version 0.1.0

  • Date: <2018-12-23 Sun>
  • Commemorative Version: First Version
    • Basic Functions.
