fetch-sitemap

Fetch a given sitemap and retrieve all URLs in it.

usage: fetch-sitemap 
    [-h] 
    [--basic-auth BASIC_AUTH] 
    [-l LIMIT] 
    [-c CONCURRENCY_LIMIT] 
    [-t REQUEST_TIMEOUT] 
    [--random] 
    [--report-path REPORT_PATH] 
    sitemap_url

Fetch a given sitemap and retrieve all URLs in it.

positional arguments:
  sitemap_url           URL of the sitemap to fetch

options:
  -h, --help            show this help message and exit
  --basic-auth BASIC_AUTH
                        Basic auth information. Use: 'username:password'.
  -l LIMIT, --limit LIMIT
                        Max number of URLs to fetch from the given sitemap.xml. Default: All
  -c CONCURRENCY_LIMIT, --concurrency-limit CONCURRENCY_LIMIT
                        Max number of concurrent requests. Default: 10
  -t REQUEST_TIMEOUT, --request-timeout REQUEST_TIMEOUT
                        Timeout for fetching a URL. Default: 30
  --random              Append a random string like ?12334232343 to each URL to bypass frontend cache. Default: False
  --report-path REPORT_PATH
                        Store results in a CSV file. Example: ./report.csv
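
The options above suggest a few small mechanisms: splitting 'username:password', appending a random cache-busting string, and fetching with a concurrency cap and a per-URL timeout. The following is a minimal sketch of how such behavior could be written, not the tool's actual implementation. All function names (parse_basic_auth, add_cache_buster, fetch_all) and the pluggable fetch coroutine are hypothetical. Treating the timeout as seconds and using "&" when a URL already has a query string are assumptions.

```python
import asyncio
import random


def parse_basic_auth(value):
    """Split a 'username:password' string at the first colon (hypothetical helper)."""
    username, sep, password = value.partition(":")
    if not sep:
        raise ValueError("expected 'username:password'")
    return username, password


def add_cache_buster(url):
    """Append a random numeric string such as ?12334232343 to the URL.

    Assumption: if the URL already has a query string, join with '&'.
    """
    token = random.randint(10**10, 10**11 - 1)
    separator = "&" if "?" in url else "?"
    return f"{url}{separator}{token}"


async def fetch_all(urls, fetch, concurrency_limit=10, request_timeout=30, limit=None):
    """Await fetch(url) for each URL with bounded concurrency.

    fetch is any coroutine function supplied by the caller (hypothetical).
    request_timeout is assumed to be in seconds; a timed-out URL yields None.
    """
    semaphore = asyncio.Semaphore(concurrency_limit)

    async def fetch_one(url):
        async with semaphore:  # at most concurrency_limit fetches run at once
            try:
                return url, await asyncio.wait_for(fetch(url), request_timeout)
            except asyncio.TimeoutError:
                return url, None

    selected = urls if limit is None else urls[:limit]
    # gather returns results in the same order as the selected URLs
    return await asyncio.gather(*(fetch_one(url) for url in selected))
```

With a real HTTP client, fetch would be a coroutine that requests the URL and returns, for example, its status code. The semaphore keeps the number of in-flight requests at or below the concurrency limit regardless of how many URLs the sitemap lists.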

File details

Details for the file fetch-sitemap-1.tar.gz.

File metadata

  • Download URL: fetch-sitemap-1.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for fetch-sitemap-1.tar.gz
Algorithm    Hash digest
SHA256       e513cd995582e39520660a6e1cd781d955b7d272689df8269635f98a4b415cad
MD5          d438670b42d2a63feb3444448fc3436d
BLAKE2b-256  273754bf5497d23fbe77a95cbbd56d8bdcc90c8cc49066f162fa6e3cb9aff9ed

File details

Details for the file fetch_sitemap-1-py3-none-any.whl.

File metadata

  • Download URL: fetch_sitemap-1-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for fetch_sitemap-1-py3-none-any.whl
Algorithm    Hash digest
SHA256       967b14859ec0ebd565b9a2ee4775ab255740bcf90ac9ed7618df287e9d6c922f
MD5          cea776f0a0d3695a50079163e90db025
BLAKE2b-256  f2ae15ddbd619e46b2bb5f9ce44c706bc36a68134648a03df7ff8bf0e7ed25f8
