YoullDownload

Grab from a remote site page all resources that a browser will probably download visiting the page

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: GNU General Public License (GPL)
Operating System
- OS Independent
Programming Language
- Python
- Python :: 2.7
Topic
- Internet :: WWW/HTTP
- Utilities

Project description

Quick info

Let say you need to use the HTTP load testing and benchmarking utility siege on a web page and you also want to use the --internet option, to simulate at best the behavior of a web browser.

When a web browser load a page, it also load all the resources inside that page:

Images
JavaScript files
CSS
Media resources

So you need a list of all URLs taken from that page.

This utility (its name mean “You Will Download”) will simply create this list for you.

You simply need to redirect the utility output to a file, then use also the siege --file option.

Usage

$ youlldownload http://host.com/section/page

Using with siege:

$ youlldownload http://host.com/section/page > list.txt
$ siege -i -f list.txt [other options]

Taken resouces

from script tags we’ll take the src URL
from link tags with rel equals to stylesheet we’ll take the href url
from img tags we’ll take the src URL
from object tags we’ll take the data URL
from embed tags we’ll take the src URL
from style tags we’ll take the URL inside if the tag is using an “@import url” directive
from iframe tags we’ll take the src URL
from source tags inside video we’ll take the src URL

Also: CSS sources are deeply analyzed for found additional resources inside them (like background images, fonts, …).

Authors

This product was developed by RedTurtle Technology team.

Changelog

0.3 (2015-05-28)

Remove duplicated URLs from final report [keul]
Do not include same version of an URL with anchors [keul]
Inspect also resources from CSS (backgroun images, fonts, …) [keul]
Script was not properly working outside homepage if a “base” tag was not provided [keul]

0.2 (2014-04-02)

Added support for src attribute of iframe tag [keul]
Added support for src attribute of source tag (HTML 5 video element) [keul]
Do not break if base tag is not present [keul]

0.1 (2013-01-30)

initial release

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: GNU General Public License (GPL)
Operating System
- OS Independent
Programming Language
- Python
- Python :: 2.7
Topic
- Internet :: WWW/HTTP
- Utilities

Release history Release notifications | RSS feed

0.4

Nov 9, 2015

This version

0.3

May 28, 2015

0.2

Apr 2, 2014

0.1

Jan 30, 2013

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

YoullDownload-0.3.tar.gz (8.8 kB view hashes)

Uploaded May 28, 2015 Source

Hashes for YoullDownload-0.3.tar.gz

Hashes for YoullDownload-0.3.tar.gz
Algorithm	Hash digest
SHA256	`6c11585ea5c34f3bd54e182b138058fddc18579adff86a34842040c0986fac23`
MD5	`df7e91503d0c73b7100aa83cec832bb4`
BLAKE2b-256	`98eb3052adb1e5ecd038b641a0b8287cb4c420c47af608fb1b60fc4f75fa9a63`