Find the feed URLs for a website.
Project description
This is an asynchronous Python library for finding links feeds on a website.
It is based on the synchronous (requests based) feedfinder2, written by Dan Foreman-Mackey, which is based on feedfinder - originally written by Mark Pilgrim and subsequently maintained by Aaron Swartz until his untimely death.
Usage
Feedfinder2 offers a single public function: find_feeds. You would use it as following:
import asyncio from aio_feedfinder2 import find_feeds loop = asyncio.get_event_loop() task = asyncio.ensure_future(find_feeds("xkcd.com")) feeds = loop.run_until_complete(future)
Now, feeds is the list: ['http://xkcd.com/atom.xml', 'http://xkcd.com/rss.xml']. There is some attempt made to rank feeds from best candidate to worst but… well… you never know.
This asyncio variant is ideally suited to find feeds on multiple domains/ sites in an asynchronous way:
import asyncio from aio_feedfinder2 import find_feeds loop = asyncio.get_event_loop() tasks = [find_feeds(url) for url in ["xkcd.com", "abstrusegoose.com"]] feeds = loop.run_until_complete(asyncio.gather(*tasks)) >>> feeds ... [ ... ['http://xkcd.com/atom.xml', 'http://xkcd.com/rss.xml'], ... ['http://abstrusegoose.com/feed.xml', 'http://abstrusegoose.com/atomfeed.xml'] ... ]
License
Feedfinder2 is licensed under the MIT license (see LICENSE).
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for aio_feedfinder2-0.3.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f226f9c918ad229cc8a0623dd95258b7b02ce8b70d602e761aa51ac12d9bc4c |
|
MD5 | ce1a705849a5d074635efab97cbbd820 |
|
BLAKE2b-256 | d5d416a25bb277a9c189f20118c970c13d21b1fa9209cb4d9290738cc4fd1565 |