56 projects
pymarc
Read, write and modify MARC bibliographic data
pincushion
An archiving tool for Historypin
bagit
Create and validate BagIt packages
etudier
Collect a citation graph from Google Scholar
feediverse
Connect an RSS Feed to Mastodon
idloc
Get JSON-LD for a Library of Congress name or subject authority.
marctable
Convert MARC to CSV and Parquet
memento-cli
Examine snapshots in eeb archives such as the Internet Archive's Wayback Machine
twarc-csv
A twarc plugin to output Twitter data as CSV
markdown-to-respec
Convert specifications written in Markdown to ReSpec HTML
wikipediarevs
Download all the revisions for a set of Wikipedia articles
twarc
Archive tweets from the command line
bibdesk2zotero
convert BibDesk BibTeX files for import into Zotero
public-domains
Find possible host names in a source text
twitter-archive-unshorten
Unshorten the URLs in your Twitter archive
twarc-network
Generate network visualizations for Twitter data
twarc-edits
Find edited tweetes
waybackprov
Checks the provenance of a URL in the Wayback machine
twarc-timeline-archive
A twarc plugin to collect the timelines of a list of users
microdata
html5lib extension for parsing microdata
twarc-text
A twarc plugin to print tweets to the terminal
twarc-hashtags
A twarc plugin to extract hashtags from Twitter data
airwaves
Unlocking the Airwaves Utilities
xkcd2347
List the dependencies for a github project
twarc-videos
A twarc plugin to extract referenced video from tweet data
twarc-ids
A twarc plugin to read Twitter data and output the tweet ids
luckysocial
lookup social media accounts for names
wikieds
Command line tool to Print a markdown summary of editors for a Wikipedia article.
fusionbuilder
Parse Fusion Page Builder text.
inst341data
install data in jupyter notebook
bagcat
A command line utility for managing BagIt packages in Amazon S3
puid
Lookup a PRONOM Unique Identifier for a file.
dedoop
dedupe files and send them to the cloud
ptree
Work with PairTree file system convention
wikilinks
Get a list of Wikipedia articles that link to a website.
solrpy
Client for the Solr search service
htmldiff2
Diffs arbitrary HTML inline.
diffengine
Monitor changes to webpages in RSS feeds
oembedders
A utility for dispatching to known oembed providers
iacoll
Collect metadata for Internet Archive collections
storified
Download your Storify data
lastweet
Send Twitter/Mastodon updates about LastFM activity
nyaraka
Download Omeka data
wikidata_suggest
Interactively look up Wikidata entities from the command line
hathitables
Turn HathiTrust Collections into CSV
hathilda
Turn HathiTrust Data into JSON-LD
teizone
Add coordinates to TEI zones.
summoner
Work with the Serials Solutions Summon API
wplinks
find wikipedia articles that links to a website
opensearch
Interact with opensearch services
skosdict
Turn a SKOS concept scheme into a JSON dictionary
oai2pairtree
UNKNOWN
twitterator
iterating functions for twitter api
dflat
a command line tool for working with dflat digital preservation file systems
marcup
manage create/update/deletes marc feeds
marcdb
parse MARC data and store into a rdbms