47 projects
rigour
Financial crime domain data validation and normalization library.
servicelayer
Basic remote service functions for alephdata components
alephclient
Command-line client for Aleph API
followthemoney
countrynames
A library to map country names to ISO codes.
yente
nomenklatura
Make record linkages in followthemoney data.
datapatch
memorious
A minimalistic, recursive web crawling library for Python.
normality
Micro-library to normalize text strings
fingerprints
A library to generate entity fingerprints.
pantomime
MIME type normalisation and labels.
addressformatting
Formatting utility for international postal addresses
dataset
Toolkit for Python-based database access.
zavod
Data factory for followthemoney data.
articledata
Utility library for trading article data.
followthemoney-enrich
prefixdate
Helper for parsing and formatting dates given with varying degrees of precision.
opensanctions
languagecodes
A library that normalises language codes
banal
Commons of banal micro-functions for Python.
babbage
A light-weight analytical engine for OLAP processing
pdflib
Python bindings for poppler
balkhash
Cloud storage library to store raw and structured data from different datasets in a data lake
urlnormalizer
Normalize URLs. Mostly useful for deduplicating HTTP URLs.
storagelayer
Content-addressable storage for aleph and memorious
pgcsv
CSV to Postgres data puncher.
apikit
A set of utility functions for RESTful Flask applications.
exactitude
A library with real-world data parsers.
morphium
Tools for scrapers on morph.io
datafreeze
Export data from a SQL database to a set of file formats.
messytables
Parse messy tabular data in various formats
cronosparser
Parser for CronosPro / CronosPlus database files.
typecast
Convert types in source data.
jsonmapping
Map flat data to structured JSON via a mapping.
sparqlquery
SPARQL query builder, fork of sparqlquery
metafolder
Store a bunch of documents alongside basic metadata
jsongraph
Library for data integration using a JSON/RDF object graph.
mqlparser
Parser for MQL queries
jtssql
Generate database tables based on JSON Table Schema
thready
Simple wrapper for threaded execution.
fiscalmodel
Reference data for fiscal data classification
spendb
SpenDB
googlesheets
Simply read and write Google Spreadsheets from Python
osvalidate
OpenSpending Model/Data Validation
restpager
A RESTful pager class for Flask
pynomenklatura
Client library for nomenklatura, make record linkages on the web.