Skip to main content
Avatar for fgregg from gravatar.com
Username    fgregg

44 projects

opencivicdata

Last released

python opencivicdata library

probablepeople

Last released

Parse romanized names & companies using advanced NLP methods

usaddress

Last released

Parse US addresses using conditional random fields

python-crfsuite

Last released

Python binding for CRFsuite

parserator

Last released

Create parsers

census-area

Last released

Census data for arbitrary geographies

dedupe

Last released

A python library for accurate and scaleable data deduplication and entity-resolution

django-councilmatic

Last released

Core functions for councilmatic.org family

dedupe-variable-address

Last released

Address variable type for dedupe

dedupe-variable-datetime

Last released

DateTime variable type for dedupe

dedupe-variable-name

Last released

Name variable type for dedupe

parseratorvariable

Last released

Structured variable type for dedupe

pyhacrf-datamade

Last released

Hidden alignment conditional random field, a discriminative string edit distance

PyLBFGS

Last released

LBFGS and OWL-QN optimization algorithms

census

Last released

A wrapper for the US Census Bureau's API

datasette-datatable

Last released

Export Datasette records as a DataTable

kubra

Last released

command line tool for downloading utility outage data

govqa

Last released

Interact with GovQA, a public records request management platform owned by Granicus

pupa

Last released

scraping framework for muncipal data

chicagorequests

Last released

command line tool for downloading Chicago Open311 data

dedupe-Levenshtein-search

Last released

Search through documents for approximately matching strings. A fork of Matt Anderson's library for MIT licensing

scraper-legistar

Last released

Mixin classes for legistar scrapers

affinegap

Last released

A Cython implementation of the affine gap string distance

rlr

Last released

Case weighted L2 regularized logistic regression

DoubleMetaphone

Last released

Python wrapper for C++ Double Metaphone

dedupe-hcluster

Last released

Hierarchical Clustering Algorithms (Information Theory)

django-proxy-overrides

Last released

Overridable foreign key fields for Proxy models

nwss

Last released

A marshmallow schema for the National Wastewater Surveillance System

dedupe-variable-ilcs

Last released

Dedupe variable for Illinois Compiled Statute (ILCS) codes

ilcs-parser

Last released

Probabilistic parser for tagging data that references the Illinois Compiled Statutes (ILCS).

csvdedupe

Last released

Command line tools for deduplicating and merging csv files

dedupe-variable-number

Last released

Employer variable type for dedupe

django-councilmatic-notifications

Last released

Core functions for councilmatic.org family

datetime-distance

Last released

Compare string distances between dates, timestamps, or datetime objects.

simplecosine

Last released

Simple cosine distance

highered

Last released

Learnable Edit Distance Using PyHacrf

categorical-distance

Last released

Compare two categorical variables

dedupe-variable-person

Last released

Variable type for American Person Names

companyparser

Last released

UNKNOWN

probableparsing

Last released

Common methods for propbable parsers

dedupe-variable-employer

Last released

Employer variable type for dedupe

dedupe-variable-fuzzycategory

Last released

Fuzzy Categoy variable type for dedupe

fuzzycategory

Last released

A context comparison

canonicalize

Last released

canonicalize a cluster of records

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page