Skip to main content

CRATE: clinical records anonymisation and text extraction

Project description

https://img.shields.io/badge/code%20style-black-000000.svg

Purpose

  • Anonymises relational databases.

  • Performs some specific preprocessing tasks; e.g.

    • preprocesses some specific databases (e.g. Servelec RiO EMR);

    • drafts a “data dictionary” for anonymisation, with special knowledge of some databases (e.g. TPP SystmOne);

    • fetches some word lists, e.g. forenames/surnames/eponyms.

  • Provides a natural language processing (NLP) pipeline.

  • Web app for

    • querying the anonymised database

    • managing a consent-to-contact process

Documentation

See https://crateanon.readthedocs.io

Sources

Licence

  • Copyright (C) 2015, University of Cambridge, Department of Psychiatry. Created by Rudolf Cardinal (rnc1001@cam.ac.uk).

  • Licensed under the GNU GPL v3+: see LICENSE file.

  • Some third-party libraries have slightly different licences:

    • aspects of CamAnonGatePipeline.java are based on demonstration GATE code, copyright (C); University of Sheffield, and licensed under the GNU LGPL; see https://gate.ac.uk/.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crate-anon-0.19.4.tar.gz (19.9 MB view details)

Uploaded Source

File details

Details for the file crate-anon-0.19.4.tar.gz.

File metadata

  • Download URL: crate-anon-0.19.4.tar.gz
  • Upload date:
  • Size: 19.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/2.7.18

File hashes

Hashes for crate-anon-0.19.4.tar.gz
Algorithm Hash digest
SHA256 54c07dc55929091ebd26c87037e46f3739b1cb3c57cb7a7b2718bec49014f7a1
MD5 07a09c815a04d5d213e8bbdd3df7210a
BLAKE2b-256 bb3169b85303db4ad67066cd715b56a56d04899e5e7ade8eeced42c0edc7ab66

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page