Acromine based Disambiguation of Entities From Text

Adeft

Adeft (Acromine based Disambiguation of Entities From Text context) is a utility for building models that disambiguate acronyms and other abbreviations of biological terms in the scientific literature. It uses an implementation of Acromine, an algorithm developed by NaCTeM at the University of Manchester, to identify possible longform expansions for shortforms in a text corpus. Users can then build models that disambiguate shortforms based on their text context. A growing number of pretrained disambiguation models are publicly available for download through adeft.
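
As a rough illustration of what identifying longform expansions involves, the sketch below counts the word sequences that directly precede a parenthesized shortform, as in "estrogen receptor (ER)". It is a deliberately simplified toy, not Adeft's implementation or the Acromine algorithm itself. The function name candidate_longforms, the max_words parameter, and the example texts are all hypothetical.

```python
import re
from collections import Counter


def candidate_longforms(texts, shortform, max_words=4):
    """Count word sequences that directly precede '(shortform)' in the texts.

    Toy illustration only: this is raw counting, not Adeft's miner.
    """
    # Capture up to max_words words immediately before "(shortform)".
    pattern = re.compile(
        r"\b((?:[\w-]+\s+){1,%d})\(%s\)" % (max_words, re.escape(shortform))
    )
    counts = Counter()
    for text in texts:
        for match in pattern.finditer(text):
            words = match.group(1).lower().split()
            # Every suffix of the preceding words is a candidate longform.
            for start in range(len(words)):
                counts[" ".join(words[start:])] += 1
    return counts


# Hypothetical example texts in which ER has two different meanings.
example_texts = [
    "The estrogen receptor (ER) is a nuclear receptor.",
    "Stress in the endoplasmic reticulum (ER) triggers signalling.",
    "Expression of estrogen receptor (ER) was measured.",
]
er_candidates = candidate_longforms(example_texts, "ER")
```

In this toy corpus, "estrogen receptor" is counted twice and "endoplasmic reticulum" once. Incidental candidates such as "receptor" are counted too, which is why a real miner has to do more than count.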

Installation

Adeft works with Python versions 3.5 and above. It is available on PyPI and can be installed with the command

$ pip install adeft

Adeft's pretrained machine learning models can then be downloaded with the command

$ python -m adeft.download

Using Adeft

A dictionary of available models can be imported with

from adeft import available_models

The dictionary maps shortforms to model names. It's possible for multiple equivalent shortforms to map to the same model.
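
To make the shape of this mapping concrete, here is a minimal sketch that inverts such a dictionary so each model name lists the shortforms that point to it. The example_models dictionary is a made-up stand-in, not the real contents of available_models, and shortforms_by_model is a hypothetical helper that is not part of Adeft.

```python
from collections import defaultdict

# Hypothetical stand-in for adeft's available_models; the entries are made up.
example_models = {"ER": "ER", "IR": "IR", "PA": "PA", "PAs": "PA"}


def shortforms_by_model(models):
    """Group shortforms by the model name they map to."""
    grouped = defaultdict(list)
    for shortform, model_name in models.items():
        grouped[model_name].append(shortform)
    return dict(grouped)


grouped_models = shortforms_by_model(example_models)
```

Here the shortforms PA and PAs both end up under the single model name PA, which is the many-to-one case described above.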

Here's an example of running a disambiguator for ER on a list of texts:

from adeft.disambiguate import load_disambiguator

# Load the pretrained disambiguator for the shortform ER
er_dd = load_disambiguator('ER')

# texts: a list of texts to disambiguate
...

er_dd.disambiguate(texts)

Users may also build and train their own disambiguators. See the documentation for more information.

Documentation

Documentation is available at https://adeft.readthedocs.io

Jupyter notebooks illustrating Adeft workflows are available under notebooks.

Testing

Adeft uses nosetests for unit testing and is integrated with the Travis continuous integration environment. To run the tests locally, first install the test-specific requirements listed in setup.py with

$ pip install adeft[test]

and download all pretrained models as shown above. Then run nosetests in the top-level adeft folder.

Funding

Development of this software was supported by the Defense Advanced Research Projects Agency under award W911NF018-1-0124 and the National Cancer Institute under award U54-CA225088.
