Skip to main content

Top level package

Project description

PumpkinPy - Semantic similarity implemented in python

About

PumpkinPy uses IC ordered bitmaps for fast ranking of genes and diseases. This is useful for larger ontologies such as Upheno and large datasets such as ranking all mouse genes given a set of input HPO terms. This approach was first used in OWLTools and OwlSim-v3.

The goal of this project was to build an implementation of the PhenoDigm algorithm in python. There are also implementations for common measures for distance and similarity (euclidean, cosine, Jin-Conrath, Resnik, jaccard)

Disclaimer: This is a side project and has little documetation and only a modest amount of testing

Getting Started

Requires python 3.8+ and python3-dev to install pyroaring

python3.8 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export PYTHONPATH=.:$PYTHONPATH
Fetching annotations and closures

Uses robot and sparql to generate closures and class labels

Annotation data is fetched from the latest Monarch release

  • Requires >Java 8

cd data/monarch/ && make

PhenoDigm Reference: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3649640/
Exomiser: https://github.com/exomiser/Exomiser
OWLTools: https://github.com/owlcollab/owltools
OWLSim-v3: https://github.com/monarch-initiative/owlsim-v3

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pumpkin_py-0.0.1a3.tar.gz (18.7 MB view details)

Uploaded Source

Built Distribution

pumpkin_py-0.0.1a3-py3-none-any.whl (20.5 kB view details)

Uploaded Python 3

File details

Details for the file pumpkin_py-0.0.1a3.tar.gz.

File metadata

  • Download URL: pumpkin_py-0.0.1a3.tar.gz
  • Upload date:
  • Size: 18.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.26.0

File hashes

Hashes for pumpkin_py-0.0.1a3.tar.gz
Algorithm Hash digest
SHA256 052056d182af26b811c0a9592c7e0aa51ff30ad03f1299903e111a77df022a9b
MD5 15e2a8cc20773d30aaa0a331c284f1d7
BLAKE2b-256 7292da45dc8f18d5b58273fa137f284c8a03bc5dbcf7ecb8b6ecb96f520c1074

See more details on using hashes here.

File details

Details for the file pumpkin_py-0.0.1a3-py3-none-any.whl.

File metadata

File hashes

Hashes for pumpkin_py-0.0.1a3-py3-none-any.whl
Algorithm Hash digest
SHA256 8ba0c5a1a6e721ff114479b0d75302f62b031bff138898757eca0eafd06a8042
MD5 ea79a3ad71474a37f85d81b2033a652e
BLAKE2b-256 5076d63df6976171fffc4091ca38c017be86d96082ea71675e1aaa56ad96b93f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page