Skip to main content

Top level package

Project description

PumpkinPy - Semantic similarity implemented in python

About

PumpkinPy uses IC ordered bitmaps for fast ranking of genes and diseases. This is useful for larger ontologies such as Upheno and large datasets such as ranking all mouse genes given a set of input HPO terms. This approach was first used in OWLTools and OwlSim-v3.

The goal of this project was to build an implementation of the PhenoDigm algorithm in python. There are also implementations for common measures for distance and similarity (euclidean, cosine, Jin-Conrath, Resnik, jaccard)

Disclaimer: This is a side project and has little documetation and only a modest amount of testing

Getting Started

Requires python 3.8+ and python3-dev to install pyroaring

python3.8 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
export PYTHONPATH=.:$PYTHONPATH
Fetching annotations and closures

Uses robot and sparql to generate closures and class labels

Annotation data is fetched from the latest Monarch release

  • Requires >Java 8

cd data/monarch/ && make

PhenoDigm Reference: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3649640/
Exomiser: https://github.com/exomiser/Exomiser
OWLTools: https://github.com/owlcollab/owltools
OWLSim-v3: https://github.com/monarch-initiative/owlsim-v3

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pumpkin_py-0.0.1a1.tar.gz (18.7 MB view details)

Uploaded Source

Built Distribution

pumpkin_py-0.0.1a1-py3-none-any.whl (20.3 kB view details)

Uploaded Python 3

File details

Details for the file pumpkin_py-0.0.1a1.tar.gz.

File metadata

  • Download URL: pumpkin_py-0.0.1a1.tar.gz
  • Upload date:
  • Size: 18.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.26.0

File hashes

Hashes for pumpkin_py-0.0.1a1.tar.gz
Algorithm Hash digest
SHA256 e54cd23e295b102b4050a2ac2ae3eadc793b92c214b01002d547eae25906e342
MD5 fef8f4d48ece9f84866f94df360943ab
BLAKE2b-256 ebb095c25d35ed826c28c77bddfcfcee19b6aa61b0e9c2d3c7875c17d05b1706

See more details on using hashes here.

File details

Details for the file pumpkin_py-0.0.1a1-py3-none-any.whl.

File metadata

File hashes

Hashes for pumpkin_py-0.0.1a1-py3-none-any.whl
Algorithm Hash digest
SHA256 e4d7f9212bac679932b7cd67c09d34b39f5a44a6553c2013d475c087089bdb9e
MD5 ef6248eaf5a98a848da055ce01ecc87b
BLAKE2b-256 2bf53cf1c611446b7e8c549e406508201a652f2986fe33e699f90f89736dcfbb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page