biokeen

A package for training and evaluating biological knowledge graph embeddings

These details have not been verified by PyPI

Project links

Development Status
- 1 - Planning
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Project description

BioKEEN (Biological KnowlEdge EmbeddiNgs) is a package for training and evaluating biological knowledge graph embeddings built on PyKEEN.

Because we use PyKEEN as the underlying software package, implementations of 10 knowledge graph embedding models are currently available for BioKEEN. Furthermore, BioKEEN can be run in training mode in which users provide their own set of hyper-parameter values, or in hyper-parameter optimization mode to find suitable hyper-parameter values from set of user defined values.

Through the integration of the Bio2BEL [2] software numerous biomedical databases are directly accessible within BioKEEN.

BioKEEN can also be run without having experience in programing by using its interactive command line interface that can be started with the command “biokeen” from a terminal.

Tutorials

A brief tutorial on how to get started with BioKEEN is available here.

https://i.vimeocdn.com/video/755767182.jpg?mw=1100&mh=619&q=70

Further tutorials are can be found in the notebooks directory and in our documentation.

Citation

If you find BioKEEN useful in your work, please consider citing:

Note: ComPath has been updated, for this reason we have uploaded the dataset version that we have used for our experiments: dataset

Installation

To install biokeen, Python 3.6+ is required, and we recommend to install it on Linux or Mac OS systems. Please run following command:

$ pip install git+https://github.com/SmartDataAnalytics/BioKEEN.git

Alternatively, it can be installed from the source for development with:

$ git clone https://github.com/SmartDataAnalytics/BioKEEN.git biokeen
$ cd biokeen
$ pip install -e .

Contributing

Contributions, whether filing an issue, making a pull request, or forking, are appreciated. See CONTRIBUTING.rst for more information on getting involved.

CLI Usage

To show BioKEEN’s available commands, please run following command:

biokeen

Starting the Training/HPO Pipeline - Set Up Your Experiment within 60 seconds

To configure an experiment via the CLI, please run following command:

biokeen start

To start BioKEEN with an existing configuration file, please run the following command:

biokeen start -f /path/to/config.json

Starting the Prediction Pipeline

To make prediction based on a trained model, please run following command:

biokeen predict -m /path/to/model/directory -d /path/to/data/directory

where the value for the argument -m is the directory containing the model, in more detail following files must be contained in the directory:

configuration.json
entities_to_embeddings.json
relations_to_embeddings.json
trained_model.pkl

These files are created automatically created after model is trained (and evaluated) and exported in your specified output directory.

The value for the argument -d is the directory containing the data for which inference should be applied, and it needs to contain following files:

entities.tsv
relations.tsv

where entities.tsv contains all entities of interest, and relations.tsv all relations. Both files should contain should contain a single column containing all the entities/relations. Based on these files, PyKEEN will create all triple permutations, and computes the predictions for them, and saves them in data directory in predictions.tsv.

Summarize the Results of All Experiments

To summarize the results of all experiments, please run following command:

biokeen summarize -d /path/to/experiments/directory -o /path/to/output/file.csv

Getting Bio2BEL Data

To download and structure the data from a Bio2BEL repository, run:

biokeen data get <name>

Where <name> can be any repository name in Bio2BEL such as hippie, mirtarbase.

References

Project details

These details have not been verified by PyPI

Project links

Development Status
- 1 - Planning
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Release history Release notifications | RSS feed

This version

0.0.14

Dec 11, 2019

0.0.13

Apr 18, 2019

0.0.12

Feb 4, 2019

0.0.11

Jan 18, 2019

0.0.10

Dec 30, 2018

0.0.9

Dec 30, 2018

0.0.8

Dec 12, 2018

0.0.7

Nov 26, 2018

0.0.6

Nov 21, 2018

0.0.5

Nov 21, 2018

0.0.4

Nov 21, 2018

0.0.3

Nov 8, 2018

0.0.2

Nov 7, 2018

0.0.1

Oct 9, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biokeen-0.0.14.tar.gz (271.7 kB view details)

Uploaded Dec 11, 2019 Source

Built Distribution

biokeen-0.0.14-py3-none-any.whl (17.2 kB view details)

Uploaded Dec 11, 2019 Python 3

File details

Details for the file biokeen-0.0.14.tar.gz.

File metadata

Download URL: biokeen-0.0.14.tar.gz
Upload date: Dec 11, 2019
Size: 271.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for biokeen-0.0.14.tar.gz
Algorithm	Hash digest
SHA256	`63a13217bd33bb938ad085bc7cfbdbf0b04c2ba6679bf3d245991a56f56eb50a`
MD5	`60edd7ec57bfa793b4329a9c0e1e74e2`
BLAKE2b-256	`dce696b1415378bb1c6579900dd7265a72ff8eb0842996a6e7a8184faaa48eec`

See more details on using hashes here.

Provenance

File details

Details for the file biokeen-0.0.14-py3-none-any.whl.

File metadata

Download URL: biokeen-0.0.14-py3-none-any.whl
Upload date: Dec 11, 2019
Size: 17.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for biokeen-0.0.14-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7b965e246a1709fdd7859306176f251bb82c4ed60d4f24a65ef53847ae998382`
MD5	`00acf7a897e1b43b9d11ccd6aa594893`
BLAKE2b-256	`b0d59ad943e7b929859d6c4d2ac834482376048f3d93766acf303c45323d3425`

See more details on using hashes here.

biokeen 0.0.14

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Tutorials

Citation

Installation

Contributing

CLI Usage

Starting the Training/HPO Pipeline - Set Up Your Experiment within 60 seconds

Starting the Prediction Pipeline

Summarize the Results of All Experiments

Getting Bio2BEL Data

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

biokeen 0.0.14

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Share Your Experimental Artifacts

Tutorials

Citation

Installation

Contributing

CLI Usage

Starting the Training/HPO Pipeline - Set Up Your Experiment within 60 seconds

Starting the Prediction Pipeline

Summarize the Results of All Experiments

Getting Bio2BEL Data

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance