
A Hybrid Data and Knowledge Driven Framework for Generating Patient Representations


CLEP: A Hybrid Data- and Knowledge- Driven Framework for Generating Patient Representations

General Info

CLEP is a framework that contains novel methods for generating patient representations from any patient-level data and its corresponding prior knowledge encoded in a knowledge graph. The framework is depicted in the graphic below.

Installation

NOTE: Installing CLEP requires R to be installed on your system, along with the limma package for R. R can be downloaded from CRAN. The limma package can be installed in R with the following commands:

if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("limma")

The code can be installed from PyPI with:

$ pip install clep

The most recent code can be installed from the source on GitHub with:

$ pip install git+https://github.com/hybrid-kg/clep.git

For developers, the repository can be cloned from GitHub and installed in editable mode with:

$ git clone https://github.com/hybrid-kg/clep.git
$ cd clep
$ pip install -e .

Documentation

Read the official docs for more information.

Input Data Formats

Data

Symbol Sample_1 Sample_2 Sample_3
HGNC_ID_1 0.354 2.568 1.564
HGNC_ID_2 1.255 1.232 0.26452
HGNC_ID_3 3.256 1.5 1.5462

Note: The data must be in a tab-separated file format.
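
As a quick sanity check before running CLEP, the layout above can be parsed with the Python standard library. This is only a sketch: `read_data_matrix` is a hypothetical helper and not part of the clep package, and it assumes the first column holds the feature identifier and every other column holds numeric values for one sample.

```python
import csv
import io


def read_data_matrix(handle):
    """Parse a tab-separated data matrix into (samples, {symbol: {sample: value}}).

    Hypothetical helper for illustration; not part of the clep API.
    """
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)
    samples = header[1:]  # assumption: the first column is the feature identifier
    matrix = {}
    for row in reader:
        if not row:
            continue  # tolerate blank lines
        if len(row) != len(header):
            raise ValueError(
                f"Row for {row[0]!r} has {len(row)} fields, expected {len(header)}"
            )
        matrix[row[0]] = {s: float(v) for s, v in zip(samples, row[1:])}
    return samples, matrix


# The example table from above, written with real tab characters.
example = (
    "Symbol\tSample_1\tSample_2\tSample_3\n"
    "HGNC_ID_1\t0.354\t2.568\t1.564\n"
    "HGNC_ID_2\t1.255\t1.232\t0.26452\n"
    "HGNC_ID_3\t3.256\t1.5\t1.5462\n"
)
samples, matrix = read_data_matrix(io.StringIO(example))
print(samples)
print(matrix["HGNC_ID_2"]["Sample_3"])
```

A file that was saved with spaces or commas instead of tabs will typically fail here with a field-count or float-conversion error, which is easier to diagnose than a failure inside the pipeline.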

Design

FileName Target
Sample_1 Abnormal
Sample_2 Abnormal
Sample_3 Control

Note: The design file must be in a tab-separated file format.
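
The examples suggest that each `FileName` entry in the design file corresponds to a sample column in the data file. Assuming that is the intended relationship (the text above does not state it explicitly), a small check like the following can catch mismatched sample names early. Both `read_design` and `unmatched_samples` are hypothetical helpers, not clep functions.

```python
import csv
import io


def read_design(handle):
    """Parse a tab-separated design file into {sample_name: target_label}."""
    reader = csv.DictReader(handle, delimiter="\t")
    return {row["FileName"]: row["Target"] for row in reader}


def unmatched_samples(data_header, design):
    """Return (samples missing from the design, samples missing from the data).

    Assumption: every data column after the first is a sample name that
    should also appear in the design file's FileName column.
    """
    data_samples = set(data_header[1:])
    design_samples = set(design)
    return (
        sorted(data_samples - design_samples),
        sorted(design_samples - data_samples),
    )


design_text = (
    "FileName\tTarget\n"
    "Sample_1\tAbnormal\n"
    "Sample_2\tAbnormal\n"
    "Sample_3\tControl\n"
)
design = read_design(io.StringIO(design_text))
data_header = ["Symbol", "Sample_1", "Sample_2", "Sample_3"]

missing_in_design, missing_in_data = unmatched_samples(data_header, design)
print(missing_in_design, missing_in_data)
```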

Knowledge Graph

CLEP handles knowledge graphs in a modified version of the edge list format, which looks as follows:

Source Relation Target
HGNC_ID_1 association HGNC_ID_2
HGNC_ID_2 decreases HGNC_ID_3
HGNC_ID_3 increases HGNC_ID_1

Note: The knowledge graph must be in a tab-separated file format. If your knowledge graph does not have relations between the source and the target, populate the relation column with "No Relation".
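
If a graph is only available as source-target pairs, the relation column can be filled in programmatically. The sketch below uses only the standard library; `add_placeholder_relation` and `write_edge_list` are hypothetical names chosen for illustration, and the header row mirrors the example above.

```python
import csv
import io


def add_placeholder_relation(edges, placeholder="No Relation"):
    """Yield (source, relation, target) triples, filling in missing relations."""
    for edge in edges:
        if len(edge) == 2:
            source, target = edge
            yield (source, placeholder, target)
        elif len(edge) == 3:
            yield tuple(edge)
        else:
            raise ValueError(f"Expected 2 or 3 fields per edge, got {len(edge)}")


def write_edge_list(triples, handle):
    """Write triples as a tab-separated edge list with a header row."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["Source", "Relation", "Target"])
    writer.writerows(triples)


# One edge without a relation and one edge that already has one.
edges = [
    ("HGNC_ID_1", "HGNC_ID_2"),
    ("HGNC_ID_2", "decreases", "HGNC_ID_3"),
]
buffer = io.StringIO()
write_edge_list(add_placeholder_relation(edges), buffer)
edge_list_text = buffer.getvalue()
print(edge_list_text)
```

To write to disk instead of an in-memory buffer, pass a file opened with `open(path, "w", newline="")` as the handle.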

Usage

Note: These are only the basic clep commands; the detailed options for each command can be found in the documentation.

  1. Radical Searching

The following command finds the samples with extreme feature values relative to the control population.

$ clep sample-scoring radical-search --data <DATA_FILE> --design <DESIGN_FILE> --control Control --threshold 2.5 --control_based --ret_summary --out <OUTPUT_DIR>

  2. Graph Generation

The following command generates the patient-gene network based on the chosen method (interaction_network).

$ clep embedding generate-network --data <SCORED_DATA_FILE> --method interaction_network --ret_summary --out <OUTPUT_DIR>

  3. Knowledge Graph Embedding

The following command generates the embedding of the network passed to it.

$ clep embedding kge --data <NETWORK_FILE> --design <DESIGN_FILE> --model_config <MODEL_CONFIG.json> --train_size 0.8 --validation_size 0.1 --out <OUTPUT_DIR>

  4. Classification

The following command carries out classification on the given data file for a chosen model (Elastic Net) with 100 hyper-parameter optimization trials.

$ clep classify --data <EMBEDDING_FILE> --model elastic_net --num-trials 100 --out <OUTPUT_DIR>
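
The four commands above form a pipeline in which each step consumes a file produced by the previous one. The sketch below assembles them as argument lists from Python, using exactly the flags shown above. `build_pipeline` is a hypothetical helper, and every file name in the example call is a placeholder: the names of the files each step writes into the output directory are not given here, so they must be looked up in the documentation or in the output directory itself.

```python
import shlex


def build_pipeline(data, design, scored, network, model_config, embedding, out):
    """Return the four clep invocations as argument lists.

    All paths are supplied by the caller; the intermediate file names
    (scored, network, embedding) are placeholders, not clep defaults.
    """
    return [
        ["clep", "sample-scoring", "radical-search",
         "--data", data, "--design", design, "--control", "Control",
         "--threshold", "2.5", "--control_based", "--ret_summary", "--out", out],
        ["clep", "embedding", "generate-network",
         "--data", scored, "--method", "interaction_network",
         "--ret_summary", "--out", out],
        ["clep", "embedding", "kge",
         "--data", network, "--design", design, "--model_config", model_config,
         "--train_size", "0.8", "--validation_size", "0.1", "--out", out],
        ["clep", "classify",
         "--data", embedding, "--model", "elastic_net",
         "--num-trials", "100", "--out", out],
    ]


commands = build_pipeline(
    data="data.tsv",                   # placeholder paths for illustration
    design="design.tsv",
    scored="scored_data.tsv",
    network="network.tsv",
    model_config="model_config.json",
    embedding="embedding.tsv",
    out="results",
)
for argv in commands:
    print(shlex.join(argv))  # show each command as it would be typed in a shell
```

Each list can be passed to `subprocess.run(argv, check=True)` so that a failing step stops the pipeline instead of feeding a missing file to the next command.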

Issues

If you have difficulties using CLEP, please open an issue at our GitHub repository.

Acknowledgements

Citation

If you have found CLEP useful in your work, please consider citing:

CLEP: A Hybrid Data- and Knowledge- Driven Framework for Generating Patient Representations.
Bharadhwaj, V. S., Ali, M., Birkenbihl, C., Mubeen, S., Lehmann, J., Hofmann-Apitius, M., Hoyt, C. T., & Domingo-Fernandez, D. (2021).
Bioinformatics, btab340.

Graphics

The CLEP logo and framework graphic were designed by Carina Steinborn.

Disclaimer

CLEP is scientific software that has been developed in an academic capacity, and thus comes with no warranty or guarantee of maintenance, support, or back-up of data.
