A Python package for comparing the effect of pathway database choice in functional enrichment and classification methods

Project description

A Python package for benchmarking pathway databases on functional enrichment and prediction tasks.

Installation

pathway_forte can be installed from PyPI with the following command in your terminal:

$ python3 -m pip install pathway_forte

The latest code can be installed from GitHub with:

$ python3 -m pip install git+https://github.com/pathwayforte/pathway-forte.git

For developers, the code can be installed with:

$ git clone https://github.com/pathwayforte/pathway-forte.git
$ cd pathway-forte
$ python3 -m pip install -e .
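
Whichever installation route is used, one way to confirm that the package is visible to the current interpreter is to query its installed metadata. The snippet below is a minimal sketch that uses only the Python standard library (3.8+); the helper name installed_version is hypothetical, and the distribution name is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


# Prints a version string if the package is installed, otherwise None.
print(installed_version("pathway_forte"))
```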

Main Commands

The table below lists the main commands of PathwayForte.

Command       Action
datasets      Lists of Cancer Datasets
export        Export Gene Sets using ComPath
ora           List of ORA Analyses
fcs           List of FCS Analyses
prediction    List of Prediction Methods

Functional Enrichment Methods

  • ora. Lists Over-Representation Analyses (e.g., the one-tailed hypergeometric test).

  • fcs. Lists Functional Class Score Analyses such as GSEA and ssGSEA using GSEAPy.
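
To make the one-tailed hypergeometric test mentioned under ora concrete, here is a minimal, dependency-free sketch of the p-value computation. It is not the package's own implementation; the function name ora_p_value and its parameter names are assumptions chosen for illustration. The p-value is the probability of seeing at least the observed overlap between a query gene list and a pathway when genes are drawn at random from the universe.

```python
from math import comb


def ora_p_value(n_universe: int, n_pathway: int, n_query: int, n_overlap: int) -> float:
    """Upper-tail hypergeometric p-value, P(X >= n_overlap).

    n_universe: number of genes in the background universe
    n_pathway:  number of universe genes annotated to the pathway
    n_query:    number of genes in the query list (e.g., differentially expressed)
    n_overlap:  number of query genes that belong to the pathway
    """
    largest_overlap = min(n_pathway, n_query)
    total = comb(n_universe, n_query)
    favorable = sum(
        comb(n_pathway, i) * comb(n_universe - n_pathway, n_query - i)
        for i in range(n_overlap, largest_overlap + 1)
    )
    return favorable / total


# Toy numbers: 10 genes in the universe, 4 in the pathway,
# 3 in the query list, 2 of which fall in the pathway.
p = ora_p_value(10, 4, 3, 2)
```

In practice the test is run once per pathway, so the resulting p-values are usually corrected for multiple testing before interpretation.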

Prediction Methods

pathway_forte supports three prediction methods (binary classification, multi-class classification with SVMs, and survival analysis), all of which use individualized pathway activity scores. The scores can be calculated with a variety of tools (see [1]) from any pathway database whose gene sets can be exported.

  • binary. Trains an elastic net model for a binary classification task (e.g., tumor vs. normal patients). Training uses a nested cross-validation approach, and the number of folds in both loops can be selected. The model can easily be changed, since most of the models in scikit-learn (the machine learning library used by this package) require the same input.

  • subtype. Trains an SVM model for a multi-class classification task (e.g., predicting tumor subtypes). Training uses a nested cross-validation approach, and the number of folds in both loops can be selected. As with the binary classification task, other models can quickly be substituted.

  • survival. Trains a Cox proportional hazards model with an elastic net penalty. Training uses a nested cross-validation approach with a grid search in the inner loop. This analysis requires pathway activity scores, patient classes, and patient lifetime information.
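
The nested cross-validation scheme shared by the three methods can be sketched without any dependencies. The example below is illustrative only and is not the package's code: a 1-D k-nearest-neighbour classifier stands in for the elastic net and SVM models, the toy scores are made up, and all function names are hypothetical. The point is the structure: the inner loop picks a hyperparameter using only the outer training fold, and the outer loop scores that choice on data the inner loop never saw.

```python
from statistics import mean
from typing import Iterator, List, Sequence, Tuple

Sample = Tuple[float, int]  # (pathway activity score, class label)


def k_fold(items: Sequence[Sample], k: int) -> Iterator[Tuple[List[Sample], List[Sample]]]:
    """Yield (train, test) pairs for k interleaved folds."""
    folds = [list(items[i::k]) for i in range(k)]
    for i, test in enumerate(folds):
        train = [s for j, fold in enumerate(folds) if j != i for s in fold]
        yield train, test


def knn_accuracy(train: Sequence[Sample], test: Sequence[Sample], k: int) -> float:
    """Accuracy of a 1-D k-NN classifier (stand-in model) on the test samples."""
    correct = 0
    for x, label in test:
        nearest = sorted(train, key=lambda s: abs(s[0] - x))[:k]
        votes = sum(lbl for _, lbl in nearest)
        predicted = 1 if 2 * votes > len(nearest) else 0
        correct += predicted == label
    return correct / len(test)


def nested_cv(data: Sequence[Sample], candidates: Sequence[int],
              outer_k: int = 3, inner_k: int = 2) -> List[float]:
    """Return one held-out accuracy per outer fold."""
    outer_scores = []
    for outer_train, outer_test in k_fold(data, outer_k):
        # Inner loop: select the hyperparameter on the outer training data only.
        def inner_score(k: int) -> float:
            return mean(knn_accuracy(tr, va, k) for tr, va in k_fold(outer_train, inner_k))

        best_k = max(candidates, key=inner_score)
        # Outer loop: evaluate the selected setting on the untouched test fold.
        outer_scores.append(knn_accuracy(outer_train, outer_test, best_k))
    return outer_scores


# Toy data: two well-separated classes of six samples each.
toy = [(0.1 * i, 0) for i in range(6)] + [(2.0 + 0.1 * i, 1) for i in range(6)]
scores = nested_cv(toy, candidates=[1, 3])
```

Because the stand-in model is reached only through knn_accuracy, replacing it with another estimator changes a single function, which mirrors the interchangeability of models noted for the binary and subtype tasks.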

Other

  • export. Exports GMT files with the current gene sets for the pathway databases included in ComPath [2].

  • datasets. Lists the TCGA data sets [3] that are ready to run in pathway_forte.
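
For readers unfamiliar with the GMT format, each line describes one gene set as tab-separated fields: the set name, a description, and then the member genes. The sketch below writes and reads that layout with the standard library only. It does not reproduce the export command; the function names and the example pathway and gene names are hypothetical.

```python
from typing import Dict, List


def to_gmt(gene_sets: Dict[str, List[str]], description: str = "NA") -> str:
    """Serialize gene sets to GMT text: name, description, then genes, tab-separated."""
    lines = [
        "\t".join([name, description, *sorted(genes)])
        for name, genes in gene_sets.items()
    ]
    return "\n".join(lines) + "\n"


def from_gmt(text: str) -> Dict[str, List[str]]:
    """Parse GMT text back into a mapping from gene set name to gene list."""
    parsed = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        name, _description, *genes = line.split("\t")
        parsed[name] = genes
    return parsed


# Hypothetical gene sets, for illustration only.
example = {
    "pathway_a": ["GENE3", "GENE1", "GENE2"],
    "pathway_b": ["GENE2", "GENE4"],
}
gmt_text = to_gmt(example)
round_trip = from_gmt(gmt_text)
```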

References


Source Distribution

pathway-forte-0.0.2.tar.gz (1.4 MB)

Uploaded Source

File details

Details for the file pathway-forte-0.0.2.tar.gz.

File metadata

  • Download URL: pathway-forte-0.0.2.tar.gz
  • Size: 1.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/36.5.0 requests-toolbelt/0.8.0 tqdm/4.31.1 CPython/3.6.2

File hashes

Hashes for pathway-forte-0.0.2.tar.gz
Algorithm Hash digest
SHA256 c37fcbee5cf7d6bd9388aba17b98861bb48a7295d7f7a2946b46371ac4e7cdc7
MD5 13fd515852f7dcafe112f75b835cf2b7
BLAKE2b-256 eb6cdb03257fa64c9cc127e93b2c13d9ba601f3070181e793a3509d4d18444bf
