DeepLC: Retention time prediction for (modified) peptides using Deep Learning.



Introduction

DeepLC is a retention time predictor for (modified) peptides that employs Deep Learning. Its strength lies in the fact that it can accurately predict retention times for modified peptides, even if it hasn't seen said modification during training.

DeepLC can be run with a graphical user interface (GUI) or as a Python package. In the latter case, DeepLC can be used from the command line or as a Python module.

Graphical user interface

Installation

  1. Download deeplc_gui.zip from the latest release and unzip.
  2. Install DeepLC GUI with install_gui_windows.bat or install_gui_linux.sh, depending on your operating system.
  3. Start the DeepLC GUI by running deeplc_gui.jar.

Python package

Installation

Install with conda, using the bioconda and conda-forge channels:
conda install -c bioconda -c conda-forge deeplc

Or install with pip:
pip install deeplc

Command line interface

To use the DeepLC CLI, run:

deeplc --file_pred <path/to/peptide_file.csv>

We highly recommend adding a peptide file with known retention times for calibration:

deeplc --file_pred <path/to/peptide_file.csv> --file_cal <path/to/peptide_file_with_tr.csv>

For an overview of all CLI arguments, run deeplc --help.
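
When DeepLC has to be called from another script, the CLI invocation above can be wrapped with Python's subprocess module. The following is a minimal sketch, not part of DeepLC itself: the helper name build_deeplc_command and the file names are hypothetical, and only the --file_pred and --file_cal arguments shown above are used. The actual call is left commented out because it requires DeepLC to be installed.

```python
import subprocess
from typing import List, Optional


def build_deeplc_command(file_pred: str, file_cal: Optional[str] = None) -> List[str]:
    """Assemble the argument list for the DeepLC CLI (hypothetical helper)."""
    cmd = ["deeplc", "--file_pred", file_pred]
    if file_cal is not None:
        # Calibration file with known retention times (recommended)
        cmd += ["--file_cal", file_cal]
    return cmd


cmd = build_deeplc_command("peptides.csv", "peptides_with_tr.csv")
print(" ".join(cmd))

# Uncomment to actually run DeepLC (requires the deeplc package):
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems with paths that contain spaces.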

Python module

Minimal example:

import pandas as pd
from deeplc import DeepLC

peptide_file = "datasets/test_pred.csv"
calibration_file = "datasets/test_train.csv"

# Peptides to predict; unmodified peptides get an empty modifications string
pep_df = pd.read_csv(peptide_file, sep=",")
pep_df['modifications'] = pep_df['modifications'].fillna("")

# Peptides with known retention times (tr column), used for calibration
cal_df = pd.read_csv(calibration_file, sep=",")
cal_df['modifications'] = cal_df['modifications'].fillna("")

# Calibrate on the known retention times, then predict
dlc = DeepLC()
dlc.calibrate_preds(seq_df=cal_df)
preds = dlc.make_preds(seq_df=pep_df)

For a more elaborate example, see examples/deeplc_example.py.

Input files

DeepLC expects comma-separated values (CSV) with the following columns:

  • seq: unmodified peptide sequences
  • modifications: MS2PIP-style formatted modifications. Each modification is written as location|name, and multiple modifications are joined with further pipes (for example, 12|Oxidation|18|Acetyl). location is an integer, counting from 1 for the first amino acid; 0 is reserved for N-terminal modifications and -1 for C-terminal modifications. name has to correspond to a Unimod (PSI-MS) name.
  • tr: retention time (only required for calibration)

For example:

seq,modifications,tr
AAGPSLSHTSGGTQSK,,12.1645
AAINQKLIETGER,6|Acetyl,34.095
AANDAGYFNDEMAPIEVKTK,12|Oxidation|18|Acetyl,37.3765

See examples/datasets for more examples.
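
To check a modifications string before handing it to DeepLC, the format described above can be parsed with a few lines of plain Python. This is an illustrative sketch only: parse_modifications and describe are hypothetical helpers, not DeepLC functions, and they do not verify that a name is a valid Unimod entry or that a location lies within the sequence.

```python
from typing import List, Tuple


def parse_modifications(mods: str) -> List[Tuple[int, str]]:
    """Split a string such as '12|Oxidation|18|Acetyl' into (location, name) pairs."""
    if not mods:
        return []  # unmodified peptide
    fields = mods.split("|")
    if len(fields) % 2 != 0:
        raise ValueError(f"Expected location|name pairs, got: {mods!r}")
    # Even positions hold locations, odd positions hold modification names.
    return [(int(loc), name) for loc, name in zip(fields[0::2], fields[1::2])]


def describe(seq: str, mods: str) -> List[str]:
    """Label each modification with the residue (or terminus) it sits on."""
    labels = []
    for loc, name in parse_modifications(mods):
        if loc == 0:
            site = "N-term"
        elif loc == -1:
            site = "C-term"
        else:
            site = f"{seq[loc - 1]}{loc}"  # locations are 1-based
        labels.append(f"{name}@{site}")
    return labels


print(describe("AANDAGYFNDEMAPIEVKTK", "12|Oxidation|18|Acetyl"))
# → ['Oxidation@M12', 'Acetyl@K18']
```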

Prediction models

DeepLC comes with multiple CNN models trained on data from various experimental settings:

Model filename                       Experimental settings  Publication
full_hc_dia_fixed_mods.hdf5          Reverse phase          Rosenberger et al. 2014
full_hc_LUNA_HILIC_fixed_mods.hdf5   HILIC                  Spicer et al. 2018
full_hc_LUNA_SILICA_fixed_mods.hdf5  HILIC                  Spicer et al. 2018
full_hc_PXD000954_fixed_mods.hdf5    Reverse phase          Rosenberger et al. 2014

By default, DeepLC selects the best model based on the calibration dataset. If no calibration is performed, the first default model is selected. Always record which models and which DeepLC version were used.
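
One way to keep track of the version and models is to write a small record next to the predictions. The sketch below assumes Python 3.8 or later (for importlib.metadata) and that the model filenames are supplied by the user; how to query the selected model from a DeepLC object is not covered here, and run_record and installed_version are hypothetical helper names.

```python
import json
from importlib import metadata
from typing import Dict, List, Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def run_record(models: List[str], package: str = "deeplc") -> Dict[str, object]:
    """Collect the package version and the model filenames used for a run."""
    return {
        "package": package,
        "version": installed_version(package),
        "models": list(models),
    }


# The model filename is supplied by hand here (taken from the table above).
record = run_record(["full_hc_dia_fixed_mods.hdf5"])
print(json.dumps(record, indent=2))
```

Storing this record as JSON alongside the prediction output makes it possible to reproduce a run later with the same models and version.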
