Skip to main content

DeepLC: Retention time prediction for (modified) peptides using Deep Learning.

Project description



PyPI - Python Version PyPI Conda GitHub release Build Status GitHub issues GitHub

DeepLC: Retention time prediction for (modified) peptides using Deep Learning.



Introduction

DeepLC is a retention time predictor for (modified) peptides that employs Deep Learning. It's strength lies in the fact that it can accurately predict retention times for modified peptides, even if hasn't seen said modification during training.

DeepLC can be run with a graphical user interface (GUI) or as a Python package. In the latter case, DeepLC can be used from the command line, or as a python module.

Graphical user interface

Installation

  • Download deeplc_gui.zip from the latest release and unzip.
  • Install DeepLC GUI with install_gui_windows.bat or install_gui_linux.sh, depending on your operating system.
  • Run DeepLC GUI with run_gui_windows.bat or run_gui_linux.sh, depending on your operating system.

Python package

Installation

Install with pip:

pip install deeplc

Command line interface

To use the DeepLC CLI, run:

deeplc --file_pred <path/to/peptide_file.csv>

We highly recommend to add a peptide file with known retention times for calibration:

deeplc --file_pred  <path/to/peptide_file.csv> --file_cal <path/to/peptide_file_with_tr.csv>

For an overview of all CLI arguments, run deeplc --help.

Python module

Minimal example:

import pandas as pd
from deeplc import DeepLC

peptide_file = "datasets/test_pred.csv"
calibration_file = "datasets/test_train.csv"

pep_df = pd.read_csv(peptide_file, sep=",")
pep_df['modifications'] = pep_df['modifications'].fillna("")

cal_df = pd.read_csv(calibration_file, sep=",")
cal_df['modifications'] = cal_df['modifications'].fillna("")

dlc = DeepLC()
dlc.calibrate_preds(seq_df=cal_df)
preds = dlc.make_preds(seq_df=pep_df)

For a more elaborate example, see examples/deeplc_example.py .

Input files

DeepLC expects comma-separated values (CSV) with the following columns:

  • seq: unmodified peptide sequences
  • modifications: MS2PIP-style formatted modifications: Every modification is listed as location|name, separated by a pipe (|) between the location, the name, and other modifications. location is an integer counted starting at 1 for the first AA. 0 is reserved for N-terminal modifications, -1 for C-terminal modifications. name has to correspond to a Unimod (PSI-MS) name.
  • tr: retention time (only required for calibration)

For example:

seq,modifications,tr
AAGPSLSHTSGGTQSK,,12.1645
AAINQKLIETGER,6|Acetyl,34.095
AANDAGYFNDEMAPIEVKTK,12|Oxidation|18|Acetyl,37.3765

See examples/datasets for more examples.

Prediction models

DeepLC comes with multiple CNN models trained on data from various experimental settings:

Model filename Experimental settings Publication PXD identifier
full_hc_dia_fixed_mods.hdf5
full_hc_LUNA_HILIC_fixed_mods.hdf5
full_hc_LUNA_SILICA_fixed_mods.hdf5
full_hc_PXD000954_fixed_mods.hdf5

By default, DeepLC selects the best model based on the calibration dataset. If no calibration is performed, the first default model is selected. Always keep note of the used models and the DeepLC version.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeplc-0.1.1.tar.gz (29.3 MB view details)

Uploaded Source

Built Distribution

deeplc-0.1.1-py3-none-any.whl (29.3 MB view details)

Uploaded Python 3

File details

Details for the file deeplc-0.1.1.tar.gz.

File metadata

  • Download URL: deeplc-0.1.1.tar.gz
  • Upload date:
  • Size: 29.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for deeplc-0.1.1.tar.gz
Algorithm Hash digest
SHA256 b1a3b3220499b90a690e4cab382d575732d8c4a7f763d84fc1b442c37ade1de4
MD5 0d4c65cd19f62f546f8d153f1706a1cc
BLAKE2b-256 e631ddb5e655770f881e37f1717365de003576ad15d43f181d5389623cda29f3

See more details on using hashes here.

Provenance

File details

Details for the file deeplc-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: deeplc-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 29.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for deeplc-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 acaf4900ea53f74c2b57568ea8756ad1ee08921590c6a9df728db60d6834f72c
MD5 88ac31cc233eebba352878cfd040fc7e
BLAKE2b-256 c2225530bffd8e368685da74948d0011be79256693f31108d4759bc46669bfac

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page