DeepLC: Retention time prediction for (modified) peptides using Deep Learning.
Project description
DeepLC: Retention time prediction for (modified) peptides using Deep Learning.
Introduction
DeepLC is a retention time predictor for (modified) peptides that employs Deep Learning. It's strength lies in the fact that it can accurately predict retention times for modified peptides, even if hasn't seen said modification during training.
DeepLC can be run with a graphical user interface (GUI) or as a Python package. In the latter case, DeepLC can be used from the command line, or as a python module.
Graphical user interface
Installation
- Download
deeplc_gui.zip
from the latest release and unzip. - Install DeepLC GUI with
install_gui_windows.bat
orinstall_gui_linux.sh
, depending on your operating system. - Run DeepLC GUI with
run_gui_windows.bat
orrun_gui_linux.sh
, depending on your operating system.
Python package
Installation
Install with pip:
pip install deeplc
Command line interface
To use the DeepLC CLI, run:
deeplc --file_pred <path/to/peptide_file.csv>
We highly recommend to add a peptide file with known retention times for calibration:
deeplc --file_pred <path/to/peptide_file.csv> --file_cal <path/to/peptide_file_with_tr.csv>
For an overview of all CLI arguments, run deeplc --help
.
Python module
Minimal example:
import pandas as pd
from deeplc import DeepLC
peptide_file = "datasets/test_pred.csv"
calibration_file = "datasets/test_train.csv"
pep_df = pd.read_csv(peptide_file, sep=",")
pep_df['modifications'] = pep_df['modifications'].fillna("")
cal_df = pd.read_csv(calibration_file, sep=",")
cal_df['modifications'] = cal_df['modifications'].fillna("")
dlc = DeepLC()
dlc.calibrate_preds(seq_df=cal_df)
preds = dlc.make_preds(seq_df=pep_df)
For a more elaborate example, see examples/deeplc_example.py .
Input files
DeepLC expects comma-separated values (CSV) with the following columns:
seq
: unmodified peptide sequencesmodifications
: MS2PIP-style formatted modifications: Every modification is listed aslocation|name
, separated by a pipe (|
) between the location, the name, and other modifications.location
is an integer counted starting at 1 for the first AA. 0 is reserved for N-terminal modifications, -1 for C-terminal modifications.name
has to correspond to a Unimod (PSI-MS) name.tr
: retention time (only required for calibration)
For example:
seq,modifications,tr
AAGPSLSHTSGGTQSK,,12.1645
AAINQKLIETGER,6|Acetyl,34.095
AANDAGYFNDEMAPIEVKTK,12|Oxidation|18|Acetyl,37.3765
See examples/datasets for more examples.
Prediction models
DeepLC comes with multiple CNN models trained on data from various experimental settings:
Model filename | Experimental settings | Publication | PXD identifier |
---|---|---|---|
full_hc_dia_fixed_mods.hdf5 | |||
full_hc_LUNA_HILIC_fixed_mods.hdf5 | |||
full_hc_LUNA_SILICA_fixed_mods.hdf5 | |||
full_hc_PXD000954_fixed_mods.hdf5 |
By default, DeepLC selects the best model based on the calibration dataset. If no calibration is performed, the first default model is selected. Always keep note of the used models and the DeepLC version.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file deeplc-0.1.1.tar.gz
.
File metadata
- Download URL: deeplc-0.1.1.tar.gz
- Upload date:
- Size: 29.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b1a3b3220499b90a690e4cab382d575732d8c4a7f763d84fc1b442c37ade1de4 |
|
MD5 | 0d4c65cd19f62f546f8d153f1706a1cc |
|
BLAKE2b-256 | e631ddb5e655770f881e37f1717365de003576ad15d43f181d5389623cda29f3 |
Provenance
File details
Details for the file deeplc-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: deeplc-0.1.1-py3-none-any.whl
- Upload date:
- Size: 29.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | acaf4900ea53f74c2b57568ea8756ad1ee08921590c6a9df728db60d6834f72c |
|
MD5 | 88ac31cc233eebba352878cfd040fc7e |
|
BLAKE2b-256 | c2225530bffd8e368685da74948d0011be79256693f31108d4759bc46669bfac |