A program to find key complex patterns in SAR data
Nonadditivity analysis
Synopsis
A program to find key complex patterns in SAR data
Installation
The program has been tested on Python 3.6.
You will need a copy of the RDKit cheminformatics toolkit, available from http://rdkit.org/. The easiest way to get it is to install it from PyPI with:
$ pip install rdkit-pypi
Install directly from source with:
$ pip install git+https://github.com/KramerChristian/NonadditivityAnalysis.git
Install the code in development mode with:
$ git clone https://github.com/KramerChristian/NonadditivityAnalysis.git
$ cd NonadditivityAnalysis
$ pip install -e .
The path to mmpdb has to be set on line 44 of the Nonadditivity analysis code. If a special salt clean-up is required, the path to the salt definitions can be set on line 43.
How to run the program and get help
The code runs as a simple command-line tool. Command line options are printed via
$ python -m nonadditivity -h
Example usage
Using the supplied test files, an example run is:
$ python -m nonadditivity -in hERG_ChEMBL.txt -delimiter tab -series_column ASSAY_CHEMBLID -props PCHEMBL_VALUE -units nM
Input file format
IDENTIFIER [sep] SMILES [sep] DATA ...
where [sep] is the separator and can be chosen from tab, space, comma, and semicolon.
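A minimal sketch of reading a file in this layout with Python's csv module. The function name `read_sar_table`, the `SEPARATORS` mapping, and the assumption that the first line is a header row are illustrative and not taken from the nonadditivity code.

```python
import csv
import io

# Hypothetical mapping from the documented delimiter names to characters.
SEPARATORS = {"tab": "\t", "space": " ", "comma": ",", "semicolon": ";"}


def read_sar_table(handle, delimiter="tab"):
    """Return (header, rows) from an IDENTIFIER / SMILES / DATA file.

    Assumes the first line is a header row (an assumption of this sketch).
    """
    reader = csv.reader(handle, delimiter=SEPARATORS[delimiter])
    header = next(reader)
    rows = [row for row in reader if row]  # skip blank lines
    return header, rows


# Small in-memory example with made-up identifiers and values.
example = io.StringIO(
    "ID\tSMILES\tPCHEMBL_VALUE\n"
    "cpd1\tCCO\t5.2\n"
    "cpd2\tCCN\t6.1\n"
)
header, rows = read_sar_table(example)
```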
Publication
If you use this code for a publication, please cite Kramer, C. Nonadditivity Analysis. J. Chem. Inf. Model. 2019, 59, 9, 4034–4042.
https://pubs.acs.org/doi/10.1021/acs.jcim.9b00631
Background
The overall process is:
1.) Parse input:
- read structures
- clean and transform activity data
- remove salts
2.) Compute MMPs
3.) Find double-transformation cycles
4.) Write to output & calculate statistics
1) Parse input
Ideally, the compounds are already standardized when input into nonadditivity analysis. The code will not correct tautomers and charge state, but it will attempt to desalt the input.
Since nonadditivity analysis only makes sense on normally distributed data, the input activity data can be transformed depending on the input units. You can choose from "M", "mM", "uM", "nM", "pM", and "noconv". The "xM" units are transformed to pActivity with the corresponding factors; "noconv" keeps the input as is and applies no transformation.
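The unit handling can be illustrated with a short sketch, assuming pActivity is defined as the negative base-10 logarithm of the concentration in mol/L. The names `UNIT_EXPONENTS` and `to_pactivity` are hypothetical, and the actual code may treat edge cases differently.

```python
import math

# Exponent that converts each unit to molar: value_in_M = value * 10**exponent.
UNIT_EXPONENTS = {"M": 0, "mM": -3, "uM": -6, "nM": -9, "pM": -12}


def to_pactivity(value, units):
    """Convert a concentration to pActivity = -log10(value in M).

    'noconv' returns the value unchanged.
    """
    if units == "noconv":
        return value
    if value <= 0:
        raise ValueError("concentration must be positive for a log transform")
    return -(math.log10(value) + UNIT_EXPONENTS[units])


p_from_nm = to_pactivity(100.0, "nM")   # 100 nM = 1e-7 M
p_unchanged = to_pactivity(6.3, "noconv")
```

Under this definition, 100 nM corresponds to a pActivity of 7, and a value given with "noconv" passes through untouched.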
For structures that occur more than once, only the first occurrence is kept.
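A minimal sketch of the keep-first rule, assuming duplicates are recognized by an identical SMILES string after clean-up. The function name `keep_first_occurrence` and the record layout are hypothetical; the actual code may compare structures differently.

```python
def keep_first_occurrence(records):
    """Keep only the first record for each SMILES.

    records: iterable of (identifier, smiles, value) tuples.
    """
    seen = set()
    unique = []
    for identifier, smiles, value in records:
        if smiles in seen:
            continue  # a later duplicate of an already seen structure
        seen.add(smiles)
        unique.append((identifier, smiles, value))
    return unique


# Made-up records: cpd3 repeats the structure of cpd1.
records = [
    ("cpd1", "CCO", 5.2),
    ("cpd2", "CCN", 6.1),
    ("cpd3", "CCO", 7.9),
]
unique_records = keep_first_occurrence(records)
```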
2) Compute MMPs
Matched pairs are computed from the cleaned structures. This is done by a subprocess call to the external mmpdb program. By default, 20 parallel jobs are used for the fragmentation. This can be changed on line 681.
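As a rough sketch, the fragmentation call could be assembled as below. This assumes that the `fragment` subcommand of the installed mmpdb version accepts `--num-jobs` and `--output` (check `mmpdb fragment --help`). The function name and file names are hypothetical, and this is not the command line the nonadditivity code itself builds.

```python
def build_fragment_command(smiles_file, fragments_file, num_jobs=20, mmpdb="mmpdb"):
    """Return the argument list for an mmpdb fragmentation run (not executed here)."""
    return [
        mmpdb, "fragment", smiles_file,
        "--num-jobs", str(num_jobs),   # assumed flag for the number of parallel jobs
        "--output", fragments_file,    # assumed flag for the fragment output file
    ]


command = build_fragment_command("cleaned.smi", "cleaned.fragments", num_jobs=4)
command_line = " ".join(command)
# To actually run it: subprocess.run(command, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when file names contain spaces.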
3) Find double-transformation cycles
This is the heart of the nonadditivity algorithm. Here, sets of four compounds that are linked by two transformations are identified. For more details about the interpretation, see the publication above.
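The idea of a double-transformation cycle can be sketched as follows. This is an illustration under stated assumptions, not the algorithm used in the code: matched pairs are given as directed (from, to, transformation) tuples, the transformation labels are made up, and the sign convention of the nonadditivity value is one possible choice.

```python
from collections import defaultdict
from itertools import combinations


def find_cycles(pairs):
    """Find four-compound cycles (a, b, c, d) in a list of matched pairs.

    pairs: iterable of (compound_from, compound_to, transformation).
    A cycle has a->b and c->d sharing one transformation and
    a->c and b->d sharing a second, different transformation.
    """
    by_transformation = defaultdict(list)
    edge = {}
    for src, dst, transformation in pairs:
        by_transformation[transformation].append((src, dst))
        edge[(src, dst)] = transformation

    cycles, seen = [], set()
    for transformation, edges in by_transformation.items():
        for (a, b), (c, d) in combinations(edges, 2):
            t_ac = edge.get((a, c))
            t_bd = edge.get((b, d))
            key = frozenset((a, b, c, d))
            if (t_ac is not None and t_ac == t_bd and t_ac != transformation
                    and len(key) == 4 and key not in seen):
                seen.add(key)  # report each set of four compounds once
                cycles.append((a, b, c, d))
    return cycles


def nonadditivity(cycle, pact):
    """Difference between the two parallel activity changes in a cycle."""
    a, b, c, d = cycle
    return (pact[d] - pact[c]) - (pact[b] - pact[a])


# Made-up example: two transformations linking four compounds.
pairs = [
    ("A", "B", "H>>F"), ("C", "D", "H>>F"),
    ("A", "C", "H>>Me"), ("B", "D", "H>>Me"),
]
pact = {"A": 5.0, "B": 6.0, "C": 5.5, "D": 8.0}
cycles = find_cycles(pairs)
values = [nonadditivity(cycle, pact) for cycle in cycles]
```

In this example the H>>F change adds 1.0 log unit going from A to B but 2.5 going from C to D, so the cycle has a nonadditivity of 1.5 log units. For perfectly additive data the value would be 0.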
4) Write to output and calculate statistics
Information about the compounds making up the cycles and the distribution of nonadditivity is written to output files. In the file names below, [...] denotes the input file name.
The file named
"Additivity_diffs"[...]".txt"
contains information about the cycles and the probability distribution.
The file named
"Additivity_diffs"[...]"_perCompound.txt"
contains information about the nonadditivity aggregated per compound across all cycles in which a given compound occurs.
The file named
"Additivity_diffs"[...]"_c2c.txt"
links the two files above and can be used, for example, for visualizations in SpotFire.
Copyright
The NonadditivityAnalysis code is copyright 2015-2019 by F. Hoffmann-La Roche Ltd and distributed under the 3-clause BSD license (see LICENSE.txt).