VICC normalization routine for variants
Project description
Variant Normalization
Services and guidelines for normalizing variant terms
Backend Services
Variant Normalization relies on some local data caches which you will need to set up. It uses pipenv to manage its environment, which you will also need to install.
Installation
Variant Normalization relies on seqrepo, which you must download yourself.
From the variant directory of the repository:
pipenv sync
pip install seqrepo
mkdir -p data/seqrepo
seqrepo -r data/seqrepo pull -i 2021-01-29
sudo chmod -R u+w data/seqrepo
cd data/seqrepo
seqrepo_date_dir=$(ls -d */)
sudo mv $seqrepo_date_dir latest
Variant Normalizer also uses uta.
To install:
uta_v=uta_20180821
docker pull biocommons/uta:$uta_v
export UTA_DB_URL=postgresql://anonymous@localhost:5432/uta/uta_20180821
docker-compose -f docker-compose.yml up
Data
Variant Normalization uses Ensembl BioMart to retrieve variant/data/transcript_mappings.tsv
. We currently use Human Genes (GRCh38.p13)
for the dataset and the following attributes we use are: Gene stable ID, Gene stable ID version, Transcript stable ID, Transcript stable ID version, Protein stable ID, Protein stable ID version, RefSeq match transcript (MANE Select), Gene name.
Setting up Gene Normalizer
Variant Normalization normalize
endpoint relies on data from Gene Normalization. To install:
pip install gene-normalizer
To setup, follow the instructions from the Gene Normalization README.
You must have the Gene Normalizer DynamoDB running for the variant normalize
endpoint to work.
Init coding style tests
Code style is managed by flake8 and checked prior to commit.
We use pre-commit to run conformance tests.
This ensures:
- Check code style
- Check for added large files
- Detect AWS Credentials
- Detect Private Key
Before first commit run:
pre-commit install
Testing
From the root directory of the repository:
pytest tests/
Starting the Variant Normalization Service
gene-normalizer
s dynamodb must be running and run the following:
docker-compose -f docker-compose.yml up
From the root directory of the repository:
uvicorn variant.main:app --reload
Next, view the OpenAPI docs on your local machine: http://127.0.0.1:8000/variant
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file variant-normalizer-0.2.4.tar.gz
.
File metadata
- Download URL: variant-normalizer-0.2.4.tar.gz
- Upload date:
- Size: 69.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.9.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 163d5ba824d8693f7d766aaa08010e01659c72192a95f62865662a40d098eb68 |
|
MD5 | c53ba922d78671a264216c6355dd51bb |
|
BLAKE2b-256 | 81de55b261f26800649a2273597e4d6abfcf353666a1bc65c930430dce7ec2a8 |
File details
Details for the file variant_normalizer-0.2.4-py3-none-any.whl
.
File metadata
- Download URL: variant_normalizer-0.2.4-py3-none-any.whl
- Upload date:
- Size: 7.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.0 CPython/3.9.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9804ca4585aa64a48724d52f38473c0af9fcc3c1968cde72a1c7bf434d0e93d8 |
|
MD5 | 1beaa2c871359246583c94124af94662 |
|
BLAKE2b-256 | 264c8f2d524a0d7eae9639cbabe120272b720b6075ce3d8f607c262669e4bef7 |