
VICC normalization routine for genes


Gene Normalizer


Overview

The Gene Normalizer provides tools for resolving ambiguous human gene references to consistently-structured, normalized terms. For gene concepts extracted from NCBI Gene, Ensembl, and HGNC, it designates a CURIE, and provides additional metadata like current and previously-used symbols, aliases, database cross-references and associations, and coordinates.




Install

The Gene Normalizer is available on PyPI:

python3 -m pip install gene-normalizer

See the installation instructions in the documentation for a description of installation options and data setup requirements.

Examples

Use the live service to programmatically normalize gene terms, as in the following truncated example:

$ curl 'https://normalize.cancervariants.org/gene/normalize?q=BRAF' | python -m json.tool
{
    "query": "BRAF",
    "match_type": 100,
    "gene_descriptor": {
        "label": "BRAF",
        "gene_id": "hgnc:1097",
        "xrefs": [
            "ensembl:ENSG00000157764",
            "ncbigene:673"
        ],
        "alternate_labels": [
            "BRAF1",
            "B-RAF1",
            "NS7",
            "RAFB1",
            "B-raf",
            "BRAF-1"
        ]
    }
    # ...
}
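The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_normalize_url`, `fetch_normalized`, and `summarize` are illustrative and not part of the Gene Normalizer package, and the response fields are assumed to match the truncated example above.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://normalize.cancervariants.org/gene/normalize"


def build_normalize_url(term: str) -> str:
    """Return the request URL for a gene term, with the query percent-encoded."""
    return f"{BASE_URL}?{urlencode({'q': term})}"


def fetch_normalized(term: str, timeout: float = 10.0) -> dict:
    """Query the live service and decode the JSON body (requires network access)."""
    with urlopen(build_normalize_url(term), timeout=timeout) as response:
        return json.load(response)


def summarize(payload: dict) -> dict:
    """Collect the normalized ID, label, and xrefs grouped by CURIE prefix.

    Assumes the response layout shown in the truncated example; a payload
    without a "gene_descriptor" yields an empty summary.
    """
    descriptor = payload.get("gene_descriptor") or {}
    xrefs: dict = {}
    for curie in descriptor.get("xrefs", []):
        # A CURIE has the form "prefix:reference", e.g. "ncbigene:673".
        prefix, _, reference = curie.partition(":")
        xrefs.setdefault(prefix, []).append(reference)
    return {
        "gene_id": descriptor.get("gene_id"),
        "label": descriptor.get("label"),
        "xrefs": xrefs,
    }
```

Calling `summarize(fetch_normalized("BRAF"))` would contact the live service and reduce the response to the fields above. Keeping the network call separate from the parsing step makes `summarize` usable on saved responses as well.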

Or use the Python API for fast access:

>>> from gene.database import create_db
>>> from gene.query import QueryHandler
>>> q = QueryHandler(create_db())
>>> result = q.normalize("KRAS")
>>> print(result.gene_descriptor.gene_id)
hgnc:6407

See the usage and normalization entries in the documentation for more.
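When normalizing many terms, it can help to wrap the handler in a small function. This sketch assumes only what the example above shows: an object with a `normalize(term)` method whose result exposes `gene_descriptor.gene_id`. The helper name `normalize_terms` is hypothetical, and treating a missing `gene_descriptor` as "no match" is an assumption rather than documented behavior.

```python
from typing import Any, Dict, Iterable, Optional


def normalize_terms(handler: Any, terms: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each input term to its normalized gene ID, or None if unresolved.

    `handler` is any object with a `normalize(term)` method, such as the
    QueryHandler instance from the example above.
    """
    results: Dict[str, Optional[str]] = {}
    for term in terms:
        result = handler.normalize(term)
        # Assumption: an unmatched term has no gene_descriptor (missing or None).
        descriptor = getattr(result, "gene_descriptor", None)
        results[term] = getattr(descriptor, "gene_id", None)
    return results
```

With the handler `q` from the example above, `normalize_terms(q, ["KRAS", "BRAF"])` would return a dictionary keyed by the input terms.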


Feedback and contributing

We welcome bug reports, feature requests, and code contributions from users and interested collaborators. The documentation contains guidance for submitting feedback and contributing new code.


