Skip to main content

VICC normalization routines for genes

Project description

Gene Normalizer

image image image Actions status DOI

Overview

The Gene Normalizer provides tools for resolving ambiguous human gene references to consistently-structured, normalized terms. For gene concepts extracted from NCBI Gene, Ensembl, and HGNC, it designates a CURIE, and provides additional metadata like current and previously-used symbols, aliases, database cross-references and associations, and coordinates.


Live service

Documentation · Installation · Usage · API reference


Install

The Gene Normalizer is available on PyPI:

python3 -m pip install gene-normalizer

See installation instruction in the documentation for a description of installation options and data setup requirements.

Examples

Use the live service to programmatically normalize gene terms, as in the following truncated example:

$ curl 'https://normalize.cancervariants.org/gene/normalize?q=BRAF' | python -m json.tool
{
    "query": "BRAF",
    "match_type": 100,
    "normalized_id": "hgnc:1097",
    "gene": {
        "type": "Gene",
        "id": "normalize.gene.hgnc:1097"
        "label": "BRAF",
        "gene_id": "hgnc:1097",
        "aliases": [
            "BRAF1",
            "B-RAF1",
            "NS7",
            "RAFB1",
            "B-raf",
            "BRAF-1"
        ]
    }
    # ...
}

Or utilize the Python API for fast access:

>>> from gene.database import create_db
>>> from gene.query import QueryHandler
>>> q = QueryHandler(create_db())
>>> result = q.normalize("KRAS")
>>> result.normalized_id
'hgnc:6407'

See the usage and normalization entries in the documentation for more.

Feedback and contributing

We welcome bug reports, feature requests, and code contributions from users and interested collaborators. The documentation contains guidance for submitting feedback and contributing new code.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gene_normalizer-0.4.0.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

gene_normalizer-0.4.0-py3-none-any.whl (58.2 kB view details)

Uploaded Python 3

File details

Details for the file gene_normalizer-0.4.0.tar.gz.

File metadata

  • Download URL: gene_normalizer-0.4.0.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for gene_normalizer-0.4.0.tar.gz
Algorithm Hash digest
SHA256 2bfc7b997d9cb98bcb0626d57db75f6c70270a89a2d401cf2beb2824d000950e
MD5 63b94097ee6b7aa9f0997e5184f3cc21
BLAKE2b-256 3367b219c880fe448dc3fe91a210faaf3695171168af98378173a891f07777d8

See more details on using hashes here.

File details

Details for the file gene_normalizer-0.4.0-py3-none-any.whl.

File metadata

File hashes

Hashes for gene_normalizer-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 028e8dda3553e54109c680e08dc0a232b3d2e3aece875656a0e1496414bf52d6
MD5 5aaa5509cd92bd12e59d4152eacb59c1
BLAKE2b-256 7b06badd5f897f3be1b0394dd0230910af546cfd91a6a377a5040533b8059600

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page