Skip to main content

Common RNAseq normalization methods

Project description

Build Status Code Style Black Documentation Status Version on PyPI Supported Python versions Number of downloads from PyPI

Python implementation of common RNAseq normalization methods:

  • CPM (Counts per million)

  • FPKM (Fragments per kilobase million)

  • TPM (Transcripts per million)

  • UQ (Upper quartile)

  • CUF (Counts adjusted with UQ factors)

  • TMM (Trimmed mean of M-values)

  • CTF (Counts adjusted with TMM factors)

Features

  • Pure Python implementation (no need for R, etc.)

  • Scikit-learn compatible

  • Command line interface

  • Verbose documentation (at least we hope so…)

  • Tested methods

Install

We recommend installing RNAnorm with pip:

pip install rnanorm

Quick start

Implemented methods can be used from Python or from the command line.

Normalize from Python

Most commonly normalization methods are run from Python. E.g.:

>>> from rnanorm.datasets import load_rnaseq_toy
>>> from rnanorm import FPKM
>>> dataset = load_rnaseq_toy()
>>> dataset.exp
       G1     G2      G3      G4       G5
S1  200.0  300.0   500.0  2000.0   7000.0
S2  400.0  600.0  1000.0  4000.0  14000.0
S3  200.0  300.0   500.0  2000.0  17000.0
S4  200.0  300.0   500.0  2000.0   2000.0
>>> fpkm = FPKM(dataset.gtf_path).set_output(transform="pandas")
>>> fpkm.fit_transform(dataset.exp)
          G1        G2        G3        G4        G5
S1  100000.0  100000.0  100000.0  200000.0  700000.0
S2  100000.0  100000.0  100000.0  200000.0  700000.0
S3   50000.0   50000.0   50000.0  100000.0  850000.0
S4  200000.0  200000.0  200000.0  400000.0  400000.0

Normalize from command line

Often it is handy to do normalization from the command line:

rnanorm fpkm exp.csv --gtf annotation.gtf --out exp_fpkm.csv

Contribute

To learn about contributing to the code base, read the Contributing section.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rnanorm-2.0.0rc2.tar.gz (129.6 kB view details)

Uploaded Source

Built Distribution

rnanorm-2.0.0rc2-py3-none-any.whl (119.2 kB view details)

Uploaded Python 3

File details

Details for the file rnanorm-2.0.0rc2.tar.gz.

File metadata

  • Download URL: rnanorm-2.0.0rc2.tar.gz
  • Upload date:
  • Size: 129.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for rnanorm-2.0.0rc2.tar.gz
Algorithm Hash digest
SHA256 af39805c097d4c9142612137668654fe9d94c2a68c6fd7e2e13daed90b40f768
MD5 0f819dc35f13afbee9afc17d0dfa7556
BLAKE2b-256 199ab55e531632a6057107901181c24272b7f4a208ab2a80b8f35c0ce9f09a2b

See more details on using hashes here.

File details

Details for the file rnanorm-2.0.0rc2-py3-none-any.whl.

File metadata

  • Download URL: rnanorm-2.0.0rc2-py3-none-any.whl
  • Upload date:
  • Size: 119.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for rnanorm-2.0.0rc2-py3-none-any.whl
Algorithm Hash digest
SHA256 4e1091a590a4c9de687e7504fd087299beeba75132cf4bdf5e90298bf4b6b52d
MD5 478cf65b6bf7b266982245dda7e9db06
BLAKE2b-256 57053f1a7bd3ca712a102df7c9799bb03eafd3efc6e9b88dc306e128b8877a04

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page