Skip to main content

Transfer learning with Architecture Surgery on Single-cell data

Project description

.. raw:: html

|PyPI| |PyPIDownloads| |Docs| |travis|

Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data.

Updates

  • (29.11.2022) We have added scPoli <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>_ enabling population level integration of single-cell datasets and multi-scale analysis.

  • (22.11.2022) We have added Multigrate <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1=>_ enabling multimodal reference building and mapping.

  • (22.10.2022) We have added mvTCR <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>_ and SageNet <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>_ enabling mapping multimodal immune profiling (TCR+scRNAreq) and scRNA-seq to spatial atlases, respectively.

  • (7.07.2022) We have added treeArches <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>_ to scArches code base. treeArches enables building cell-type hierarchies to identify novel states (e.g., disease, subpopulations) in the query data when mapped to the reference. See tutorials here <https://scarches.readthedocs.io/>_ .

  • (6.02.2022) We have added expiMap <https://www.biorxiv.org/content/10.1101/2022.02.05.479217v1>_ to scArches code base. expiMap allows interpretable reference mapping. See tutorials here <https://scarches.readthedocs.io/>_ .

What is scArches?

scArches allows analysis of your single-cell query data by integrating it into a reference atlas. To map your data you need an integrated atlas using one of the reference building methods for different applications that are supported by scArches which are, including:

  • scVI (Lopez et al., 2018 <https://www.nature.com/articles/s41592-018-0229-2>_): Requires access to raw counts values for data integration and assumes count distribution on the data (NB, ZINB, Poisson).

  • trVAE (Lotfollahi et al.,2020 <https://academic.oup.com/bioinformatics/article/36/Supplement_2/i610/6055927?guestAccessKey=71253caa-1779-40e8-8597-c217db539fb5>_): It supports both normalized log-transformed or count data as input and applies additional MMD loss to have better merging in the latent space.

  • scANVI (Xu et al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>_): It needs cell type labels for reference data. Your query data can be either unlabeled or labeled. In the case of unlabeled query data, you can use this method also to classify your query cells using reference labels.

  • scGen (Lotfollahi et al., 2019 <https://www.nature.com/articles/s41592-019-0494-8>_): This method requires cell-type labels for both reference building and Mapping. The reference mapping for this method solely relies on the integrated reference and requires no fine-tuning.

  • expiMap (Lotfollahi*, Rybakov* et al., 2023 <https://www.nature.com/articles/s41556-022-01072-x>_): This method takes prior knowledge from gene sets databases or users allowing to analyze your query data in the context of known gene programs.

  • totalVI (Gayoso al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>_): This model can be used to build multi-modal CITE-seq reference atalses.

  • treeArches (Michielsen*, Lotfollahi* et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>_): This model builds a hierarchical tree for cell-types in the reference atlas and when mapping the query data can annotate and also identify novel cell-states and populations present in the query data.

  • SageNet (Heidari et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>_): This model allows constrcution of a spatial atlas by mapping query dissociated single cells/spots (e.g., from scRNAseq or visium datasets) into a common coordinate framework using one or more spatially resolved reference datasets.

  • mvTCR (Drost et al., 2022 <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>_): Using this model you will be able to integrate T-cell receptor (TCR, treated as a sequence) and scRNA-seq dataset across multiple donors into a joint representation capturing information from both modalities.

  • Multigrate (Litinetskaya*, Lotfollahi*, et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1>_): Using this model, you will be able to build multimodal atlases using paired and unpaired data from scRNA-seq, scATAC-seq, and CITE-seq. Additionally, combined with scArches, it allows you to map query data from either modality.

  • scPoli (De Donno et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>_): This model allows data integration of scRNA-seq dataset, prototype-based label transfer and reference mapping. scPoli learns both sample embeddings and integrated cell embeddings, thus providing the user with a multi-scale view of the data, especially useful in the case of many samples to integrate.

Usage and installation

See here <https://scarches.readthedocs.io/>_ for documentation and tutorials.

Support and contribute

If you have a question or new architecture or a model that could be integrated into our pipeline, you can post an issue <https://github.com/theislab/scarches/issues/new>__ or reach us by email <mailto:cottoneyejoe.server@gmail.com,mo.lotfollahi@gmail.com,mohsen.naghipourfar@gmail.com>_.

Reference

If scArches is helpful in your research, please consider citing the following paper <https://www.nature.com/articles/s41587-021-01001-7>_: ::

   @article{lotfollahi2021mapping,
     title={Mapping single-cell data to reference atlases by transfer learning},
     author={Lotfollahi, Mohammad and Naghipourfar, Mohsen and Luecken, Malte D and Khajavi,
     Matin and B{\"u}ttner, Maren and Wagenstetter, Marco and Avsec, {\v{Z}}iga and Gayoso,
     Adam and Yosef, Nir and Interlandi, Marta and others},
     journal={Nature Biotechnology},
     pages={1--10},
     year={2021},
     publisher={Nature Publishing Group}}

.. |PyPI| image:: https://img.shields.io/pypi/v/scarches.svg :target: https://pypi-hypernode.com/project/scarches

.. |PyPIDownloads| image:: https://pepy.tech/badge/scarches :target: https://pepy.tech/project/scarches

.. |Docs| image:: https://readthedocs.org/projects/scarches/badge/?version=latest :target: https://scarches.readthedocs.io

.. |travis| image:: https://travis-ci.com/theislab/scarches.svg?branch=master :target: https://travis-ci.com/theislab/scarches

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scArches-0.5.7.tar.gz (41.3 MB view details)

Uploaded Source

Built Distribution

scArches-0.5.7-py3-none-any.whl (121.3 kB view details)

Uploaded Python 3

File details

Details for the file scArches-0.5.7.tar.gz.

File metadata

  • Download URL: scArches-0.5.7.tar.gz
  • Upload date:
  • Size: 41.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for scArches-0.5.7.tar.gz
Algorithm Hash digest
SHA256 55e5c8e018ffea3533a8f6271ea1055cda1a3f25c470c181c7f2401afdbb52b8
MD5 6f1a20bc8157e017e6db59f758e1f851
BLAKE2b-256 5576666f5924cc4f4032946726bbe312d981c529e08e22db99c8dda350353362

See more details on using hashes here.

File details

Details for the file scArches-0.5.7-py3-none-any.whl.

File metadata

  • Download URL: scArches-0.5.7-py3-none-any.whl
  • Upload date:
  • Size: 121.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for scArches-0.5.7-py3-none-any.whl
Algorithm Hash digest
SHA256 80a0ccb021c11fe467cf101d82172a879e839e1006b8bf2aa561e50e52b36a36
MD5 df99a988f55d5b012efeeb46cc5cc509
BLAKE2b-256 6cdfad7d2c04215ca3f66b1dd2c6dce80eeedc8417d6139f91bc704c2a3d88a5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page