Skip to main content

Transfer learning with Architecture Surgery on Single-cell data

Project description

.. raw:: html

|PyPI| |PyPIDownloads| |Docs| |travis|

Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data.

Updates

  • (29.11.2022) We have added scPoli <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>_ enabling population level integration of single-cell datasets and multi-scale analysis.

  • (22.11.2022) We have added Multigrate <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1=>_ enabling multimodal reference building and mapping.

  • (22.10.2022) We have added mvTCR <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>_ and SageNet <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>_ enabling mapping multimodal immune profiling (TCR+scRNAreq) and scRNA-seq to spatial atlases, respectively.

  • (7.07.2022) We have added treeArches <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>_ to scArches code base. treeArches enables building cell-type hierarchies to identify novel states (e.g., disease, subpopulations) in the query data when mapped to the reference. See tutorials here <https://scarches.readthedocs.io/>_ .

  • (6.02.2022) We have added expiMap <https://www.biorxiv.org/content/10.1101/2022.02.05.479217v1>_ to scArches code base. expiMap allows interpretable reference mapping. See tutorials here <https://scarches.readthedocs.io/>_ .

What is scArches?

scArches allows analysis of your single-cell query data by integrating it into a reference atlas. To map your data you need an integrated atlas using one of the reference building methods for different applications that are supported by scArches which are, including:

  • scVI (Lopez et al., 2018 <https://www.nature.com/articles/s41592-018-0229-2>_): Requires access to raw counts values for data integration and assumes count distribution on the data (NB, ZINB, Poisson).

  • trVAE (Lotfollahi et al.,2020 <https://academic.oup.com/bioinformatics/article/36/Supplement_2/i610/6055927?guestAccessKey=71253caa-1779-40e8-8597-c217db539fb5>_): It supports both normalized log-transformed or count data as input and applies additional MMD loss to have better merging in the latent space.

  • scANVI (Xu et al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>_): It needs cell type labels for reference data. Your query data can be either unlabeled or labeled. In the case of unlabeled query data, you can use this method also to classify your query cells using reference labels.

  • scGen (Lotfollahi et al., 2019 <https://www.nature.com/articles/s41592-019-0494-8>_): This method requires cell-type labels for both reference building and Mapping. The reference mapping for this method solely relies on the integrated reference and requires no fine-tuning.

  • expiMap (Lotfollahi*, Rybakov* et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.02.05.479217v1>_): This method takes prior knowledge from gene sets databases or users allowing to analyze your query data in the context of known gene programs.

  • totalVI (Gayoso al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>_): This model can be used to build multi-modal CITE-seq reference atalses.

  • treeArches (Michielsen*, Lotfollahi* et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>_): This model builds a hierarchical tree for cell-types in the reference atlas and when mapping the query data can annotate and also identify novel cell-states and populations present in the query data.

  • SageNet (Heidari et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>_): This model allows constrcution of a spatial atlas by mapping query dissociated single cells/spots (e.g., from scRNAseq or visium datasets) into a common coordinate framework using one or more spatially resolved reference datasets.

  • mvTCR (Drost et al., 2022 <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>_): Using this model you will be able to integrate T-cell receptor (TCR, treated as a sequence) and scRNA-seq dataset across multiple donors into a joint representation capturing information from both modalities.

  • Multigrate (Litinetskaya*, Lotfollahi*, et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1>_): Using this model, you will be able to build multimodal atlases using paired and unpaired data from scRNA-seq, scATAC-seq, and CITE-seq. Additionally, combined with scArches, it allows you to map query data from either modality.

  • scPoli (De Donno et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>_): This model allows data integration of scRNA-seq dataset, prototype-based label transfer and reference mapping. scPoli learns both sample embeddings and integrated cell embeddings, thus providing the user with a multi-scale view of the data, especially useful in the case of many samples to integrate.

Usage and installation

See here <https://scarches.readthedocs.io/>_ for documentation and tutorials.

Support and contribute

If you have a question or new architecture or a model that could be integrated into our pipeline, you can post an issue <https://github.com/theislab/scarches/issues/new>__ or reach us by email <mailto:cottoneyejoe.server@gmail.com,mo.lotfollahi@gmail.com,mohsen.naghipourfar@gmail.com>_.

Reference

If scArches is helpful in your research, please consider citing the following paper <https://www.nature.com/articles/s41587-021-01001-7>_: ::

   @article{lotfollahi2021mapping,
     title={Mapping single-cell data to reference atlases by transfer learning},
     author={Lotfollahi, Mohammad and Naghipourfar, Mohsen and Luecken, Malte D and Khajavi,
     Matin and B{\"u}ttner, Maren and Wagenstetter, Marco and Avsec, {\v{Z}}iga and Gayoso,
     Adam and Yosef, Nir and Interlandi, Marta and others},
     journal={Nature Biotechnology},
     pages={1--10},
     year={2021},
     publisher={Nature Publishing Group}}

.. |PyPI| image:: https://img.shields.io/pypi/v/scarches.svg :target: https://pypi-hypernode.com/project/scarches

.. |PyPIDownloads| image:: https://pepy.tech/badge/scarches :target: https://pepy.tech/project/scarches

.. |Docs| image:: https://readthedocs.org/projects/scarches/badge/?version=latest :target: https://scarches.readthedocs.io

.. |travis| image:: https://travis-ci.com/theislab/scarches.svg?branch=master :target: https://travis-ci.com/theislab/scarches

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scArches-0.5.6.tar.gz (37.5 MB view details)

Uploaded Source

Built Distribution

scArches-0.5.6-py3-none-any.whl (118.9 kB view details)

Uploaded Python 3

File details

Details for the file scArches-0.5.6.tar.gz.

File metadata

  • Download URL: scArches-0.5.6.tar.gz
  • Upload date:
  • Size: 37.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for scArches-0.5.6.tar.gz
Algorithm Hash digest
SHA256 87b5aa89676b37f65f076631faf3033515cc8380dfad076699410628a1838116
MD5 7894417d29feca3695625c5b5158ca90
BLAKE2b-256 26122ab79e58d6f1412d11daed323edfb424034e49567086b29543c10037c7ce

See more details on using hashes here.

File details

Details for the file scArches-0.5.6-py3-none-any.whl.

File metadata

  • Download URL: scArches-0.5.6-py3-none-any.whl
  • Upload date:
  • Size: 118.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for scArches-0.5.6-py3-none-any.whl
Algorithm Hash digest
SHA256 e0e468468de124415292453d3b6f196bd00bb948d28e76f28fc2b0b3a49adaf5
MD5 23054a086ca021be1501f73f03ab757b
BLAKE2b-256 0d0eadd686367cb8e865edbc795459314772bb4f0845f16cf018bd5be6c944a4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page