Transfer learning with Architecture Surgery on Single-cell data
Project description
.. raw:: html
|PyPI| |PyPIDownloads| |Docs| |travis|
Single-cell architecture surgery (scArches) is a package for reference-based analysis of single-cell data.
Updates
-
(29.11.2022) We have added
scPoli <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>
_ enabling population level integration of single-cell datasets and multi-scale analysis. -
(22.11.2022) We have added
Multigrate <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1=>
_ enabling multimodal reference building and mapping. -
(22.10.2022) We have added
mvTCR <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>
_ andSageNet <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>
_ enabling mapping multimodal immune profiling (TCR+scRNAreq) and scRNA-seq to spatial atlases, respectively. -
(7.07.2022) We have added
treeArches <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>
_ to scArches code base. treeArches enables building cell-type hierarchies to identify novel states (e.g., disease, subpopulations) in the query data when mapped to the reference. See tutorialshere <https://scarches.readthedocs.io/>
_ . -
(6.02.2022) We have added
expiMap <https://www.biorxiv.org/content/10.1101/2022.02.05.479217v1>
_ to scArches code base. expiMap allows interpretable reference mapping. See tutorialshere <https://scarches.readthedocs.io/>
_ .
What is scArches?
scArches allows analysis of your single-cell query data by integrating it into a reference atlas. To map your data you need an integrated atlas using one of the reference building methods for different applications that are supported by scArches which are, including:
-
scVI (
Lopez et al., 2018 <https://www.nature.com/articles/s41592-018-0229-2>
_): Requires access to raw counts values for data integration and assumes count distribution on the data (NB, ZINB, Poisson). -
trVAE (
Lotfollahi et al.,2020 <https://academic.oup.com/bioinformatics/article/36/Supplement_2/i610/6055927?guestAccessKey=71253caa-1779-40e8-8597-c217db539fb5>
_): It supports both normalized log-transformed or count data as input and applies additional MMD loss to have better merging in the latent space. -
scANVI (
Xu et al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>
_): It needs cell type labels for reference data. Your query data can be either unlabeled or labeled. In the case of unlabeled query data, you can use this method also to classify your query cells using reference labels. -
scGen (
Lotfollahi et al., 2019 <https://www.nature.com/articles/s41592-019-0494-8>
_): This method requires cell-type labels for both reference building and Mapping. The reference mapping for this method solely relies on the integrated reference and requires no fine-tuning. -
expiMap (
Lotfollahi*, Rybakov* et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.02.05.479217v1>
_): This method takes prior knowledge from gene sets databases or users allowing to analyze your query data in the context of known gene programs. -
totalVI (
Gayoso al., 2019 <https://www.biorxiv.org/content/10.1101/532895v1>
_): This model can be used to build multi-modal CITE-seq reference atalses. -
treeArches (
Michielsen*, Lotfollahi* et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.07.07.499109v1>
_): This model builds a hierarchical tree for cell-types in the reference atlas and when mapping the query data can annotate and also identify novel cell-states and populations present in the query data. -
SageNet (
Heidari et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.04.14.488419v1>
_): This model allows constrcution of a spatial atlas by mapping query dissociated single cells/spots (e.g., from scRNAseq or visium datasets) into a common coordinate framework using one or more spatially resolved reference datasets. -
mvTCR (
Drost et al., 2022 <https://www.biorxiv.org/content/10.1101/2021.06.24.449733v2.abstract?%3Fcollection=>
_): Using this model you will be able to integrate T-cell receptor (TCR, treated as a sequence) and scRNA-seq dataset across multiple donors into a joint representation capturing information from both modalities. -
Multigrate (
Litinetskaya*, Lotfollahi*, et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.03.16.484643v1>
_): Using this model, you will be able to build multimodal atlases using paired and unpaired data from scRNA-seq, scATAC-seq, and CITE-seq. Additionally, combined with scArches, it allows you to map query data from either modality. -
scPoli (
De Donno et al., 2022 <https://www.biorxiv.org/content/10.1101/2022.11.28.517803v1>
_): This model allows data integration of scRNA-seq dataset, prototype-based label transfer and reference mapping. scPoli learns both sample embeddings and integrated cell embeddings, thus providing the user with a multi-scale view of the data, especially useful in the case of many samples to integrate.
Usage and installation
See here <https://scarches.readthedocs.io/>
_ for documentation and tutorials.
Support and contribute
If you have a question or new architecture or a model that could be integrated into our pipeline, you can
post an issue <https://github.com/theislab/scarches/issues/new>
__ or reach us by email <mailto:cottoneyejoe.server@gmail.com,mo.lotfollahi@gmail.com,mohsen.naghipourfar@gmail.com>
_.
Reference
If scArches is helpful in your research, please consider citing the following paper <https://www.nature.com/articles/s41587-021-01001-7>
_:
::
@article{lotfollahi2021mapping,
title={Mapping single-cell data to reference atlases by transfer learning},
author={Lotfollahi, Mohammad and Naghipourfar, Mohsen and Luecken, Malte D and Khajavi,
Matin and B{\"u}ttner, Maren and Wagenstetter, Marco and Avsec, {\v{Z}}iga and Gayoso,
Adam and Yosef, Nir and Interlandi, Marta and others},
journal={Nature Biotechnology},
pages={1--10},
year={2021},
publisher={Nature Publishing Group}}
.. |PyPI| image:: https://img.shields.io/pypi/v/scarches.svg :target: https://pypi-hypernode.com/project/scarches
.. |PyPIDownloads| image:: https://pepy.tech/badge/scarches :target: https://pepy.tech/project/scarches
.. |Docs| image:: https://readthedocs.org/projects/scarches/badge/?version=latest :target: https://scarches.readthedocs.io
.. |travis| image:: https://travis-ci.com/theislab/scarches.svg?branch=master :target: https://travis-ci.com/theislab/scarches
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file scArches-0.5.6.tar.gz
.
File metadata
- Download URL: scArches-0.5.6.tar.gz
- Upload date:
- Size: 37.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 87b5aa89676b37f65f076631faf3033515cc8380dfad076699410628a1838116 |
|
MD5 | 7894417d29feca3695625c5b5158ca90 |
|
BLAKE2b-256 | 26122ab79e58d6f1412d11daed323edfb424034e49567086b29543c10037c7ce |
File details
Details for the file scArches-0.5.6-py3-none-any.whl
.
File metadata
- Download URL: scArches-0.5.6-py3-none-any.whl
- Upload date:
- Size: 118.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/0.0.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0.post20210125 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e0e468468de124415292453d3b6f196bd00bb948d28e76f28fc2b0b3a49adaf5 |
|
MD5 | 23054a086ca021be1501f73f03ab757b |
|
BLAKE2b-256 | 0d0eadd686367cb8e865edbc795459314772bb4f0845f16cf018bd5be6c944a4 |