R package for gene signature extraction and cell-identity recognition at individual cell level from single-cell RNA-seq.
Welcome to the official Github repository of the Cell-ID software presented at the BioRxiv preprint "Cell-ID: gene signature extraction and cell identity recognition at individual cell level". Cortal A, Martignetti L, Six E, Rausell A. BioRxiv 2020.
Cell-ID is a robust statistical method that performs gene signature extraction and functional annotation for each individual cell in a single-cell RNA-seq dataset. Cell-ID is based on Multiple Correspondence Analysis (MCA) and produces a simultaneous representation of cells and genes in a low dimension space. Genes are then ranked by their distance to each individual cell, providing unbiased per-cell gene signatures. Such signatures proved valuable to (i) correctly predict cell type labels at individual cell resolution, (ii) correctly match cells from the same cell type across independent datasets, overcoming batch effects arising from different technologies, tissues-of-origin and donors, and (iii) uncover functionally relevant cell heterogeneity that would have been missed by clustering-based approaches. Cell-ID enables the robust identification of rare or even unique cells whose gene signatures are reproducible across diverse single-cell omics datasets.
Cell-ID is provided as an R package (R version >= 3.6). It contains dependencies with several Biocondutor packages as described in the Description file
Within R, set first:
install.packages("devtools")
setRepositories(ind = c(1,2,3))
To install Cell-ID then just type:
devtools::install_github("cbl-imagine/CellID")
MAC OS users might experience installation issues related to Gfortran library. To solve such issue download and install the appropriate gfortran dmg file from https://github.com/fxcoudert/gfortran-for-macOS
Cell-ID use as input single cell data in the form of specific S4objects. Curreltly supported files are SingleCellExperiment from Bioconductor and Seurat Version 3 from CRAN.
A vignette illustrating Cell-ID step-by-step procedures is provided here. Applications include MCA dimensionality reduction, per-cell gene signatures extraction, automatic cell type prediction using marker gene lists, label-transferring across datasets and functional enrichment analysis.
This project is licensed under the GNU General Public License 3 - see the LICENSE file for details
Cell-ID: gene signature extraction and cell identity recognition at individual cell level. Cortal A, Martignetti L, Six E, Rausell A. BioRxiv 2020
Upon publication of the Cell-ID paper, a companion Github repository will be released containing all R scripts and intermediate data representations required to reproduce all figures in the manuscript
You may follow us in Twitter for regular updates: https://twitter.com/AntonioRausell
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.