Abrupt and often irreversible changes (or tipping points) are decisive in the progression of biological processes. We, therefore, developed this R-package for the characterization of biological tipping points using gene expression profiles. BioTIP is the first toolset that amalgamates two computational impediments in multivariate expression-data analysis: (1) detection of tipping points accurately, and (2) identification of significant critical transition signals (CTSs).
The following figure outlines BioTIP pipeline:
a, Analytic pipeline of single-cell RNA-seq analysis, with BioTIP as an alternative to mainstream analyses by focusing on transition states. The arrows in orange show that BioTIP is applied to cell state ensembles (clusters) without inputting the pseudo-order information. The dashed orange arrow shows that BioTIP may use pseudo-order information to visualize and interpret the results.
b, Overview of BioTIP’s three analytical steps. The scRNA-seq profiles with cell cluster IDs are the inputs, which can be generated by independent pipelines. The outputs are significant CT states and their characteristic CTSs which can infer CT-driven transcription factors. Three analytic steps HVG: highly variable genes. DNB: dynamic biomarker network; Ic: index of criticality.
BioTIP addresses and shows robust performances by addressing three analytical challenges of the existing tipping-point methods:
We applied BioTIP to six datasets and compared BioTIP's performance with other existing tools (see examples). The six datasets are as following:
For each dataset, cell-type biomarkers or cluster identities are given by the original publications. The scRNA-seq data quality control (e.g., removing triplicate cell and mitochondria gene), normalize distinct sequencing depths, general feature selections, and cell cluster are performed following the corresponding manuscripts.
Cell clustering is a prerequisite for CTS identification with BioTIP. While different cell cluster numbers may affect BioTIP's prediction, a reasonable clustering method (with reasonably chosen parameters) will not have a major impact on the CTSs identified. BioTIP is designed for the case where the transition state can be identified as a cell cluster, and therefore soft clustering is not always applicable.
BioTIP also demonstrated robustness with respect to different clustering methods. Details can be found here.
To adopt tipping-point theory to transcriptomic analysis, there are two commonly accepted premises:
BioTIP applies to data meeting these premises, including both single-cell and bulk-cell transcriptomes. The impending transition states are called ‘critical transition states’ or ‘tipping points.’
We have successfully applied BioTIP to identify temporal features of molecular-network dynamics from gene expression profiles. Importantly, the CTS identifications helped infer the underlying gene-regulatory network and the involving key transcription factors.
BioTIP tutorial: This is a detailed walkthrough of BioTIP on one of our key results (Mouse Gastrulation, GSE87038, E8.25 2019).
Vignette: This documented exampled case studies on bulk (GSE6136) and single-cell (Nestorowa 2016) datasets.
To use the newest BioTIP package, either clone/download this repository, or you can install BioTIP with:
library("devtools")
devtools::install_github("xyang2uchicago/BioTIP")
You can install the released version of BioTIP from CRAN with:
install.packages("BioTIP")
or even better:
source('http://bioconductor.org/biocLite.R')
biocLite("BioTIP")
BioTIP is made possible by contributions from the following authors: Xinan H Yang, Zhezhen Wang, Yuxi Sun, Andrew Goldstein, Dannie Griggs, Antonio Feliciano, Yanqiu Wang, Biniam Feleke, Qier An, Ieva Tolkaciovaite, and John M Cunningham.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.