Finite Mixtures of Multivariate Poisson-Log Normal Model for Clustering Count Data
MPLNClust
is an R package for performing clustering using finite
mixtures of multivariate Poisson-log normal (MPLN) distribution proposed
by Silva et al., 2019. It
was developed for count data, with clustering of RNA sequencing data as
a motivation. However, the clustering method may be applied to other
types of count data. The package provides functions for functions for
parameter estimation via 1) an MCMC-EM framework by Silva et al.,
2019 and 2) a variational
Gaussian approximation with EM algorithm by Subedi and Browne,
2020. Information criteria (AIC, BIC,
AIC3 and ICL) and slope heuristics (Djump and DDSE, if more than 10
models are considered) are offered for model selection. Also included
are functions for simulating data from this model and visualization.
To install the latest version of the package:
require("devtools")
devtools::install_github("anjalisilva/MPLNClust", build_vignettes = TRUE)
library("MPLNClust")
To run the Shiny app:
MPLNClust::runMPLNClust()
To list all functions available in the package:
ls("package:MPLNClust")
MPLNClust
contains 14 functions.
Framework of mplnVariational makes it computationally efficient and faster compared to mplnMCMCParallel or mplnMCMCNonParallel. Therefore, mplnVariational may perform better for large datasets. For more information, see details section below. An overview of the package is illustrated below:
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.