RSEC | R Documentation |
Implementation of the RSEC algorithm (Resampling-based Sequential Ensemble Clustering) for single cell sequencing data. This is a wrapper function around the existing ClusterExperiment workflow that results in the output of RSEC.
## S4 method for signature 'SummarizedExperiment'
RSEC(x, ...)
## S4 method for signature 'data.frame'
RSEC(x, ...)
## S4 method for signature 'ClusterExperiment'
RSEC(x, eraseOld = FALSE, rerunClusterMany = FALSE, ...)
## S4 method for signature 'matrixOrHDF5'
RSEC(x, ...)
## S4 method for signature 'SingleCellExperiment'
RSEC(
x,
isCount = FALSE,
transFun = NULL,
reduceMethod = "PCA",
nFilterDims = defaultNDims(x, reduceMethod, type = "filterStats"),
nReducedDims = defaultNDims(x, reduceMethod, type = "reducedDims"),
k0s = 4:15,
subsample = TRUE,
sequential = TRUE,
clusterFunction = "hierarchical01",
alphas = c(0.1, 0.2, 0.3),
betas = 0.9,
minSizes = 1,
makeMissingDiss = if (ncol(x) < 1000) TRUE else FALSE,
consensusProportion = 0.7,
consensusMinSize,
dendroReduce,
dendroNDims,
mergeMethod = "adjP",
mergeCutoff,
mergeLogFCcutoff,
mergeDEMethod = if (isCount) "limma-voom" else "limma",
verbose = FALSE,
parameterWarnings = FALSE,
mainClusterArgs = NULL,
subsampleArgs = NULL,
seqArgs = NULL,
consensusArgs = NULL,
whichAssay = 1,
ncores = 1,
random.seed = NULL,
stopOnErrors = FALSE,
run = TRUE
)
x |
the data matrix on which to run the clustering. Can be object of the
following classes: matrix (with genes in rows),
|
... |
For signature |
eraseOld |
logical. Only relevant if input |
rerunClusterMany |
logical. If the object is a ClusterExperiment object, determines whether to rerun the clusterMany step. Useful if want to try different parameters for combining clusters after the clusterMany step, without the computational costs of the clusterMany step. |
isCount |
if |
transFun |
a transformation function to be applied to the data. If the
transformation applied to the data creates an error or NA values, then the
function will throw an error. If object is of class
|
reduceMethod |
character A character identifying what type of dimensionality reduction to perform before clustering. Options are 1) "none", 2) one of listBuiltInReducedDims() or listBuiltInFitlerStats OR 3) stored filtering or reducedDim values in the object. |
nFilterDims |
vector of the number of the most variable features to keep
(when "var", "abscv", or "mad" is identified in |
nReducedDims |
vector of the number of dimensions to use (when
|
k0s |
the k0 parameter for sequential clustering (see
|
subsample |
logical as to whether to subsample via
|
sequential |
logical whether to use the sequential strategy (see details
of |
clusterFunction |
function used for the clustering. This must be either
1) a character vector of built-in clustering techniques, or 2) a
named list of |
alphas |
values of alpha to be tried. Only used for clusterFunctions of
type '01'. Determines tightness required in creating clusters from the
dissimilarity matrix. Takes on values in [0,1]. See documentation of
|
betas |
values of |
minSizes |
the minimimum size required for a cluster (in the
|
makeMissingDiss |
logical. Whether to calculate necessary distance
matrices needed when input is not "diss". If TRUE, then when a clustering
function calls for a inputType "diss", but the given matrix is of type "X",
the function will calculate a distance function. A dissimilarity matrix
will also be calculated if a post-processing argument like |
consensusProportion |
passed to |
consensusMinSize |
passed to |
dendroReduce |
passed to |
dendroNDims |
passed to |
mergeMethod |
passed to |
mergeCutoff |
passed to |
mergeLogFCcutoff |
passed to |
mergeDEMethod |
passed to |
verbose |
logical. If TRUE it will print informative messages. |
parameterWarnings |
logical, as to whether warnings and comments from checking the validity of the parameter combinations should be printed. |
mainClusterArgs |
list of arguments to be passed for the mainClustering
step, see help pages of |
subsampleArgs |
list of arguments to be passed to the subsampling step
(if |
seqArgs |
list of arguments to be passed to |
consensusArgs |
list of additional arguments to be passed to
|
whichAssay |
numeric or character specifying which assay to use. See
|
ncores |
the number of threads |
random.seed |
a value to set seed before each run of clusterSingle (so that all of the runs are run on the same subsample of the data). Note, if 'random.seed' is set, argument 'ncores' should NOT be passed via subsampleArgs; instead set the argument 'ncores' of clusterMany directly (which is preferred for improving speed anyway). |
stopOnErrors |
logical. If |
run |
logical. If FALSE, doesn't run clustering, but just returns matrix
of parameters that will be run, for the purpose of inspection by user (with
rownames equal to the names of the resulting column names of clMat object
that would be returned if |
Note that the argument isCount
is mainly used when the input is a matrix or SingleCellExperiment Class and passed to clusterMany
to set the transformation function of the data. However, if RSEC is being re-called on an existing ClusterExperiment
object, it does not reset the transformation; in this case the only impact it will have is in setting the default value for DEMethod
for mergeClusters
step, but ONLY if mergeClusters
hasn't already been calculated. To set arguments that allow you to recalculate the non-null probabilities of the hierarchy see mergeClusters
.
A ClusterExperiment
object is returned containing all
of the clusterings from the steps of RSEC
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.