Description Usage Arguments Details Value A brief description of subfunctions Note Author(s) See Also Examples
View source: R/generate_noise_matrices.R
Computes BEARscc
simulated technical replicates from the
previously estimated noise parameters computed with the
function estimate_noise_parameters()
.
1 | simulate_replicates(SCEList, max_cumprob=0.9999, n = 3)
|
SCEList |
A |
max_cumprob |
Because a cumulative distribution will range from n=0 to a countable
infinity, the event space needs to be set to cover a reasonable
fraction of the probability density. This parameter determines the
the fraction of probability density covered by the event space, which
in turn defines the highes count number in the event space. We recommend
users use the default value of 0.9999. However, if the default value
was altered in |
n |
The number of simulated technical replicates to generate. |
In the second step of BEARscc, the algorithm applies the model from first
step to produce simulated technical replicates. For every observed
gene count below which drop-outs occurred amongst the spike-ins,
BEARscc assesses whether to convert the count to zero (using the
drop-out injection distribution). For observations where the count is
zero, the drop-out recovery distribution is used to estimate a new value,
based on the overall drop-out frequency for that gene. After this
drop-out processing, all non-zero counts are substituted with a value
generated by the model of expression variance created in the first step.
parameterized to the observed counts for each gene. This second step is
repeated any number of times (as prescribed by parameter n
) to
generate a collection of simulated technical replicates for
downstream analysis.
The resulting object is a list of counts data that is added to the
metadata
of the SingleCellExpression
object as a long
list titled "simulated_replicates"
. Each element of the list
is a data.frame
of the counts representing a BEARscc simulated
technical replicate, e.g for n=10
we would have the list:
[,1] | Counts data.frame of simulated replicate 1. |
|
[,2] | Counts data.frame of simulated replicate 2. |
|
[,3] | Counts data.frame of simulated replicate 3. |
|
[,4] | Counts data.frame of simulated replicate 4. |
|
[,5] | Counts data.frame of simulated replicate 5. |
|
[,6] | Counts data.frame of simulated replicate 6. |
|
[,7] | Counts data.frame of simulated replicate 7. |
|
[,8] | Counts data.frame of simulated replicate 8. |
|
[,9] | Counts data.frame of simulated replicate 9. |
|
[,10] | Counts data.frame of simulated replicate 10. |
|
[,11] | Counts data.frame of observed data. |
|
simulate_replicates
relies on the following subfunctions
to generate simulated technical replicates. These functions share many
common options with the user interactive function. For those options
that are internal to the programming; these are annotated to give an
idea of flow. For further detail please examine source code in the R
directory of this package:
spikes_prepared <- execute_noiseinjected_counts(n=1,
noise_parameters=estimated_noise, total_sampling)
probs4detection.genes<-t(data.frame(
noise_parameters$bayes_parameters, row.names = "k")[,
4:eval(dim(noise_parameters$bayes_parameters)[2]-1)])
probs4detection.k<-data.frame(
noise_parameters$bayes_parameters[,2:4, with=FALSE],row.names = "k")
noisy_counts<-data.table(noise_parameters$original.counts,
keep.rownames = TRUE)[,apply(.SD,1 ,`genewise_permute_count`,
probs4detection.k=probs4detection.k,
probs4detection.genes=probs4detection.genes,
parameters=noise_parameters$ERCC_parameters,
total_sampling=total_sampling)]
probabilityA<-probs4detection.genes[gsub("-",".",x[1]),]
apply(data.frame(as.numeric(x[-1])),1, `permute_count`,
probs4detection.k, probabilityA=probabilityA,
parameters, total_sampling)
Under various conditions some form of nx<-randomizer(x,
parameters, total_sampling)
is invoked.
Frequently, the user will want to compute simulated technical replicates
in a high performance computational environment. When running
estimate_noiseparameters()
using the option
write.noise.model=TRUE
, the user recives the files with
root file="noise_estimation"
,
"noise_estimation_counts4clusterperturbation.xls",
"noise_estimation_bayesianestimates.xls" and
"noise_estimation_parameters4randomize.xls". These files may be input into
the example code, HPC_generate_noise_matrices.R
, on a high
performance computational environment for faster processing.
David T. Severson <david_severson@hms.harvard.edu>
Maintainer: Benjamin Schuster-Boeckler <benjamin.schuster-boeckler@ludwig.ox.ac.uk>
The example code for running the simulation of technical replicates
on a high performance computing cluster can be found
in inst/example/
.
The code for generating simulated technical replicates on a high powered
compute node requires the function, HPC_simulate_replicates()
.
1 2 3 4 5 | library("SingleCellExperiment")
data(analysis_examples)
BEAR_simreplicates.sce<-simulate_replicates(BEAR_analyzed.sce, n=3)
BEAR_simreplicates.sce
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.