run_edger | R Documentation |
Differential expression analysis based on the Negative Binomial distribution using edgeR.
run_edger(
ps,
group,
confounders = character(0),
contrast = NULL,
taxa_rank = "all",
method = c("LRT", "QLFT"),
transform = c("identity", "log10", "log10p", "SquareRoot", "CubicRoot", "logit"),
norm = "TMM",
norm_para = list(),
disp_para = list(),
p_adjust = c("none", "fdr", "bonferroni", "holm", "hochberg", "hommel", "BH", "BY"),
pvalue_cutoff = 0.05,
...
)
ps |
ps a |
group |
character, the variable to set the group, must be one of the var of the sample metadata. |
confounders |
character vector, the confounding variables to be adjusted.
default |
contrast |
this parameter only used for two groups comparison while there are multiple groups. For more please see the following details. |
taxa_rank |
character to specify taxonomic rank to perform
differential analysis on. Should be one of
|
method |
character, used for differential analysis, please see details below for more info. |
transform |
character, the methods used to transform the microbial
abundance. See
|
norm |
the methods used to normalize the microbial abundance data. See
|
norm_para |
arguments passed to specific normalization methods. Most users will not need to pass any additional arguments here. |
disp_para |
additional arguments passed to |
p_adjust |
method for multiple test correction, default |
pvalue_cutoff |
numeric, p value cutoff, default 0.05 |
... |
extra arguments passed to the model. See |
Note that edgeR is designed to work with actual counts. This means that transformation is not required in any way before inputting them to edgeR.
There are two test methods for differential analysis in edgeR, likelihood ratio test (LRT) and quasi-likelihood F-test (QLFT). The QLFT method is recommended as it allows stricter error rate control by accounting for the uncertainty in dispersion estimation.
contrast
must be a two length character or NULL
(default). It is only
required to set manually for two groups comparison when there are multiple
groups. The order determines the direction of comparison, the first element
is used to specify the reference group (control). This means that, the first
element is the denominator for the fold change, and the second element is
used as baseline (numerator for fold change). Otherwise, users do required
to concern this parameter (set as default NULL
), and if there are
two groups, the first level of groups will set as the reference group; if
there are multiple groups, it will perform an ANOVA-like testing to find
markers which difference in any of the groups.
a microbiomeMarker
object.
Yang Cao
Robinson, Mark D., and Alicia Oshlack. "A scaling normalization method for differential expression analysis of RNA-seq data." Genome biology 11.3 (2010): 1-9.
Robinson, Mark D., Davis J. McCarthy, and Gordon K. Smyth. "edgeR: a Bioconductor package for differential expression analysis of digital gene expression data." Bioinformatics 26.1 (2010): 139-140.
edgeR::glmFit()
,edgeR::glmQLFit()
,edgeR::estimateDisp()
,normalize()
data(enterotypes_arumugam)
ps <- phyloseq::subset_samples(
enterotypes_arumugam,
Enterotype %in% c("Enterotype 3", "Enterotype 2")
)
run_edger(ps, group = "Enterotype")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.