loadEData | R Documentation |
This function implements a general interface for loading the pre-defined GEO2KEGG microarray compendium and the TCGA RNA-seq compendium. It also allows loading of user-defined data from file.
loadEData(edata, nr.datasets = NULL, cache = TRUE, ...)
edata |
Expression data compendium. A character vector of length 1 that must be either
See details. |
nr.datasets |
Integer. Number of datasets that should be loaded from the compendium. This is mainly for demonstration purposes. |
cache |
Logical. Should an already cached version used if available?
Defaults to |
... |
Additional arguments passed to the internal loading routines of the GEO2KEGG and TCGA compendia. This currently includes for loading of the GEO2KEGG compendium
And for loading of the TCGA compendium
|
The pre-defined GEO2KEGG microarray compendium consists of 42 datasets investigating a total of 19 different human diseases as collected by Tarca et al. (2012 and 2013).
The pre-defined TCGA RNA-seq compendium consists of datasets from The Cancer Genome Atlas (TCGA, 2013) investigating a total of 34 different cancer types.
User-defined data can also be loaded, given that datasets, preferably of
class SummarizedExperiment
, have been saved as
RDS
files.
A list
of datasets, typically of class
SummarizedExperiment
.
Note that loadEData("geo2kegg", preproc = FALSE)
(the default)
returns the original microarray probe level data as a list of
ExpressionSet
objects. Use preproc = TRUE
or
the maPreproc
function to summarize the probe level
data to gene level data and to obtain a list
of
SummarizedExperiment
objects.
Ludwig Geistlinger <Ludwig.Geistlinger@sph.cuny.edu>
Tarca et al. (2012) Down-weighting overlapping genes improves gene set analysis. BMC Bioinformatics, 13:136.
Tarca et al. (2013) A comparison of gene set analysis methods in terms of sensitivity, prioritization and specificity. PLoS One, 8(11):e79217.
The Cancer Genome Atlas Research Network (2013) The Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet, 45(10):1113-20.
Rahman et al. (2015) Alternative preprocessing of RNA-Sequencing data in The Cancer Genome Atlas leads to improved analysis results. Bioinformatics, 31(22):3666-72.
SummarizedExperiment
,
ExpressionSet
, maPreproc
# (1) Loading the GEO2KEGG microarray compendium
geo2kegg <- loadEData("geo2kegg", nr.datasets=2)
# (2) Loading the TCGA RNA-seq compendium
tcga <- loadEData("tcga", nr.datasets=2)
# (3) reading user-defined expression data from file
data.dir <- system.file("extdata/myEData", package="GSEABenchmarkeR")
edat <- loadEData(data.dir)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.