preprocessDataset: Preprocess Dataset
In ctlab/LinSeed: Linear Subspace identification to solve complete gene expression deconvolution problem

Preprocesses the given dataset. Preprocessing consists of three major steps: 1) If needed, probes corresponding to the same gene are collapsed, and only the most highly expressed probe is kept for further analysis. This is a common technique in microarray data analysis. 2) If needed, only highly expressed genes are kept for further analysis, which reduces noise. 3) All genes are clustered with k-means, using cosine similarity as the distance measure.
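The first two steps can be sketched in base R. This is a minimal illustration, not the LinSeed implementation: the helper names `collapse_probes` and `keep_top_genes`, the toy data, and the choice of mean expression across samples as the ranking criterion are all assumptions made for the example.

```r
# Hypothetical toy data: 5 probes x 3 samples; the probe-to-gene mapping is made up.
expr <- matrix(
  c(5, 6, 7,
    9, 8, 10,
    1, 2, 1,
    4, 4, 4,
    3, 2, 3),
  nrow = 5, byrow = TRUE,
  dimnames = list(paste0("probe", 1:5), paste0("S", 1:3))
)
gene_of_probe <- c(probe1 = "GeneA", probe2 = "GeneA", probe3 = "GeneB",
                   probe4 = "GeneC", probe5 = "GeneC")

# Step 1 (assumed criterion): for each gene, keep the probe with the
# highest mean expression across samples.
collapse_probes <- function(expr, gene_of_probe) {
  probe_mean <- rowMeans(expr)
  probes_by_gene <- split(names(probe_mean), gene_of_probe[names(probe_mean)])
  best_probe <- vapply(
    probes_by_gene,
    function(p) p[which.max(probe_mean[p])],
    character(1)
  )
  collapsed <- expr[best_probe, , drop = FALSE]
  rownames(collapsed) <- names(best_probe)  # rows are now named by gene
  collapsed
}

# Step 2 (assumed criterion): keep the top_genes rows with the highest mean expression.
keep_top_genes <- function(expr, top_genes) {
  ranked <- order(rowMeans(expr), decreasing = TRUE)
  expr[head(ranked, top_genes), , drop = FALSE]
}

collapsed <- collapse_probes(expr, gene_of_probe)
filtered <- keep_top_genes(collapsed, top_genes = 2)
```

Here `collapsed` has one row per gene, and `filtered` retains only the two genes with the highest mean expression.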

preprocessDataset(dataset, annotation = NULL, geneSymbol = "Gene symbol", samples = NULL, topGenes = 10000)

`dataset`	matrix, data.frame, path to a file, or GSE accession with expression data
`annotation`	data.frame, matrix, or named vector with probe annotation
`geneSymbol`	column of the annotation used to collapse probes to genes; default value is "Gene symbol"
`samples`	character vector of samples. Columns not listed in samples are excluded from the analysis. Default value is NULL, which keeps every sample in the dataset
`topGenes`	integer; how many genes to include in the analysis. Only expressed genes are meant to be included. Default value is 10000
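The behaviour described for `samples` could look roughly like the following. The helper `select_samples` is hypothetical and only mirrors the wording of the argument description; it is not taken from the package source.

```r
# Hypothetical helper, not part of LinSeed: keep only the requested sample columns.
select_samples <- function(expr, samples = NULL) {
  if (is.null(samples)) {
    return(expr)  # NULL keeps every sample in the dataset
  }
  expr[, colnames(expr) %in% samples, drop = FALSE]
}

# Made-up expression matrix: 3 genes x 4 samples.
expr <- matrix(
  1:12, nrow = 3,
  dimnames = list(paste0("gene", 1:3), c("S1", "S2", "S3", "S4"))
)

subset_expr <- select_samples(expr, samples = c("S2", "S4"))  # drops S1 and S3
all_expr <- select_samples(expr)                              # keeps all four columns
```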

Clustered dataset as a matrix; the first column identifies the cluster of each row.
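One way to approximate k-means under cosine similarity with base R is to scale each gene's profile to unit length and then run ordinary `kmeans`. The sketch below assumes that approach; whether LinSeed does exactly this is not stated here, and the toy data, the choice of two clusters, and the column name `cluster` are illustrative assumptions.

```r
set.seed(42)  # k-means starts from randomly chosen centers

# Hypothetical toy data: two groups of genes with opposite trends across samples.
expr <- rbind(
  g1 = c(1, 2, 3),
  g2 = c(2, 4, 6),     # same direction as g1, so their cosine similarity is 1
  g3 = c(10, 20, 31),
  g4 = c(3, 2, 1),
  g5 = c(6, 4, 2),     # same direction as g4
  g6 = c(30, 21, 10)
)
colnames(expr) <- paste0("S", 1:3)

# Scale each row to unit length. For unit vectors, squared Euclidean
# distance equals 2 * (1 - cosine similarity), so Euclidean k-means
# groups rows by direction rather than by magnitude.
unit_rows <- expr / sqrt(rowSums(expr^2))

# Two clusters is an assumption for this toy example.
fit <- kmeans(unit_rows, centers = 2, nstart = 10)

# Place the cluster label in the first column, followed by the expression values.
clustered <- cbind(cluster = fit$cluster, expr)
```

Note that standard `kmeans` does not renormalize the cluster centers, so this is an approximation to spherical k-means and not an exact equivalent.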

ctlab/LinSeed documentation built on Aug. 9, 2019, 4:33 p.m.
