Description Usage Arguments Details Value Note Author(s) References See Also Examples
Parallelized inference of co-regulatory network from gene expression only.
1 2 3 4 5 6 7 8 | hLICORN(numericalExpression, discreteExpression =
discretizeExpressionData(numericalExpression),
TFlist, GeneList = setdiff(rownames(numericalExpression), TFlist),
parallel = c("multicore", "no", "snow"), cluster = NULL,
minGeneSupport=0.1,minCoregSupport = 0.1,
maxCoreg=length(TFlist),
searchThresh=1/3,nGRN=100,
verbose=FALSE)
|
numericalExpression |
A numerical Matrix containing the expression of genes and of transcription factors that will be used to inferred the network. Rownames should contain the Gene names/identifiers. Samples should be in columns but Colnames are not important. The data will be gene-centered but not scaled. |
discreteExpression |
optional. Should be in exactly the same format as |
TFlist |
A character vector containing the names of the genes that
are designated as Transcription Factor or Regulators.
These should be contained in the |
GeneList |
optional. The list of genes for which Gene Regulatory Networks
should be inferred. Should be in the rownames of the expression data.
If not provided will be taken as all the genes in the rownames
of the expression data that are not annotated as TF in |
parallel |
optional.
The type of parallel method to use to run the process on several threads.
Should be a value in "no" for single thread calculation,
"multicore" to use several threads on the same local machine or "snow"
to use the snow package in which case the cluster argument
should contain a cluster object (e.g. |
cluster |
A object of type cluster from the snow package which describes a cluster of computer to run the inference of each gene in parallel. Needed only when parallel is set to snow. |
minGeneSupport |
A float between 0 and 1. Minimum number of samples in which a gene has to have non-zero discretized expression value to be considered for regulatory network inference. |
minCoregSupport |
A float between 0 and 1. Minimum number of samples in which a set of co-regulators have the same non-zero discretized expression value to be considered as a potential set of co-regulator. Default is 0.1 . Can be increased in case of limitations on memmory. |
maxCoreg |
A integer. Maximum size of co-regulator to consider. Default is set to the number of TF. Can be decreased in case of limitations on memmory. |
searchThresh |
A float between 0 and 1. Minimum proportion of sample in which a gene and a set of co-regulators must have the same non-zero discretized value in order to be considered as a potential co-regulator of the gene. |
nGRN |
if NA, takes only the best GRN models. |
verbose |
Sets wether information will appear in console during computation. |
A parallelized implementation of the h-LICORN alogrithm.
The inference of the network can be run in parallel using
either a single machine or on a cluster using the make cluster in the
parallel
package (now a base R package).
The two required inputs are the continuous gene expression matrix
and the list of transcription factor in the rows of the
dataset. For Human, a list of TF is included as default data set
using data(HumanTF)
for symbols or data(HumanTF_entrezgene)
for entrez gene IDs.
In case of memmory overflow, a higher minCoregSupport (0.2 to 0.4) will perform more shallow search of co-regulators but will limit the quantity of memmory used.
In some cases, in particular for very large datasets (several hundreds and thousands of samples), a lower minCoregSupport (down to 0.01) can be used for deep search.
A object of type CoRegNet with gene regulatory network (GRN) containing several solution per gene and specifying the co-regulation interactions, the same network in the form of an adjacency list (adjacencyList) and the inferred co-regulators (coRegulators).
The regulatory network inference process is done in several steps. The speed of the inference is dependant on several factors. The number of genes, the density of the discrete expression matrix (number of 1 and -1).
Remy Nicolle <remy.c.nicolle AT gmail.com>
Elati M, Neuvial P, Bolotin-Fukuhara M, Barillot E, Radvanyi F and Rouveirol C (2007) LICORN: learning cooperative regulation networks from gene expression data. Bioinformatics 23: 2407-2414
Chebil I, Nicolle R, Santini G, Rouveirol C and Elati M (2014) Hybrid Method Inference for the Construction of Cooperative Regulatory Network in Human. IEEE Trans Nanobioscience
discretizeExpressionData,coregnet-class
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 | # Dummy expression data
gexp=matrix(rnorm(2600,sd=3),ncol=100)
gexp=rbind(gexp,do.call(rbind,lapply(1:26,function(i){
tf = sample(1:26,4)
return((gexp[tf[1],]+gexp[tf[2],] -gexp[tf[3],]-gexp[tf[4],] +rnorm(100,sd=0.5))/2)})))
dimnames(gexp)=list(c(letters,LETTERS),paste("s",1:100,sep=""))
## Simple example of network inference
dummyNet=hLICORN(gexp,TFlist = letters)
## Infer a network only on a subset of genes
subgene = unique(dummyNet@GRN$Target)[1:2]
dummyNet=hLICORN(gexp,TFlist = letters,GeneList=subgene)
## Discretize data based on a set of reference samples (here 10 first)
discexp = discretizeExpressionData(gexp,refSamples=1:10)
dummyNet=hLICORN(gexp,TFlist = letters,discreteExpression=discexp)
## The network can be queried using the following functions
# returns the hub regulators
regulators(dummyNet)
# get the regulators of a given gene
regulators(dummyNet,"A")
activators(dummyNet,"A")
targets(dummyNet)
targets(dummyNet,"b")
# or transformed into a data.frame
coregnetToDataframe(dummyNet)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.