Description Usage Arguments Details Value Author(s) References See Also Examples
Function to analyze bam files to generate an ExpressionSet with expression estimates for all samples, read start and fragment length distributions, path counts and optinally processed reads.
1 2 3 4 5 6 | wrapDenovo(bamFile, output_wrapKnown, knownGenomeDB, targetGenomeDB, readLength,
rpkm=TRUE, keep.multihits=TRUE, searchMethod="submodels",
exactMarginal=TRUE, integrateMethod = "plugin", maxExons=40,
islandid, chroms=NULL, keep.pbam=FALSE, keepPbamInMemory=FALSE,
niter=10^3, priorq=3, priorqGeneExpr=2,
mc.cores.int=1, mc.cores=1, verbose=TRUE, seed=1)
|
bamFile |
Names of bam files with the sample to analyze. These must sorted and indexed, and the index must be in the same directory. |
output_wrapKnown |
Optional argument containing the output of an
earlier call to |
knownGenomeDB |
|
targetGenomeDB |
|
readLength |
Read length in bp, e.g. in a paired-end experiment where
75bp are sequenced on each end one would set |
rpkm |
Set to |
keep.multihits |
Set to |
searchMethod |
Method used to perform the model search.
|
exactMarginal |
Set to |
integrateMethod |
Method to compute integrated likelihoods. The default
( |
maxExons |
Prior probabilities of isoform expression are
estimated for genes with 1 up to |
islandid |
Names of the gene island to be analyzed. If missing all gene islands are analyzed |
chroms |
Names of the chromosomes to be analyzed. If missing all chromosomes are analyzed. |
keep.pbam |
Set to |
keepPbamInMemory |
Set to |
niter |
Number of MCMC iterations in the model search algorithm. |
priorq |
Parameter of the Dirichlet prior for the proportion of
reads coming from each variant. We recommend |
priorqGeneExpr |
Parameter of the Dirichlet prior distribution on overall gene expression. Defaults to 2 to ensure non-zero estimates. |
mc.cores |
Number of cores to use in expression estimation. |
mc.cores.int |
Number of cores to use when loading bam files. Be careful as this is a memory intensive step. |
verbose |
Set to |
seed |
Set seed of random number generator. |
The function executes the functions procBam
, getDistrs
,
pathCounts
calcDenovo
and denovoExpr
and formats the output nicely.
Running wrapDenovo
is much more efficient in cpu
speed and memory usage than running these functions separately.
When rpkm
is false the function returns the estimated
proportion of reads arising from each isoform within a gene island.
See the details in help("wrapKnown")
for more information on this.
denovoGenomeDB |
|
.
exp |
Object of class |
distr |
Object of class |
pbam |
List of objects of class |
Miranda Stobbe, David Rossell
Rossell D, Stephan-Otto Attolini C, Kroiss M, Stocker A. Quantifying Alternative Splicing from Paired-End RNA-sequencing data. Annals of Applied Statistics, 8(1):309-330.
calcDenovo
, wrapKnown
, relexprByGene
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | ## not run
## Known isoforms
## library(TxDb.Hsapiens.UCSC.hg19.knownGene)
## hg19DB <- procGenome(TxDb.Hsapiens.UCSC.hg19.knownGene), genome='hg19')
## gtf with known & de novo predictions
## mygtf <- import('hg19_denovo.gtf')
## hg19denovoDB <- procGenome(mygtf, genome='hg19')
## bamFile="/path_to_bam/sorted.bam"
## ans <- wrapDenovo(bamFile=bamFile, targetGenomeDB=hg19denovoDB, knownGenomeDB=hg19DB, readLength=101)
## Estimated expression via BMA
## head(exprs(ans[['exp']]))
## Posterior probability that each isoform is expressed
## head(fData(ans[['exp']]))
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.