BSFDataSet | R Documentation |
BSFDataSet
contains the class GenomicRanges
, which is used to
store input ranges. Alongside with the iCLIP signal in list
structure
and additional meta data as data.frame
.
BSFDataSet(ranges, meta, signal, dropSeqlevels = TRUE, silent = FALSE)
BSFDataSet(ranges, meta, signal, dropSeqlevels = TRUE, silent = FALSE)
BSFDataSetFromBigWig(ranges, meta, silent = FALSE, dropSeqlevels = TRUE)
ranges |
a |
meta |
a |
signal |
a |
dropSeqlevels |
enforce seqnames to be the same in ranges and signal,
by dropping unused seqlevels which is required for most downstream functions
such as |
silent |
suppress messages but not warnings (TRUE/ FALSE) |
The ranges are enforced to have to have a "+" or "-" strand annotation,"*" is not allowed. They are expected to be of the same width and a warning is thrown otherwise.
The meta information is stored as data.frame
with at least two
required columns, 'id' and 'condition'. They are used to build the unique
identifier for each replicate split by '_' (eg. id = 1 and condition = WT
will result in 1_WT).
The meta data needs to have the additional columns 'clPlus' and 'clMinus'
to be present if BSFDataSetFromBigWig
is called.
It is used to provide the location to the iCLIP coverage
files to the import function. On object initialization these files are
loaded and internally represented in the signal slot of the object
(see BSFDataSet
).
The iCLIP signal is stored in a special list structure.
At the lowest level crosslink counts per nucleotide are stored as
Rle
per chromosome summarized as a SimpleRleList
. Such a list
exits for each replicate and must be named by the replicate identifier
(eg. 1_WT). Therefore this list contains always exactly the same number of
entries as the number of replicates in the dataset. Since we handle strands
initially seperated from each other this list must be given twice, once
for each strand. The strand specific entries must be named 'signalPlus' and
'signalMinus'.
The option dropSeqlevels
forces the seqnames of the ranges and
the signal to be the same. If for a specific chromosome in the ranges
no respective entry in the signal list can be found, then entries with
that chromosome are dropped This behavior is needed to keep the
BSFDataSet object in sync, which is required for downstream functions
such as coverageOverRanges
A BSFDataSet object.
# load data
files <- system.file("extdata", package="BindingSiteFinder")
load(list.files(files, pattern = ".rda$", full.names = TRUE))
rng = getRanges(bds)
sgn = getSignal(bds)
mta = getMeta(bds)
bdsNew = BSFDataSet(ranges = rng, signal = sgn, meta = mta)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.