PAC_saturation: Filter a PAC object on sequence size and coverage
In Danis102/seqpac: Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

PAC_saturation

R Documentation

Filter a PAC object on sequence size and coverage

Description

PAC_saturation Performs an sequence diversity/saturation analysis on a PAC objects.

Usage

PAC_saturation(PAC, resample = 10, steps = 10, thresh = c(1, 10), threads = 1)

Arguments

`PAC`	PAC-list object containing a Counts data.frame with sequences as row names and samples as column names.
`resample`	Integer setting the number of permutations at each percentage step (default=10).
`steps`	Integer defining the number of percentage steps between 0-100 original dataset (default=10).
`thresh`	Integer vector containing mean count thresholds that will be targeted. Default is set to c(1,10), where each new occurrence reaching 1 count (>=1) and each new occurrence reaching 10 counts (>=10) will be analyzed.
`threads`	Number of cores to be used for performing the permutations.

Details

Given a PAC object the function will perform a sequence saturation analysis. This is done by downsampling the original dataset by permutation at different percentages of the original dataset. The closer the curve at the original sequence depth (100 diversity of sequences for the original dataset. Approaching the plateau usually means that the sequencing depth of the library have sampled the full population of sequences available in the sample. Here we use an none-linear least square (nls) model with a self-starter for asymptotic regression (SSasympt) to describe the rate in which the library approaches the plateau.

Value

A list with ggplot2 graph objects: The 1:st graph shows saturation/diversity result at the 1:st threshold. The 2:nd graph shows saturation/diversity result at the 2:nd threshold, etc.

Examples



# OBS! The example below is using already down-sampled data. Still, sequence
# diversity is rather saturated on >=1 occurrence. meaning that most sequences
# in the samples has been caught. Nonetheless, sequences reaching >=2
# occurrences have not plateaued.

load(system.file("extdata", "drosophila_sRNA_pac_filt_anno.Rdata", 
                  package = "seqpac", mustWork = TRUE))

plot_lst  <- PAC_saturation(pac, resample=10, steps=10, 
                            thresh=c(1,2), threads=1)
names(plot_lst)
cowplot::plot_grid(plotlist=plot_lst)

Danis102/seqpac documentation built on Aug. 26, 2023, 10:15 a.m.

Danis102/seqpac index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Danis102/seqpac
Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

PAC_saturation: Filter a PAC object on sequence size and coverage
In Danis102/seqpac: Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

Filter a PAC object on sequence size and coverage

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to PAC_saturation in Danis102/seqpac...

R Package Documentation

Browse R Packages

We want your feedback!

Danis102/seqpac Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

PAC_saturation: Filter a PAC object on sequence size and coverage In Danis102/seqpac: Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

Filter a PAC object on sequence size and coverage

Description

Usage

Arguments

Details

Value

See Also

Examples

Related to PAC_saturation in Danis102/seqpac...

R Package Documentation

Browse R Packages

We want your feedback!

Danis102/seqpac
Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts

PAC_saturation: Filter a PAC object on sequence size and coverage
In Danis102/seqpac: Seqpac: A Framework for smallRNA analysis in R using Sequence-Based Counts