Home

/

GitHub

/

csdaw/goseq2

/

nullp: Probability Weighting Function

nullp: Probability Weighting Function
In csdaw/goseq2: Gene Ontology analyser for RNA-seq and other length biased data

View source: R/nullp.R

nullp

R Documentation

Probability Weighting Function

Description

Calculates a Probability Weighting Function for a set of genes based on a given set of biased data (usually gene length) and each genes status as differentially expressed or not.

Usage

nullp(DEgenes, genome, id, bias.data=NULL,plot.fit=TRUE)

Arguments

`DEgenes`	A named binary vector where 1 represents DE, 0 not DE and the names are gene IDs.
`genome`	A string identifying the genome that `genes` refer to. For a list of supported organisms run `supportedGenomes`.
`id`	A string identifying the gene identifier used by `genes`. For a list of supported gene IDs run `supportedGeneIDs`.
`bias.data`	A numeric vector containing the data on which the DE may depend. Usually this is the median transcript length of each gene in bp. If set to `NULL` `nullp` will attempt to fetch length using `getlength`.
`plot.fit`	Plot the PWF or not? Calls `plotPWF` with default values if `TRUE`.

Details

It is essential that the entire analysis pipeline, from summarizing raw reads through to using goseq be done in just one gene identifier format. If your data is in a different format you will need to obtain the gene lengths and supply them to the nullp function using the bias.data arguement. Converting to a supported format from another format should be avoided whenever possible as this will almost always result in data loss.

NAs are allowed in the bias.data vector if you do not have information about a certain gene. Setting a gene to NA is preferable to removing it from the analysis.

If bias.data is left as NULL, nullp attempts to use getlength to fetch GO catgeory to gene identifier mappings.

It is recommended you review the fit produced by the nullp function before proceeding by leaving plot.fit as TRUE.

Value

A data frame with 3 columns, named "DEgenes", "bias.data" and "pwf" with the rownames set to the gene names. Each row corresponds to a gene with the DEgenes column specifying if the gene is DE (1 for DE, 0 for not DE), the bias.data column giving the numeric value of the DE bias being accounted for (usually the gene length or number of counts) and the pwf column giving the genes value on the probability weighting function. This object is usually passed to goseq to calculate enriched categories or plotPWF for further plotting.

Author(s)

Matthew D. Young myoung@wehi.edu.au

References

Young, M. D., Wakefield, M. J., Smyth, G. K., Oshlack, A. (2010) Gene ontology analysis for RNA-seq: accounting for selection bias Genome Biology Date: Feb 2010 Vol: 11 Issue: 2 Pages: R14

Examples

data(genes)
pwf <- nullp(genes, 'hg19', 'ensGene')

csdaw/goseq2 documentation built on April 23, 2022, 12:37 a.m.

csdaw/goseq2 index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

csdaw/goseq2
Gene Ontology analyser for RNA-seq and other length biased data

nullp: Probability Weighting Function
In csdaw/goseq2: Gene Ontology analyser for RNA-seq and other length biased data

Probability Weighting Function

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Related to nullp in csdaw/goseq2...

R Package Documentation

Browse R Packages

We want your feedback!

csdaw/goseq2 Gene Ontology analyser for RNA-seq and other length biased data

nullp: Probability Weighting Function In csdaw/goseq2: Gene Ontology analyser for RNA-seq and other length biased data

Probability Weighting Function

Description

Usage

Arguments

Details

Value

Author(s)

References

See Also

Examples

Related to nullp in csdaw/goseq2...

R Package Documentation

Browse R Packages

We want your feedback!

csdaw/goseq2
Gene Ontology analyser for RNA-seq and other length biased data

nullp: Probability Weighting Function
In csdaw/goseq2: Gene Ontology analyser for RNA-seq and other length biased data