View source: R/PlotFunctions.R
histGBSR | R Documentation |
Draw histograms of specified statistics
histGBSR(
x,
stats = c("dp", "missing", "het"),
target = c("marker", "sample"),
binwidth = NULL,
color = c(Marker = "darkblue", Sample = "darkblue"),
fill = c(Marker = "skyblue", Sample = "skyblue")
)
x |
A GbsrGenotypeData object. |
stats |
A string to specify statistics to be drawn. |
target |
Either or both of "marker" and "sample", e.g. |
binwidth |
An integer to specify bin width of the histogram. This value is passed to the ggplot function. |
color |
A named vector "Marker" and "Sample" to specify border color of bins in the histograms. |
fill |
A named vector "Marker" and "Sample" to specify fill color of bins in the histograms. |
You can draw histograms of several summary statistics of genotype counts and read counts per sample and per marker. The "stats" argument can take the following values:
Proportion of missing genotype calls.
Proportion of heterozygote calls.
Reference allele frequency.
Total read counts.
Reference allele read counts.
Alternative allele read counts.
Reference allele read frequency.
Mean of reference allele read counts.
Standard deviation of reference allele read counts.
Quantile of reference allele read counts.
Mean of alternative allele read counts.
Standard deviation of alternative allele read counts.
Quantile of alternative allele read counts.
Mapping quality.
Phred-scaled p-value (strand bias)
Variant Quality by Depth
Symmetric Odds Ratio (strand bias)
Alt vs. Ref read mapping qualities
Alt vs. Ref read position bias
Alt Vs. Ref base qualities
To draw histograms for "missing", "het", "raf", you need to run
countGenotype()
first to obtain statistics. Similary, "dp",
"ad_ref", "ad_alt", "rrf" requires values obtained via
countRead()
. "mq", "fs", "qd", "sor", "mqranksum", "readposranksum",
and "baseqranksum" only work with target = "marker"
, if your data
contains those values supplied via SNP calling tools like
GATK.
A ggplot object.
# Load data in the GDS file and instantiate a [GbsrGenotypeData] object.
gds_fn <- system.file("extdata", "sample.gds", package = "GBScleanR")
gds <- loadGDS(gds_fn)
# Summarize genotype count information to be used in `histGBSR()`
gds <- countGenotype(gds)
# Draw histograms of missing rate, heterozygosity, and reference
# allele frequency per SNP and per sample.
histGBSR(gds, stats = "missing")
# Close the connection to the GDS file
closeGDS(gds)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.