Description Usage Arguments Value See Also Examples
Use filtering paramaters to filter out high and low depth reads as well as low Genotype Quality as defined by GATK. All filters are optional but recommended.
1 2 | filterSNPs(SNPset, refAlleleFreq, filterAroundMedianDepth, minTotalDepth,
maxTotalDepth, minSampleDepth, depthDifference, minGQ, verbose = TRUE)
|
SNPset |
The data frame imported by |
refAlleleFreq |
A numeric < 1. This will filter out SNPs with a
Reference Allele Frequency less than |
filterAroundMedianDepth |
Filters total SNP read depth for both bulks. A
median and median absolute deviation (MAD) of depth will be calculated.
SNPs with read depth greater or less than |
minTotalDepth |
The minimum total read depth for a SNP (counting both bulks) |
maxTotalDepth |
The maximum total read depth for a SNP (counting both bulks) |
minSampleDepth |
The minimum read depth for a SNP in each bulk |
depthDifference |
The maximum absolute difference in read depth between the bulks. |
minGQ |
The minimum Genotype Quality as set by GATK. This is a measure of how confident GATK was with the assigned genotype (i.e. homozygous ref, heterozygous, homozygous alt). See What is a VCF and how should I interpret it? |
verbose |
logical. If |
Returns a subset of the data frame supplied which meets the filtering
conditions applied by the selected parameters. If verbose
is
TRUE
the function reports the number of SNPs filtered in each step
as well as the initiatl number of SNPs, the total number of SNPs filtered
and the remaining number.
See mad
for explaination of calculation of
median absolute deviation.
What
is a VCF and how should I interpret it? for more information on GATK
Fields and Genotype Fields
1 2 3 4 5 6 7 8 9 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.