View source: R/preprocess.genetic.data.R
preprocess.genetic.data | R Documentation |
This function performs several pre-processing steps, intended for use before function run.gadgets.
preprocess.genetic.data( case.genetic.data, complement.genetic.data = NULL, father.genetic.data = NULL, mother.genetic.data = NULL, ld.block.vec = NULL, bp.param = bpparam(), snp.sampling.probs = NULL, categorical.exposures = NULL, continuous.exposures = NULL, mother.snps = NULL, child.snps = NULL, lower.order.gxe = FALSE )
case.genetic.data |
The genetic data of the disease affected children
from case-parent trios or disease-discordant sibling pairs. If searching for
maternal SNPs that are related to risk of disease in the child, some of the
columns in |
complement.genetic.data |
A genetic dataset for the controls
corresponding to the genotypes in |
father.genetic.data |
The genetic data for the fathers of the cases in
|
mother.genetic.data |
The genetic data for the mothers of the cases in
|
ld.block.vec |
An integer vector specifying the linkage blocks of the
input SNPs. As an example, for 100 candidate SNPs, suppose we specify
|
bp.param |
The BPPARAM argument to be passed to bplapply when
estimating marginal disease associations for each SNP. If using a cluster
computer, this parameter needs to be set with care. See
|
snp.sampling.probs |
A vector indicating the sampling probabilities of
the SNPs in |
categorical.exposures |
(experimental) A matrix or data.frame of
integers corresponding to categorical exposures corresponding to the cases in
|
continuous.exposures |
(experimental) A matrix or data.frame of numeric
values representing continuous exposures corresponding to the families in
|
mother.snps |
If searching for maternal SNPs that are associated
with disease in the child, the indices of the maternal SNP columns in object
|
child.snps |
If searching for maternal SNPs that are associated
with disease in the child, the indices of the child SNP columns in object
|
lower.order.gxe |
(experimental) A boolean indicating whether, if multiple exposures of interest are input, E-GADGETS should search for only for genetic interactions with the joint combination of exposures (i.e., GxGxExE interactions), or if it should additionally search for lower-order interactions that involve subsets of the exposures that were input (i.e., GxGxE in addition to GxGxExE). The default, FALSE, restricts the search to GxGxExE interactions. Users should be cautious about including large numbers of input exposures, and, if they do, very cautious about setting this argument to TRUE. |
A list containing the following:
A matrix of case/maternal genotypes.
A matrix of complement/sibling/paternal genotypes. If running E-GADGETS, this is set to a 1x1 matrix whose single entry is 0, and not used
If running E-GADGETS, A matrix of maternal genotypes, otherwise a 1x1 matrix whose single entry is 0.0, and not used
If running E-GADGETS, A matrix of mpaternal genotypes, otherwise a 1x1 matrix whose single entry is 0.0, and not used
A vector of chi-square statistics corresponding to
marginal SNP-disease associations, if snp.sampling.probs
is not specified, and snp.sampling.probs
otherwise.
A vector eaul to cumsum(ld.block.vec)
.
A design matrix of the input categorical and continuous exposures, if specified. Otherwise NULL.
A boolean indicating whether a GxGxE search is desired.
A vector of the column indices of maternal SNPs in
case.genetic.data
, set to NULL if not applicable.
A vector of the column indices of child SNPs in
case.genetic.data
, set to NULL if not applicable.
data(case) data(dad) data(mom) case <- as.matrix(case) dad <- as.matrix(dad) mom <- as.matrix(mom) res <- preprocess.genetic.data(case[, 1:10], father.genetic.data = dad[ , 1:10], mother.genetic.data = mom[ , 1:10], ld.block.vec = c(10))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.