GEM_GWASmodel performs a genome-wide association study (GWAS).
GEM_GWASmodel(env_file_name, snp_file_name, covariate_file_name, GWASmodel_pv,
              output_file_name, qqplot_file_name)
env_file_name |
Text file with rows representing environmental factors and columns representing samples, such as the example data file "env.txt". |
snp_file_name |
Text file with rows representing genotypes, encoded as 1, 2, 3 or any three distinct values for major allele homozygote (AA), heterozygote (AB), and minor allele homozygote (BB), and columns representing samples, such as the example data file "snp.txt". |
covariate_file_name |
Text file with rows representing covariate factors, and columns representing samples, such as the example data file "cov.txt". |
GWASmodel_pv |
The p-value cutoff. Associations with p-values at or below GWASmodel_pv are saved to output_file_name, with the corresponding estimate of effect size (slope coefficient), test statistic, and p-value. Default value is 5.0E-08. |
output_file_name |
The result file, in which each row represents a SNP and its association with the environment. The columns contain the SNPID, the estimate of effect size (slope coefficient), the test statistic, the p-value, and the FDR. |
qqplot_file_name |
Output file for the QQ plot of all p-values. |
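The input layout described in the arguments above (rows are features, columns are samples) can be illustrated with a small base R sketch. All sample names, SNP IDs, and values here are made up, and the tab delimiter and leading ID column are assumptions; the example files shipped with the package remain the authoritative reference for the exact format.

```r
# Toy data: 5 samples, 3 SNPs, 1 environmental factor, 2 covariates.
# Every name and value below is hypothetical.
set.seed(1)
samples <- paste0("S", 1:5)

# Rows = SNPs, columns = samples; genotypes coded 1 (AA), 2 (AB), 3 (BB).
snp <- matrix(sample(1:3, 15, replace = TRUE), nrow = 3,
              dimnames = list(paste0("rs", 1:3), samples))

# Rows = environmental factors, columns = samples.
env <- matrix(rnorm(5), nrow = 1, dimnames = list("env1", samples))

# Rows = covariates, columns = samples.
covt <- rbind(age = c(31, 28, 35, 40, 26),
              sex = c(1, 2, 2, 1, 1))
colnames(covt) <- samples

# Write one matrix as a tab-delimited file with an ID column first
# (delimiter and header layout are assumptions, not confirmed by the docs).
write_matrix <- function(m, path) {
  write.table(data.frame(ID = rownames(m), m, check.names = FALSE),
              file = path, sep = "\t", quote = FALSE, row.names = FALSE)
}

out_dir  <- tempdir()
snp_path <- file.path(out_dir, "snp.txt")
env_path <- file.path(out_dir, "env.txt")
cov_path <- file.path(out_dir, "cov.txt")
write_matrix(snp, snp_path)
write_matrix(env, env_path)
write_matrix(covt, cov_path)
```

Keeping the sample columns in the same order across all three files avoids any ambiguity about how samples are matched.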
GEM_GWASmodel finds genome-wide associations between genetic variants and an environmental factor. Instead of fitting millions of separate linear regressions (N = number_of_SNPs), it uses matrix-based iterative correlation and memory-efficient data analysis. The environmental factor can be a particular phenotype or environmental factor drawn from, for example, birth outcomes, maternal conditions, or disease traits. The genotype data are encoded as 1, 2, 3 or any three distinct values for major allele homozygote (AA), heterozygote (AB), and minor allele homozygote (BB). The linear regression is adjusted for covariates read from the covariate data file. The output of GEM_GWASmodel is a list of SNPs and their associations with the environment. GEM_GWASmodel runs a linear regression equivalent to lm(E ~ G + covt), where G is a matrix of genotype data, E is a matrix of the environmental factor, and covt is a matrix of covariates, all read from the formatted text data files.
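To make the model concrete, the following base R sketch fits lm(E ~ G + covt) one SNP at a time on simulated data and collects the slope, test statistic, p-value, and FDR for each SNP. This is the naive per-SNP approach that the function is described as avoiding, not the package's matrix-based implementation. The simulated data, the helper fit_one_snp, and the use of Benjamini-Hochberg for the FDR column are all assumptions made for illustration.

```r
# Simulated data: 100 samples, 3 SNPs coded 1/2/3, one covariate.
set.seed(42)
n <- 100
G <- matrix(sample(1:3, 3 * n, replace = TRUE), nrow = 3,
            dimnames = list(paste0("rs", 1:3), NULL))
covt <- rnorm(n)

# Environmental factor depends on rs1 and the covariate, plus noise.
E <- 0.5 * G["rs1", ] + 0.3 * covt + rnorm(n)

# Hypothetical helper: fit E ~ g + covt for one SNP and return
# the genotype slope, its t statistic, and its p-value.
fit_one_snp <- function(g) {
  coefs <- summary(lm(E ~ g + covt))$coefficients
  coefs["g", c("Estimate", "t value", "Pr(>|t|)")]
}

# One regression per SNP (row of G).
res <- t(apply(G, 1, fit_one_snp))
colnames(res) <- c("beta", "tstat", "pvalue")
res <- data.frame(SNPID = rownames(res), res, row.names = NULL)

# FDR via Benjamini-Hochberg (an assumption; the docs only say "FDR").
res$FDR <- p.adjust(res$pvalue, method = "BH")

# Most significant SNPs first.
res <- res[order(res$pvalue), ]
print(res)
```

With millions of SNPs this loop becomes slow, which is the motivation the description gives for a matrix-based approach.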
Results are saved automatically to output_file_name and qqplot_file_name.
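A call following the usage signature might look like the sketch below. It assumes that the GEM package is installed and that the example files "env.txt", "snp.txt", and "cov.txt" live in the package's extdata directory; the output file names and the .jpg extension are hypothetical. The call is skipped if any of these assumptions does not hold.

```r
# Output locations (hypothetical file names, written to a temp directory).
out_file <- file.path(tempdir(), "Result_GWASmodel.txt")
qq_file  <- file.path(tempdir(), "QQplot_GWASmodel.jpg")

# Only attempt the call if the GEM package is available.
if (requireNamespace("GEM", quietly = TRUE)) {
  # Assumed location of the example data files named in the arguments.
  data_dir <- system.file("extdata", package = "GEM")
  env_file <- file.path(data_dir, "env.txt")
  snp_file <- file.path(data_dir, "snp.txt")
  cov_file <- file.path(data_dir, "cov.txt")

  if (all(file.exists(c(env_file, snp_file, cov_file)))) {
    # Arguments follow the documented order; 5.0E-08 is the documented default cutoff.
    GEM::GEM_GWASmodel(env_file, snp_file, cov_file, 5.0E-08,
                       out_file, qq_file)
  }
}
```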