View source: R/loadGeneSummary.R
loadGeneSummary | R Documentation |
Load gene summary table
loadGeneSummary(organism = 9606,
status = c("reviewed", "predicted", "inferred", "validated", "provisional"))
organism |
Name of the organism (e.g. "Homo sapiens") or the corresponding taxon ID (e.g. 9606). |
status |
Status of the review of the genes. |
The gene summaries are extracted from RefSeq database (https://ftp.ncbi.nih.gov/refseq/release/complete/*.rna.gbff.gz).
Gene summaries are available in the "COMMENT" section of the *rna.gbff.gz
files.
The Perl script used to parse *rna.gbff.gz
files is available in system.file("extdata", "parse_refseq.pl", package = "GeneSummary")
.
A data frame with the following columns:
RefSeq accession. Note there is also the version number after the accession.
Full organism name.
Taxon ID of the organism.
Entrez gene ID.
Status of the review of the gene.
A long description of the gene.
df = loadGeneSummary(9606)
dim(df)
head(df)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.