Description Usage Arguments Value See Also Examples
The genome and gene annotations of some species can be conveniently obtained
from Bioconductor packages. This is more convenient than downloading GTF
files from Ensembl and reading it into R. In these packages, the gene
annotation is stored in a TxDb
object, which has standardized
names for gene IDs, transcript IDs, exon IDs, and so on, which are stored in
the metadata fields in GTF and GFF3 files, which are not standardized.
This function extracts transcript and corresponding gene information from
gene annotation stored in a TxDb
object.
1 | tr2g_TxDb(txdb)
|
txdb |
A |
A data frame with 3 columns: gene
for gene ID, transcript
for transcript ID, and tx_id
for internal transcript IDs used to avoid
duplicate transcript names. For TxDb packages from Bioconductor, gene ID is
Entrez ID, while transcript IDs are Ensembl IDs with version numbers for
TxDb.Hsapiens.UCSC.hg38.knownGene
. In some cases, the transcript ID
have duplicates, and this is resolved by adding numbers to make the IDs
unique.
A data frame with 3 columns: gene
for gene ID, transcript
for transcript ID, and gene_name
for gene names. If other_attrs
has been specified, then those will also be columns in the data frame returned.
Other functions to retrieve transcript and gene info: sort_tr2g
,
tr2g_EnsDb
, tr2g_ensembl
,
tr2g_fasta
, tr2g_gff3
,
tr2g_gtf
, transcript2gene
Other functions to retrieve transcript and gene info: sort_tr2g
,
tr2g_EnsDb
, tr2g_ensembl
,
tr2g_fasta
, tr2g_gff3
,
tr2g_gtf
, transcript2gene
1 2 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.