nearestTSS | R Documentation |
Find nearest TSS and distance to nearest TSS for a vector of chromosome loci.
nearestTSS(chr, locus, species="Hs")
chr |
character vector of chromosome names. |
locus |
integer or numeric vector of genomic loci, of same length as chr. |
species |
character string specifying the species.
Possible values are |
This function takes a series of genomic loci, defined by a vector of chromosome names and a vector of genomic positions within the chromosomes,
and finds the nearest transcriptional start site (TSS) for each locus.
The chromosome names can be in the format "1","2","X" or can be "chr1","chr2","chrX".
For genes with more than one annotated TSS, only the most 5' (upstream) of the alternative TSS is reported.
This function uses the Bioconductor organism package named "org.XX.eg.db", where XX is the value of species.
Note that each organism package supports only a particular build of the genome for that species.
For human (species="Hs"), the results are for the hg38 genome build.
For mouse (species="Mm"), the results are for the mm10 genome build.
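As a rough illustration of the package naming rule above, the following base-R sketch builds the organism package name from species and checks whether that package is installed. The variable names are illustrative, and the installation check is only an assumption about how one might verify the dependency before calling the function; it is not taken from the function's source.

```r
# Species code as passed to nearestTSS(); "Hs" is the documented default.
species <- "Hs"

# Organism package name following the "org.XX.eg.db" pattern described above.
pkg <- paste0("org.", species, ".eg.db")

# Check for the package without attaching it (assumed pre-flight check).
installed <- requireNamespace(pkg, quietly = TRUE)
if (!installed) {
  message("Package ", pkg, " is not installed.")
}
```

If the package is missing, it would normally be installed from Bioconductor before calling nearestTSS.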
A data.frame with the following columns:
gene_id |
character vector giving the Entrez Gene ID of the nearest TSS for each element of chr and locus. |
symbol |
character vector of gene symbols. |
strand |
character vector with |
tss |
integer vector giving the genomic position of the TSS. |
width |
integer vector giving genomic width of the gene. |
distance |
integer vector giving distance to nearest TSS. Positive values mean that the TSS is downstream of the locus; negative values mean that it is upstream. Gene body loci will therefore have negative distances and promoter loci will have positive distances. |
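The sign convention for distance can be sketched in base R as follows. The helper classify_locus is hypothetical (it is not part of the package), the input values are made up for illustration, and the comparison against width is an assumption about how one might separate loci inside the gene from loci past its end.

```r
# Hypothetical helper: label each locus from the 'distance' and 'width'
# columns of the returned data.frame.
classify_locus <- function(distance, width) {
  ifelse(distance > 0, "upstream of TSS",          # TSS is downstream of locus
    ifelse(distance == 0, "at TSS",
      ifelse(abs(distance) < width,                # assumed gene-span check
             "within gene span", "beyond gene span")))
}

# Made-up values for illustration only; not real annotation output.
toy <- data.frame(distance = c(1500L, 0L, -200L, -90000L),
                  width    = c(30000L, 30000L, 30000L, 30000L))
toy$position <- classify_locus(toy$distance, toy$width)
print(toy)
```

Applied to real output, the same helper would take the distance and width columns of the data.frame returned by nearestTSS.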
Gordon Smyth
nearestReftoX
nearestTSS(chr = c("1","1"), locus = c(1000000,2000000))