View source: R/import_spliceJunctions.R
importSpliceJunctions | R Documentation |
Generates putative gene-models based on supplied genomic coordinates of splice-junctions.
Input should be a tibble containing the following columns:
junctionA: Genomic coordinates of the 5'-junction. (format: chr:start:strand, i.e.: chr1:100:+)
junctionB: Genomic coordinates of the 3'-junction. (format: chr:end:strand, i.e.: chr1:150:+)
sample: Names of the samples. (character, optional)
identifier: The identifier which will be used in downstream analysis. (character, optional)
Common splice-junction formats (BED and SJ.out.tab (STAR)) can also be supplied and are converted into the correct DataFrame.
By utilizing two separate junction-sites, interchromosomal trans-splicing or chimeric transcripts from genomic fusions (e.g., resulting from the BCR/ABL1 fusion-gene) can also be handled.
importSpliceJunctions( ProteoDiscography, inputSpliceJunctions, isTopHat = TRUE, samples = NULL, aggregateSamples = FALSE, removeExisting = FALSE, overwriteDuplicateSamples = FALSE )
ProteoDiscography |
(ProteoDiscography): ProteoDiscography object which stores the annotation and genomic sequences. |
inputSpliceJunctions |
(tibble): Tibble containing the splice-junctions. |
isTopHat |
(logical): Are the imported (.BED) files from TopHat? If so, the start-end of the SJ are corrected for max. overhang. |
samples |
(character): Preferred names for the samples if BED / TAB files are supplied, default is derived from filepath. |
aggregateSamples |
(logical): Should splice-junctions from multiple samples be aggregated? Or should sample-specific models be generated? If genomic variants are to be incorporated within the derived splice-transcripts, the names of samples need to be match. |
removeExisting |
(logical): Should existing entries be removed? |
overwriteDuplicateSamples |
(logical): Should duplicate samples be overwritten? |
ProteoDiscography with imported splice-junctions.
Job van Riet j.vanriet@erasmusmc.nl
Wesley van de Geer w.vandegeer@erasmusmc.nl
ProteoDiscography.hg19 <- ProteoDisco::generateProteoDiscography( TxDb = TxDb.Hsapiens.UCSC.hg19.knownGene::TxDb.Hsapiens.UCSC.hg19.knownGene, genomeSeqs = BSgenome.Hsapiens.UCSC.hg19::BSgenome.Hsapiens.UCSC.hg19 ) # Import from file. ProteoDiscography.hg19 <- ProteoDisco::importSpliceJunctions( ProteoDiscography = ProteoDiscography.hg19, inputSpliceJunctions = system.file('extdata', 'spliceJunctions_pyQUILTS_chr22.bed', package = 'ProteoDisco'), # (Optional) Rename samples. samples = 'pyQUILTS', # Specify that the given BED files are obtained from TopHat. # Chromosomal coordinates from TopHat require additional formatting. isTopHat = TRUE, ) # Or, import splice-junctions (even spanning different chromosomes) based on our format. testSJ <- readr::read_tsv(system.file('extdata', 'validationSetSJ_hg19.txt', package = 'ProteoDisco')) # Add custom SJ to ProteoDiscography. ProteoDiscography.hg19 <- ProteoDisco::importSpliceJunctions( ProteoDiscography = ProteoDiscography.hg19, inputSpliceJunctions = testSJ, # Append to existing SJ-input. removeExisting = FALSE )
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.