View source: R/readGenomesFromMPF.R
readGenomesFromMPF | R Documentation |
'readGenomesFromMPF()' reads somatic mutations of a single tumor genome (sample) or a set of genomes from an MPF file (Mutation Position Format; see details below) and determines the mutation frequencies according to a specific model of mutational signatures (Alexandrov or Shiraishi).
readGenomesFromMPF(file, numBases=5, type="Shiraishi", trDir=TRUE, enforceUniqueTrDir=TRUE, refGenome=BSgenome.Hsapiens.UCSC.hg19::BSgenome.Hsapiens.UCSC.hg19, transcriptAnno= TxDb.Hsapiens.UCSC.hg19.knownGene::TxDb.Hsapiens.UCSC.hg19.knownGene, verbose=TRUE)
file |
(Mandatory) The name of the MPF file (can be compressed with
|
numBases |
(Mandatory) Total number of bases (mutated base and flanking bases) to be used for sequence patterns. Must be odd. Default: 5 |
type |
(Mandatory) Signature model or type ( |
trDir |
(Mandatory) Specifies whether the transcription direction is
taken into account in the signature model. If so, only mutations within
genomic regions with a defined transcription direction can be considered.
Default: |
enforceUniqueTrDir |
(Optional) Used only if |
refGenome |
(Mandatory) The reference genome ( |
transcriptAnno |
(Optional) Transcript annotation ( |
verbose |
(Optional) Print information about reading and processing the
mutation data. Default: |
An MPF file has the following format (one line per mutation and patient/sample):
[sampleID]<tab>[chrom]<tab>[position]<tab>[ref_bases]<tab>[alt_bases]
A list containing the genomes in terms of frequencies of the mutated
sequence patterns. This list of genomes can be used for
decomposeTumorGenomes
.
Rosario M. Piro
Politecnico di Milano
Maintainer: Rosario
M. Piro
E-Mail: <rmpiro@gmail.com> or <rosariomichael.piro@polimi.it>
http://rmpiro.net/decompTumor2Sig/
Krueger, Piro (2019) decompTumor2Sig: Identification of mutational
signatures active in individual tumors. BMC Bioinformatics
20(Suppl 4):152.
decompTumor2Sig
decomposeTumorGenomes
readGenomesFromVCF
getGenomesFromMutFeatData
### load reference genome and transcript annotation (if direction is needed) refGenome <- BSgenome.Hsapiens.UCSC.hg19::BSgenome.Hsapiens.UCSC.hg19 transcriptAnno <- TxDb.Hsapiens.UCSC.hg19.knownGene::TxDb.Hsapiens.UCSC.hg19.knownGene ### read breast cancer genomes from Nik-Zainal et al (PMID: 22608084) gfile <- system.file("extdata", "Nik-Zainal_PMID_22608084-MPF.txt.gz", package="decompTumor2Sig") genomes <- readGenomesFromMPF(gfile, numBases=5, type="Shiraishi", trDir=TRUE, enforceUniqueTrDir=TRUE, refGenome=refGenome, transcriptAnno=transcriptAnno, verbose=FALSE)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.