createExonByTranscriptCdf.AffymetrixCdfFile | R Documentation |
Creates an exon-by-transcript CDF based on the probesets defined in an "exon-only" CDF and transcript-exon mapping of a NetAffx probeset annotation data file.
## S3 method for class 'AffymetrixCdfFile'
createExonByTranscriptCdf(cdf, csv, tags=c("*"), path=getPath(cdf),
type=c("all", "core", "extended", "full", "main", "control", "cds"), subsetBy=NULL,
within=NULL, ..., overwrite=FALSE, verbose=FALSE)
cdf |
An |
csv |
An |
tags |
Additional tags added to the filename of created CDF, i.e. <chiptype>,<tags>.cdf. |
path |
The output path where the custom CDF will be written. |
type |
A |
subsetBy |
An optional |
within |
A |
... |
Additional arguments passed to |
overwrite |
If |
verbose |
... |
Returns an AffymetrixCdfFile
.
The template CDF - argument cdf
- should be an "exon-only" CDF:
each unit has one group/probeset, which is the exon.
An example of this is the "unsupported" HuEx-1_0-st-v2.cdf
from Affymetrix, which has 1,432,154 units.
Such "exon-only" CDFs do not contain information about clustering
exons/probesets into gene transcripts.
The CDF may also contain a number of non-exon probesets corresponding
to control probes, which can contain very large numbers of
probes per probeset. Such units are dropped/ignored by this method.
The transcripts (=units) will be ordered as they appear in the NetAffx annotation file. Within each transcript (=unit), the exons (=groups) are ordered lexicographically by their names.
In the created CDF, each unit corresponds to one transcript cluster, and each group within a unit corresponds to the exons within that transcript cluster. Thus, the unit names correspond to the transcript cluster names and the group names correspond to the exon names.
The exon names are defined by unit names of the exon-only CDF,
whereas the transcript names are defined by the
transcriptClusterId
column in the NetAffx annotation data file.
These transcript and exon names are often "non-sense" integers.
In order to map these to more genome-friendly names, use the various
annotations available in the NetAffx annotation data file.
Henrik Bengtsson adopted from createTranscriptCDF()
written
by Ken Simpson, Elizabeth Purdom and Mark Robinson.
## Not run:
# The exon-only CDF
cdf <- AffymetrixCdfFile$byChipType("HuEx-1_0-st-v2")
# The NetAffx probeset annotation data file
csv <- AffymetrixNetAffxCsvFile("HuEx-1_0-st-v2.na24.hg18.probeset.csv", path=getPath(cdf))
# Create a CDF containing all core probesets:
cdfT <- createExonByTranscriptCdf(cdf, csv=csv, tags=c("*,HB20110911"))
print(cdfT)
# Create CDF containing the core probesets with 3 or 4 probes:
cdfT2 <- createExonByTranscriptCdf(cdf, csv=csv,
tags=c("*,bySize=3-4,HB20110911"),
subsetBy="probeCount", within=c("3", "4"))
print(cdfT2)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.