Description Usage Arguments Details Value Warnings Author(s) See Also Examples
This function will read tab separated text files with Ct values and feature meta-data from high-throughput qPCR experiments into a qPCRset containing all the relevant information.
1 |
files |
character vector with the names of the files to be read. |
path |
character string with the path to the folder containing the data files. |
n.features |
integer, number of features present on each array (e.g. 384). See details. |
format |
character, the format of the input file. Options are "plain", "SDS", "LightCycler", "CFX", "OpenArray" and "BioMark". See Details. |
column.info |
list, indicating which column number or name the information of interest is in. It is set automatically by |
flag,feature,
Ct,type,
position |
deprecated, use |
header |
logical, does the file contain a header row or not. Only used for |
SDS |
deprecated, use |
n.data |
integer vector, same length as |
samples |
character vector with names for each sample. Per default the file names are used. |
na.value |
integer, a Ct value that will be assigned to all undetermined/NA wells. |
sample.info |
object of class |
... |
any other arguments are passed to |
This is the main data input function for the HTqPCR package for analysing qPCR data. It extracts the threshold cycle, Ct value, of each well on the card, as well as information about the quality (e.g.~passed/failed) of the wells. The function is tuned for data from TaqMan Low Density Array cards, but can be used for any kind of qPCR data.
The information to be extracted is:
flag
integer indicating the number of column containing information about the flags.
feature
integer indicating the number of column containing information about the individual features (typically gene names).
type
integer indicating the number of column containing information about the type of each feature.
position
integer indicating the number of column containing information about the position of features on the card.
Ct
integer indicating the number of column containing information about the Ct values.
Per default, this information is assumed to be in certain columns depending on the input format.
featureNames
, featureType
and featurePos
will be extracted from the first file. If flag
, type
or position
are not included into column.info
, this means that this information is not available in the file. flag
will then be set to "Passed", type
to "Target" and position
to "feature1", "feature2", ... etc until the end of the file. Especially position
might not be available in case the data does not come from a card format, but it is required in subsequent functions in order to disambiguate in case some features are present multiple times.
format
indicates the format of the input file. The options currently implemented are:
plain
A tab-separated text file, containing no header unless header=TRUE
. The information extracted defaults to column.info=list(flag=4, feature=6, type=7, position=3, Ct=8)
.
SDS
An output file from the SDS Software program. This is often used for the TaqMan Low Density Arrays from Applied Biosystems, but can also be used for assays from other vendors, such as Exiqon. column.info
is the same as for "plain".
OpenArray
The TaqMan OpenArray Real-Time PCR Plates. The information extracted defaults to column.info=list(flag="ThroughHole.Outlier", feature="Assay.Assay.ID", type="Assay.Assay.Type", position="ThroughHole.Address", Ct="ThroughHole.Ct")
.
BioMark
The BioMark HD System from Fluidigm, currently including the 48.48 and 96.96 assays. The information extracted defaults to column.info=list(flag="Call", feature="Name.1", position="ID", Ct="Value")
.
CFX
The CFX Automation System from Bio-Rad. The information extracted defaults to column.info=list(feature="Content", position="Well", Ct="Cq.Mean")
.
LightCycler
The LightCycler System from Roche Applied Science. The information extracted defaults to column.info=list(feature="Name", position="Pos", Ct="Cp")
.
The BioMark and OpenArray assays always contain information multiple samples on each assay, such as 48 features for 48 samples for the BioMark 48.48. The results across these samples are always present in a single file, e.g. with 48x48=2304 rows. Setting n.features=2304 will read in all the information and create a qPCRset
object with dimensions 2304x1. Setting n.data=48 and n.features=48 will however automatically convert this into a 48x48 qPCRset
. See openVignette(package="HTqPCR")
for examples. The samples are being read in the order in which they're present in the file, i.e. from row 1 onwards, regardless of how they're loaded onto the particular platform.
If the data was analysed using for example SDS Software it may contain a variable length header specifying parameters for files that were analysed at the same time. If format="SDS"
then readCtData
will scan through the first 100 lines of each file, and skip all lines until (and including) the line beginning with "#", which is the header. The end of the file might also contain some plate ID information, but only the number of lines specified in n.features
will be read.
n.features
indicates the number of features present on each array. For example, for a 384 well plate with just 1 sample, the number would be 382. For a plate with 2 individual samples loaded onto it, n.features=196
and n.data=2
. For 1 file with 5 plates and 2 samples per plate, the numbers are n.features=196
and n.data=10
. n.features*n.data must correspond to the total number of lines to be read from each file.
A "qPCRset"
object.
The files are all assumed to belong to the same design, i.e.~have the same features (genes) in them and in identical order.
Heidi Dvinge
read.delim
for further information about reading in data, and "qPCRset"
for a definition of the resulting object.
1 2 3 4 5 6 7 8 9 | # Locate example data and create qPCRset object
exPath <- system.file("exData", package="HTqPCR")
exFiles <- read.delim(file.path(exPath, "files.txt"))
raw <- readCtData(files=exFiles$File, path=exPath)
# Example of adding missing information (random data in this case)
featureClass(raw) <- factor(rep(c("A", "B", "C"), each=384/3))
pData(raw)[,"rep"] <- c(1,1,2,2,3,3)
## See the package vignette for more examples, including different input formats.
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.