View source: R/classification.R
classification | R Documentation |
Given two distinct data sets, one of mRNA and one of lncRNA. The classification of the data is done from the structure of the networks formed by the sequences. After this is done classifying with the J48 classifier and randomForest. Can be also created in the current directory a file of type arff called' result 'with all values so that it can be used later. There is also the graphic parameter that when TRUE generates graphs based on the results of each measure. Using the J48 classifier it is possible to generate a tree based on the dataset and then save this tree so that it can be used to predict other RNA sequences
classification(mRNA, lncRNA, word = 3, step = 1, sncRNA, graphic, classifier = c("J48", "RF"), load, save)
mRNA |
Directory where the file .FASTA lies with the mRNA sequences |
lncRNA |
Directory where the file .FASTA lies with the lncRNA sequences |
word |
Integer that defines the size of the word to parse. By default the word parameter is set to 3 |
step |
Integer that determines the distance that will be traversed in the sequences for creating a new connection. By default the step parameter is set to 1 |
sncRNA |
Directory where the file .FASTA lies with the sncRNA sequences (OPTIONAL) |
graphic |
Parameter of the logical type, TRUE or FALSE for graphics generation. As default graphic gets FALSE |
classifier |
Character Parameter. By default the classifier is J48, but the user can choose to use randomForest by configuring as classifier = "RF". The prediction with a model passed by the param load only works with the classifier J48. |
load |
When defined this parameter will be loaded the file which is the model previously saved in the current directory with the name entered in this parameter. No file is loaded by default |
save |
when set, this parameter saves a .arff file with the results of the features in the current directory and also saves the tree created by the J48 classifier so that it can be used to predict RNA sequences. This parameter sets the file name. No file is created by default |
Results with cross-validation or the prediction result
Eric Augusto Ito
# Classification - cross validation library(BASiNET) arqSeqMRNA <- system.file("extdata", "sequences2.fasta", package = "BASiNET") arqSeqLNCRNA <- system.file("extdata", "sequences.fasta", package = "BASiNET") classification(mRNA=arqSeqMRNA,lncRNA=arqSeqLNCRNA) classification(mRNA=arqSeqMRNA,lncRNA=arqSeqLNCRNA, save="example") #Save Tree to Predict Sequences # Prediction mRNApredict <- system.file("extdata", "sequences2-predict.fasta", package = "BASiNET") lncRNApredict <- system.file("extdata", "sequences-predict.fasta", package = "BASiNET") modelPredict <- system.file("extdata", "modelPredict.dat", package = "BASiNET") classification(mRNApredict,lncRNApredict,load=modelPredict)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.