Description Usage Arguments Value Author(s) See Also Examples
Partition synthetic dataset to training and test set
1 2 3 4 5 6 7 | partition_dataset(
dt_obj,
data_train_prcg = 0.5,
region_train_prcg = 0.95,
cpg_train_prcg = 0.5,
is_synth = FALSE
)
|
dt_obj |
Melissa data object |
data_train_prcg |
Percentage of genomic regions that will be fully used for training, i.e. across the whole region we will have no CpGs missing. |
region_train_prcg |
Fraction of genomic regions to keep for training set, i.e. some genomic regions will have no coverage at all during training. |
cpg_train_prcg |
Fraction of CpGs in each genomic region to keep for training set. |
is_synth |
Logical, whether we have synthetic data or not. |
The Melissa object with the following changes. The 'met' element will now contain only the 'training' data. An additional element called 'met_test' will store the data that will be used during testing to evaluate the imputation performance. These data will not be seen from Melissa during inference.
C.A.Kapourani C.A.Kapourani@ed.ac.uk
create_melissa_data_obj
, melissa
,
filter_regions
1 2 | # Partition the synthetic data from Melissa package
dt <- partition_dataset(melissa_encode_dt)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.