knitr::opts_chunk$set(echo = TRUE,
                      collapse = TRUE,
                      comment = "#>")
BiocStyle::markdown()

This tutorial concerns the use of Magellan at a end-user level. It describes how the interface works. briefly explains the structure/strategy of data managment within the different workflows and how the user interface works. It is not a technical document. However, the developers who want to write their own process modules or just better understand deeply how Magellan works, we advice beginning with this article before switching to more detailed articles.

Overview

As Magellan consists in a Shiny module, the best manner to launch and use Magellan is to integrate it into a Shiny App framework. This app does not to be complex but is is used for its ui() and server() functions.

Magellan is not a ready-to-use package because it only provides a deep framework to manage several steps in a workflow. So, it is necessary to write a minimum of R code so as to use it.

The package Magellan is heavily based on Shiny modules and the processes/pipelines it manages are also built as Shiny modules. A module correspond to a single process or pipeline. However, Magellan is just the core program to manage the different steps and results and it does not contain any module of process nor pipeline. It contains only the modules which allow to navigate between steps of a process.

Magellan is a workflow manager which can handle processes with a series of steps. We define as:

knitr::include_graphics("./figs/process_overview.png", error = FALSE)

Hierarchical levels of workflow managers

Magellan proposes two types of workflow managers, depending of the nature of the steps.

The basic brick of navigation consists of a unique workflow manager which can manage a process with simple steps. Each of these steps are a unique operation on the data.

A more complex level consists of a workflow manager in which each step can integrate a workflow manager. This situation occurs for example when working with a pipeline. Thus, the whole structure of a Magellan implementation is a hierachical tree where leaves are basic bricks and nodes are of complex level type.

The navigation in such a complex structure is the same as for the simple brick as it respect the order of processes.

knitr::include_graphics("./figs/hierarchical_structure.png", error = FALSE)

Two timelines (horizontal and vertical) are implemented in Magellan. For now, in order to make a suitable GUI, two levels can be used: the basic brick with a horizontal timeline and a first complex level with vertical timeline.

Requirements

the package Magellan is has be used with processes/pipelines. These processes are Shiny modules which functions ui() and server() must be available (loaded in the R environment). They can be in individual R scripts of embedded in a R package

A md file may also be accessible as for the Description step.

Each process

User interface

Layouts

The basic user interface of Magellan is divided into two main parts: a navigation bar and a panel which shows the content of the current step. As one can see in xxx, two layouts are implemented: a horizontal navigation bar or a vertical one.

This layout is used by the process workflow.

knitr::include_graphics("./figs/process_layout.png", error = FALSE)

In the case of a pipeline workflow, the same schema is used. The difference is in the content of the screens part: in the process worklow, it contains the simple screens of the steps and in a pipeline workflow, it contains the user interface of the different processes (see xxx). As for simpler workflow, two layouts are available for the position of th navigation bar.

knitr::include_graphics("./figs/pipeline_layout.png", error = FALSE)

Navigation bar

The navigation bar which contains four items:

Buttons

The two buttons 'Prev' and 'Next' allow to go respectively forward or backward in the timeline. If the current step is the first one, than the 'Prev' button is disabled. If the current step is the last one, then the 'Next' button is disabled. In any other case, those buttons are enabled.

The button 'Reset' allow to reset a process or step. Please refer to xxx for more details.

Timeline

The timeline is composed of several marks (circles), one for each step of the process. The name of the step is written below the mark.

A short line is showed under the step that is currently active in the server. That means that the UI showed under the timeline correspond to this step and all processes on the object will be related to this step.

The steps are defined in the source code of the process, in the config variable. They have a property called 'Mandatory'. That means that it must be validated by the user. The steps marked as non-mandatory are facultative.

A step can be in three different states:

Different colors and style of the steps marks are available to easily vizualize the state of the steps:

General rules:

Legend of colors and styles:

If the color is plain, then the step is enabled, otherwise (transparent color) it is disabled. In the latter case, all the widgets of such a step are also disabled.

If the circle is empty, then the step is either skipped or non validated.

Screens for steps (steps UI)

The first and last steps are particular in the sense that they are the input and output doors of the workflow, thus they are mandatory. Moreover, there name is fixed.

The first step (called 'Description') contains the description of the process and its different steps. If an object has been sent to the process server, then the Load button is enabled. At the contrary, if a NULL object is sent, it stays disabled. That allows the user to view the different steps event if he does not want to run the process.

Its look & feel is standardized if the developer use common scripts in Magellan to create its own module.

The Data processing steps (mid-steps) contain the interface to perform actions on the dataset. Their name is defined by the developer of the module and contained in the configuration part of the module as well as all the widgets, plots, etc...

The last step (names 'Save') does not contain any treatment on the dataset. It only contatenate the result of the process (which lead to a dataset) and append to the dataset given in input. This new object is then the result of the module.

Process workflow \label{simpleWorkflow}

Each step is built on the same basic UI. It contains graphical elements related to the process such as widgets (for parameters), different plots and graphics. It also contains a button called Perform which is used to validate the step.

Launch the module \label{simpleWFLaunch}

Thus, to use Magellan with a process module, it is necessary to load in memory the functions relatives to those modules (it can be in individual scripts or included in a package).

Suppose we have a module called 'process1'. We have to load the two functions mod_process1_ui() and mod_process1_server() (Magellan will call those functions). If there are not found, a warning appears and Magellan stops.

In the example below, one launch Magellan with the process 'Process1' and an dataset called 'feat1', provided by Magellan.

library(Magellan)

ui <- fluidPage(
  mod_nav_ui('Process1')
)


server <- function(input, output){
  data(feat1, package='Magellan')
  rv <- reactiveValues(
    dataIn = feat1,
    dataOut = NULL
  )

  observe({
    rv$dataOut <- mod_nav_server(id = 'Process1',
                                 dataIn = reactive({rv$dataIn})
                                 )
    })
}


shinyApp(ui, server)

In the example above, the dataset is loaded directly in the server function for a simpler example. If t

The simple level is a process, defined with its steps; each of these steps realize some computation on the object. The input of the step n correspond to the output of the step n-1 and the output of the step n will be the input for the step n+1.

At each step, the input and ouput are the structure than the object given in input of the whole process.

The process (and all of its steps) work on a specific item of the object (generally, the last one) and generate a new item. The output of the process is then tho object given in input plus the new item, processed by the steps.

Contrary to the workflows of higher level, each step of a process does not affect the object. The results of computations are stored in a temporary variable and this is this variable that is modified from step to another.

Once the process server has been loaded, the UI is opening on the first step of the process. This step is always a Description step which explains the content of the process

The server can be loaded with an object that contains data or a NULL object. In the first case, it means that an object has been sent into the process and the user can work on its data

At the beginning, the inner object is NULL and all the UIs are disabled. It is possible to navigate into steps in order to watch the different UIs but one cannot do anything at this time. In particular, the buttons are disabled except the button n the Description step.

In fact, when an object is sent to the server, it is stored in a temporary place. At this time, the server is ready to run. When the user clicks on the load button on the Description step, then the object is loaded in the process.

Navigate through steps \label{simpleWFNavigate}

Thanks to the buttons 'Next' and 'Prev', the user can view the different steps without execute them.

Validate a step \label{simpleWFValidate}

The workflow manager is a sorted list in which the execution of different steps obey to a certain order. This order is defined in the configuration of the process and cannot be modified by the user.

So, the steps are executed from the first one (left) to the last one, at the right of the timeline.

By default, all steps are undone. At any moment, the validation of the current step (by clicking on its Perform button) has several effects:

Example

Suppose the following configuration process

knitr::include_graphics("./figs/example1.png", error = FALSE)

Reset a step \label{simpleWFReset}

At this lower level of navigation (process), The button Reset is used to reset the current process. Obviously, reseting steps after the last validated one has no effect on the workflow.

Rules of reseting:

Example Suppose there are four steps (named from 1 to 5). Steps 1, 2 are validated, step 3 is skipped and the current step is 4.

Save the process \label{simpleWFSave}

When the last step ('Save') is enabled (which is the case even if there are skipped steps), it is possible to finish the process.

This step is simple as it contains only a 'Save' button. By clicking on it, the output variable of the process is instantiated with the result of the entire process. If this process has been launched from another one (i.e. pipeline), this output dataset is caught by this caller program.

The effect of this action is to enable the steps that can be enabled

Pipeline workflow \label{composedWF}

This section describes the behaviour of a complex workflow with two levels.

Here, each step is itself a (simple or composed) workflow (see xxx)

Launch the module \label{composedWFLaunch}

The same code can be used for a pipeline module. But, it is necessary to both have the fucntions for the pipeline module and the functions for each of its process

For example, suppose that we use a pipeline 'PipeA' which uses two processes called 'Proc1' and 'Proc2'. Then, before launching the app, the following functions may be loaded in R:

An error will occur if one of these functions are missing.

Globally, it respects the rules explained in the section xxx because each level follow the same logics: they only differ with the nature of each of their steps. So, for a general overview of the behaviour, please go to the section xxx.

The main difference resides in the memory managment of intermediate datasets. In a simple process, a temporary variable stores the result of each step, erasing the previous one. Thus, having a history within the process module is not possible.

In the complex workflow, the managment of results from each step is different. Each steps's result is stored in memory. This strategy consumes more memory but offer a more flexibility to xxx.

However, there are some little differences and additional features when working with two or more levels: the 'remote commands'.

Navigate through processes \label{composedWFNavigate}

The same rules of navigation as (xxx) apply here.

Validate a process \label{composedWFValidate}

The validation here differs from the simple workflow. There is no specific 'Save' button: the button used to validate a workflow is implemented in the last step of a simple workflow. Its validation leads to automatically xxx

In the latter case,

Here, each time a steps is validated (with its own interface),

Resetting \label{composedWFReset}

A single process

The entire pipeline

As for any workflow in Magellan, a 'Reset' button allows to set the current step back to its default values. The effects of this action are the same as those described in section xxx for a single process. The difference here is that a process is a workflow itself, which means that resetting a step in a complex workflow also implies resetting a basic workflow. In such a case, it is like a 'remote reset' for the lower level workflow: it is reseted not by clicking on its own 'Reset' button but via the 'Reset' button of another workflow. However, the effects are the same.

The other difference resides in the strategy implemented in Magellan. When reseting the current step, all further steps are also (and automatically) reseted. This guarantees that the relative order to execution of processes in the workflow is correct. This feature is possible with complex workflows because the module keeps in memory the datasets generated by each step, which is not the case with simple workflows (see section xxx)

The 'remote reset' action come with the click on the Reset button on a higher level of workflow. Contrarly to the basic level, it does not reset all the workflow but only the current step. As for the simple workflow, this resets the entire interface by setting widgets to their default values. Moreover, the dataset in this step is emptied. Automatically, if there is a dataset ready to sent to this step, it is done.

This action has two effects:

The other effect is that it also resets all the underneath workflow: all its steps are reseted, widgets sets to their default values and it go to its start state.

Save the pipeline

xxxx



samWieczorek/Magellan documentation built on March 30, 2022, 3:40 a.m.