Abstract

We introduce the concept of Canonical Workflow Building Blocks (CWBB), a methodology of describing and wrapping computational tools, in order for them to be utilised in a reproducible manner from multiple workflow languages and execution platforms.

The concept is implemented and demonstrated with the BioExcel Building Blocks library (BioBB), a collection of tool wrappers in the field of computational biomolecular simulation. Interoperability across different workflow languages is showcased through a protein Molecular Dynamics setup transversal workflow, built using this library and run with 5 different Workflow Manager Systems (WfMS).

We argue such practice is a necessary requirement for FAIR Computational Workflows and an element of Canonical Workflow Frameworks for Research (CWFR) in order to improve widespread adoption and reuse of computational methods across workflow language barriers.

[maxbutton id=”4″ url=”https://doi.org/10.1162/dint_a_00135″ text=”Read more” linktitle=”Data Intelligence: Making Canonical Workflow Building Blocks interopreable across workflow languages” ]

Citation

Stian Soiland-Reyes, Genís Bayarri, Pau Andrio, Robin Long, Douglas Lowe, Ania Niewielska, Adam Hospital, Paul Groth (2022):
Making Canonical Workflow Building Blocks interoperable across workflow languages.
Data Intelligence 4(2)
https://doi.org/10.1162/dint_a_00135

The same workflow shown in Galaxy, Knime, CWL, Jupyter and PyCOMPSs

Protein MD Setup transversal workflow. Assembled in with 5 different workflow managers using BioBB canonical building blocks. From top-left: Galaxy, KNIME, CWL, Jupyter Notebook, PyCOMPSs.