SHIWA Project
   HOME

TheInfoList



OR:

The SHIWA project (Sharing Interoperable Workflows for large-scale scientific simulations on Available DCIs) within
grid computing Grid computing is the use of widely distributed computer resources to reach a common goal. A computing grid can be thought of as a distributed system with non-interactive workloads that involve many files. Grid computing is distinguished from co ...
was a project led by the
LPDS The Laboratory of Parallel and Distributed Systems (LPDS), as a department of MTA SZTAKI, is a research laboratory in distributed grid and cloud technologies. LPDS is a founding member of the Hungarian Grid Competence Centre, the Hungarian Nation ...
(Laboratory of Parallel and Distributed Systems) of MTA Computer and Automation Research Institute. The project coordinator was Prof. Dr.
Peter Kacsuk Peter may refer to: People * List of people named Peter, a list of people and fictional characters with the given name * Peter (given name) ** Saint Peter (died 60s), apostle of Jesus, leader of the early Christian Church * Peter (surname), a sur ...
. It started on 1 July 2010 and lasted two years. SHIWA (project number 261585) was supported by a grant from the
European Commission The European Commission (EC) is the executive of the European Union (EU). It operates as a cabinet government, with 27 members of the Commission (informally known as "Commissioners") headed by a President. It includes an administrative body o ...
's FP7 INFRASTRUCTURES-2010-2 call under grant agreement n°261585. The SHIWA project developed and deployed the SHIWA Simulation Platform (SSP) to enable
infrastructure Infrastructure is the set of facilities and systems that serve a country, city, or other area, and encompasses the services and facilities necessary for its economy, households and firms to function. Infrastructure is composed of public and priv ...
and
workflow A workflow consists of an orchestrated and repeatable pattern of activity, enabled by the systematic organization of resources into processes that transform materials, provide services, or process information. It can be depicted as a sequence of ...
interoperability Interoperability is a characteristic of a product or system to work with other products or systems. While the term was initially defined for information technology or systems engineering services to allow for information exchange, a broader defi ...
at two levels: * coarse-grained interoperability, referring to the nesting of different workflow systems in order to achieve execution frameworks interoperability; and * fine-grained interoperability, referring to the transformation workflow representations in order to achieve workflows migration from one system to another. After the project ended the SHIWA Technology was overtaken by the ER-flow support action project to provide sustainability and to extend the user community base.


Background and motivations

Scientists of all disciplines have invested tremendous effort in the exploitation of
Distributed Computing A distributed system is a system whose components are located on different computer network, networked computers, which communicate and coordinate their actions by message passing, passing messages to one another from any system. Distributed com ...
Infrastructures (DCIs) for their ability to support compute-intensive in-silico experiments and virtual organisations. Many DCIs with large user communities have emerged during the last decade, such as the Distributed European Infrastructure for Supercomputing Applications (DEISA) iederberger and Mextorf 2005 EGEE Grid (Enabling Grids for e-Science) GEE n.d. the German D-Grid initiative (D-Grid) entzsch 2006 UK National Grid Service (NGS) GS n.d.and the North American TeraGrid (TG) eraGrid n.d. They are based on different
middleware Middleware is a type of computer software that provides services to software applications beyond those available from the operating system. It can be described as "software glue". Middleware makes it easier for software developers to implement co ...
stacks that provide an abstraction layer between computer resources and applications. For example, NGS and TeraGrid are built on the Globus Toolkit oster 2006 EGEE on gLite Lite n.d. DEISA relies on both the Globus Toolkit and Unicore rwin and Snelling 2002 while D-Grid is executed under gLite, the Globus Toolkit and
Unicore UNICORE (UNiform Interface to COmputing REsources) is a grid computing technology for resources such as supercomputers or cluster systems and information stored in databases. UNICORE was developed in two projects funded by the German ministry ...
. In
Europe Europe is a large peninsula conventionally considered a continent in its own right because of its great physical size and the weight of its history and traditions. Europe is also considered a Continent#Subcontinents, subcontinent of Eurasia ...
, this momentum is climaxing in 2010 with the emergence of the
European Grid Initiative European Grid Infrastructure (EGI) is a series of efforts to provide access to high-throughput computing resources across Europe using grid computing techniques. The EGI links centres in different European countries to support international res ...
(EGI) that will federate all major European organisations related to distributed-computing and
National Grid Initiatives National may refer to: Common uses * Nation or country ** Nationality – a ''national'' is a person who is subject to a nation, regardless of whether the person has full rights as a citizen Places in the United States * National, Maryland, ce ...
(NGIs). In its effort to create the next generation of pan-Europe DCI, EGI will face unprecedented challenges related to the heterogeneity of national grid infrastructures, resources and operating middleware. Production DCIs are commonly built on a large number of components, such as data resources,
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
catalogues, authentication and authorisation methods, and software repositories. Managing the execution of applications on DCIs is consequently a complex task. Moreover, solutions developed for one particular Grid are difficult to port to other infrastructures. In order to shield this complexity from researchers and to facilitate the design of in-silico experiments, workflow systems are widely used as a virtualisation layer on top of the underlying infrastructures. They have become essential to integrate expertise about both the application (user domain) and the DCI (infrastructure domain) in order to optimise and support research of the scientific computing community. In the current multi-DCI panorama, users need to access different infrastructures in order to enlarge and widen the variety of resources usable, as well as share and reuse domain specific resources. Interoperability among DCIs is hardly achieved at the middleware level though. SHIWA considers the EGI production infrastructure a major DCI of great interest for the European Scientists to design and simulate experiments in-silico. It directly addresses the challenges related to (i) scientific experiments design through simulation workflows description and (ii) middleware heterogeneities encountered among the many existing DCIs through workflow interoperability techniques.


Concepts and project objectives

SHIWA aimed at improving the experience of Virtual Research Communities which are heavily using DCIs for their scientific experimentation. With the recent multiplication of efforts dedicated to
e-infrastructure United States federal research funders use the term cyberinfrastructure to describe research environments that support advanced data acquisition, data storage, data management, data integration, data mining, data visualization and other computin ...
s, scientific simulation can now benefit from the availability of massive computing and data storage facilities to sustain multi-disciplinary scientific challenges. As a side effect, a variety of non-interoperable technologies coexist to enable the exploitation of computing infrastructures for in-silico experiments. In Europe, this momentum is climaxing with the emergence of the EGI that will federate all major European organisations related to distributed computing and NGIs. Consequently, European research on simulation is currently hampered by several interoperability issues that reduce its efficiency by limiting knowledge and expertise sharing among scientific communities. SHIWA was designed as a user-centred project aiming at lowering barriers among scientific communities by providing services tackling interoperability issues. In particular, SHIWA' work program focuses on improving the efficiency of workflow-based in-silico experiments by targeting the following three objectives: * Objective 1: develop workflows and expertise sharing among Virtual Research Communities. * Objective 2: enable cross-system management of simulation workflows in
Scientific Gateways Science is a systematic endeavor that builds and organizes knowledge in the form of testable explanations and predictions about the universe. Science may be as old as the human species, and some of the earliest archeological evidence for ...
. * Objective 3: support Virtual Research Communities in their design and realisation of in-silico experiments. * Objective 4: improve interoperability among DCIs. * Objective 5: simplify access to multiple DCIs for Virtual Research Communities. * Objective 6: promote the use of European e-Infrastructure among simulation users from various disciplines.


Workflows interoperability

Workflow interoperability enables the execution of workflows of different workflow systems that may span multiple heterogeneous infrastructures (DCIs). It can facilitate application migration due to infrastructure, services and workflow system evolution. Workflow interoperability allows workflow sharing to support and foster the adoption of common research methodologies, improve efficiency and reliability of research by reusing these common methodologies, increase the lifetime of workflows and reduction of development time for new workflows. Interoperability among workflow systems not only permit the development and enactment of large-scale and comprehensive workflows, but also reduces the existing gap between different DCIs, and consequently promotes cooperation among research communities exploiting these DCIs. As workflow systems enable researchers to build comprehensive workflow applications for DCIs, the project consortium identified workflow interoperability as the most promising approach to bridge the existing gaps among DCIs. Workflow and DCI interoperability is of paramount importance to advance the quality and impact of scientific applications that target DCIs which enables advanced features previously not available:{{Citation needed, date=October 2022 * Enabling exploitation of specific features of workflow systems considering applications’ requirements and DCIs’ capabilities. * Sharing workflows published by research communities to support collaboration, reuse of validated methodologies and knowledge transfer. * Running workflow applications on multiple heterogeneous DCIs. * Facilitating workflow-based application migration maintenance. * Optimising experiments by using the most appropriate workflow system and/or DCIs. SHIWA developed workflow interoperability solutions for several workflow systems, namely
ASKALON Ashkelon or Ashqelon (; Hebrew language, Hebrew: , , ; Philistine language, Philistine: ), also known as Ascalon (; Ancient Greek: , ; Arabic: , ), is a coastal city in the Southern District (Israel), Southern District of Israel on the Medite ...
ahringer, et al. 2005 MOTEUR latard, et al. 2008
Pegasus Pegasus ( grc-gre, Πήγασος, Pḗgasos; la, Pegasus, Pegasos) is one of the best known creatures in Greek mythology. He is a winged divine stallion usually depicted as pure white in color. He was sired by Poseidon, in his role as hor ...
eelman 2005 PGRADE acsuk, et al. 2003 Galaxy, GWES, Kepler, LONI Pipeline, Taverna, ProActive and Triana ajithia et al. 2004 In so doing, it will provide access to Grids built on gLite and Globus middleware to create production-level services to run workflow-based large-scale simulations. The targeted middleware and workflow systems are depicted by components with bold borders in Figure 1.1.1. The project will use existing Grid middleware interoperability solutions enabling access to gLite and Globus based Grids such as the Austrian Grid, D-Grid, EGEE and NGS. The project consortium will also consider support for the EMI-supported Nordugrid Advanced Resource Connector (ARC) .Ellert 2007and Unicore


Project Partners

*
Institute for Computer Science and Control The Institute for Computer Science and Control (in short SZTAKI, hu, Számítástechnikai és Automatizálási Kutatóintézet) is a Hungarian research institute in Budapest, founded in 1964. Scope Its primary tasks include basic and applicat ...
( SZTAKI) of the
Hungarian Academy of Sciences The Hungarian Academy of Sciences ( hu, Magyar Tudományos Akadémia, MTA) is the most important and prestigious learned society of Hungary. Its seat is at the bank of the Danube in Budapest, between Széchenyi rakpart and Akadémia utca. Its ma ...
* University of Innsbruck * Charité - Universitätsmedizin Berlin * French National Centre for Scientific Research * University of Westminster * Cardiff University * Academic Medical Centre of the University of Amsterdam * University of Southern California


Subcontractors

* ActiveEon SAS * MAAT France * Correlation Systems Ltd * ETH Zurich, Institute of Molecular Systems Biology * National Research Council, Institute for Biomedical Technologies


References

* http://www.erflow.eu/about


Official webpage

* http://www.erflow.eu/


External links


MTA SZTAKI Laboratory of Parallel and Distributed Systems
Grid computing Information technology organizations based in Europe