As in all scientific disciplines, research data in mathematics has become vast; it is complex and multifaceted, and, through the successful application of mathematics in interdisciplinary research, it is widespread across the scientific landscape. It ranges from information bases such as standard reference data for special functions, tables, and similar mathematical objects to highly complex...
Within the collaborative research center "CRC 1456 - Mathematics of Experiment" of the German Research Foundation (DFG), several research groups from the natural sciences and mathematics jointly work on measurements and on extracting the most information from them. These measurement data come from different types of measurements, ranging from nanoscale imaging to observations of the Sun. As the...
Numerical algorithms and computational tools are essential for managing and analyzing complex data-processing tasks. With increasing metadata awareness and parameter-driven simulations, the demand for reliable, automated workflows that reproduce computational experiments across platforms has grown.
In general, computational workflows describe the complex multi-step methods that are used...
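The idea of such a multi-step, parameter-driven workflow can be illustrated with a minimal sketch. The step names and parameter values below are invented for this example; real workflow systems add provenance tracking, caching, and cross-platform execution on top of this basic pattern.

```python
"""Minimal sketch of a parameter-driven, multi-step workflow.
All step names and parameters are invented for illustration."""

def preprocess(data, scale):
    # Step 1: scale the raw measurements.
    return [x * scale for x in data]

def analyze(data, threshold):
    # Step 2: keep only values above a threshold.
    return [x for x in data if x > threshold]

def run_workflow(raw, params):
    # Execute the steps in order and record the parameters used,
    # so the run can be reproduced later from the log alone.
    log = {"params": params, "steps": []}
    data = preprocess(raw, params["scale"])
    log["steps"].append("preprocess")
    data = analyze(data, params["threshold"])
    log["steps"].append("analyze")
    return data, log

result, log = run_workflow([1, 2, 3, 4], {"scale": 2.0, "threshold": 4.0})
print(result)        # -> [6.0, 8.0]
print(log["steps"])  # provenance: which steps ran, with which parameters
```

Keeping the parameters in one explicit dictionary, rather than scattered across scripts, is what makes the run reproducible from its log.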
Dark data is data that is poorly managed [1, 2]. It is diametrically opposed to FAIR data because its epistemic status is unclear, and it is neither findable, accessible, interoperable, nor reusable. For example, research data may be uncurated, unavailable, unannotated, biased, or incomplete. Examples of dark data in scientific computing include the vast amounts of data that are held...
Computer experiments are becoming an essential part of pure mathematics, in fields such as combinatorics, commutative algebra, and algebraic geometry. We discuss the challenges that arise and the work of MaRDI's task area on computer algebra.
It is often difficult to reproduce computational experiments from papers due to a lack of detail in how such experiments are documented. Even when researchers publish their code alongside a paper, key information is often not well documented: *What version of an external software library was used? What value should be given to an undocumented model parameter? Which specific version of the...
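One way to capture the kind of information this abstract lists is a small machine-readable manifest published with the code. The sketch below is a hedged illustration only: the manifest layout, field names, and the pinned library version are invented for this example, not any standard format.

```python
"""Sketch of recording library versions, parameter values, and the
runtime environment alongside an experiment. The manifest layout is
an invented example, not a standard."""

import json
import platform
import sys

def experiment_manifest(parameters, libraries):
    # Collect enough context that someone else can re-run the experiment.
    return {
        "python_version": sys.version.split()[0],
        "platform": platform.platform(),
        "libraries": libraries,    # pinned versions of external dependencies
        "parameters": parameters,  # every model parameter, even "obvious" ones
    }

manifest = experiment_manifest(
    parameters={"seed": 42, "tolerance": 1e-8},
    libraries={"numpy": "1.26.4"},  # hypothetical pinned dependency
)
print(json.dumps(manifest, indent=2, sort_keys=True))
```

Shipping such a file with the code answers the questions above without relying on the paper's prose.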
Ontologies store semantic knowledge in a machine-readable way and represent domain knowledge in a controlled vocabulary. Scientific results are often published in text form, which hinders research data FAIRness. Using natural language processing (NLP), concept names and relations can be extracted from text datasets.
A workflow for processing scientific text corpora is introduced...
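As a deliberately simple illustration of the extraction idea, the sketch below uses a regular expression as a stand-in for a real NLP model: runs of capitalized words are treated as candidate concept names. Both the pattern and the sample sentence are invented for this example.

```python
"""Toy illustration of extracting candidate concept names from text.
A regex heuristic stands in for a trained NLP model."""

import re

def extract_concepts(text):
    # Heuristic: two or more consecutive capitalized words form a candidate.
    pattern = r"\b(?:[A-Z][a-z]+)(?:\s+[A-Z][a-z]+)+\b"
    return re.findall(pattern, text)

sample = ("Ontologies such as the Mathematics Subject Classification "
          "organize domain knowledge; Natural Language Processing can "
          "suggest new concept entries from text.")
print(extract_concepts(sample))
# -> ['Mathematics Subject Classification', 'Natural Language Processing']
```

Real pipelines replace the regex with trained models, but the output shape (candidate concept names for an ontology) is the same.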
We consider graph modeling for a knowledge graph for vehicle development, with a focus on crash safety. We provide an organized schema that incorporates information from various structured and unstructured data sources and covers the relevant concepts of the domain. In particular, we propose semantics for crash computer-aided engineering (CAE) data, which enables searchability,...
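The schema-plus-searchability idea can be sketched with a minimal triple store. The concept and relation names below are invented illustrations, not the schema actually proposed for the crash-safety knowledge graph.

```python
"""Minimal sketch of a triple-based knowledge graph with pattern queries.
All entity and relation names are invented examples."""

triples = [
    ("CrashSimulation_01", "usesModel", "VehicleModel_A"),
    ("CrashSimulation_01", "hasLoadCase", "FrontalImpact"),
    ("VehicleModel_A", "hasComponent", "Bumper"),
]

def find(graph, subject=None, predicate=None):
    # Pattern query over the triples: None acts as a wildcard.
    # This is the mechanism that makes the graph searchable.
    return [(s, p, o) for (s, p, o) in graph
            if (subject is None or s == subject)
            and (predicate is None or p == predicate)]

# All facts recorded about one simulation run:
print(find(triples, subject="CrashSimulation_01"))
```

Graph databases and SPARQL endpoints generalize exactly this query pattern to large, heterogeneous data sources.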
With research software relied upon heavily both in industry and in scientific simulations, research software sustainability is increasingly becoming a major concern. A necessary but not sufficient aspect of software sustainability is Continuous Integration and Benchmarking (CI/CB/Cx). In addition, software flexibility to support newer HPC hardware as well as modern,...
Machine learning research should be easily accessible and reusable. OpenML is an open platform for sharing datasets, algorithms, and experiments. mlr3 is an open-source collection of R packages providing a unified interface for machine learning in the R language. One of the projects in MaRDI task area 3 (statistics and machine learning) was the interface package mlr3oml, which allows for...
Given the complexity of the algorithms and software packages involved, reproducibility of numerical simulations is often difficult to achieve. This makes it harder to collaborate on research projects, since there can be a considerable ramp-up time for new project members before they are able to contribute to a joint code base. Julia is a modern, dynamic programming language designed for...
In my talk, I will present the library deal.II, an open-source software project aimed at the rapid development of simulation codes for partial differential equations based on the finite element method. The guiding principle of deal.II is to provide functions for the main building blocks of a solver that a user code can then combine and extend in an application-specific way. I will then give insight...
Emerging extreme-scale architectures with ever-higher performance potential provide developers of application codes, including multiphysics models and coupled simulation and data analytics, with unprecedented resources for larger simulations that achieve more accurate solutions than ever before. Achieving high performance on these new heterogeneous architectures requires...
[preCICE][1] is an open-source coupling software for partitioned multi-physics and multi-scale simulations. Thanks to the software's library approach (the simulations call the coupling) and its high-level API, only minimally invasive changes are required to prepare an existing (legacy) simulation software for coupling. Moreover, ready-to-use adapters for many popular simulation software...
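The partitioned-coupling idea, in which two separate solvers exchange interface values until they agree, can be illustrated with a toy fixed-point iteration. The scalar "solvers" below are invented stand-ins for real simulation codes, and this sketch does not use the preCICE API.

```python
"""Toy fixed-point iteration illustrating partitioned coupling:
two solvers exchange an interface value until it converges.
The scalar solvers are invented stand-ins, not real simulations."""

def solver_a(interface_value):
    # Toy "fluid" solver: responds to the other side's interface value.
    return 0.5 * interface_value + 1.0

def solver_b(interface_value):
    # Toy "structure" solver: responds to solver_a's result.
    return 0.5 * interface_value

def couple(tol=1e-10, max_iters=100):
    # Iterate until both solvers agree on the interface value.
    u = 0.0
    for _ in range(max_iters):
        f = solver_a(u)      # A computes with B's last interface value
        u_new = solver_b(f)  # B computes with A's result
        if abs(u_new - u) < tol:
            return u_new
        u = u_new
    return u

print(couple())  # converges to the fixed point u = 2/3
```

A coupling library takes over exactly this exchange-and-iterate loop, plus data mapping and communication, so that the two codes never need to know about each other directly.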
Convex hull computations are an essential part of many scientific calculations. We present an experiment, written in Julia, involving convex hull computations with two different number types, floats and rationals. A comparison of the results shows that using floats loses the combinatorial structure that the exact rational computation preserves.
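The effect can be reproduced with a toy 2D convex hull. The sketch below uses Python's fractions module rather than Julia, and the point set is invented: three points lie exactly on a line, and with exact rationals the middle one is recognized as redundant, while a float rounding error makes it look like a genuine hull vertex, changing the combinatorics of the result.

```python
"""Float vs. rational convex hulls (Andrew's monotone chain).
The point set is constructed so that a float rounding error in the
orientation test changes the number of hull vertices."""

from fractions import Fraction

def cross(o, a, b):
    # Orientation test: > 0 left turn, < 0 right turn, == 0 collinear.
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    # Andrew's monotone chain; cross <= 0 also discards collinear points.
    pts = sorted(points)
    hull = []
    for seq in (pts, pts[::-1]):   # build lower chain, then upper chain
        chain = []
        for p in seq:
            while len(chain) >= 2 and cross(chain[-2], chain[-1], p) <= 0:
                chain.pop()
            chain.append(p)
        hull += chain[:-1]         # chain endpoints are shared, drop one
    return hull

# (0, 0), (1, 0.3), (3, 0.9) are collinear; (1.5, 2) lies above the line.
float_pts = [(0.0, 0.0), (1.0, 0.3), (3.0, 0.9), (1.5, 2.0)]
exact_pts = [(Fraction(0), Fraction(0)),
             (Fraction(1), Fraction("0.3")),
             (Fraction(3), Fraction("0.9")),
             (Fraction("1.5"), Fraction(2))]

# In floats, 0.3 * 3 != 0.9, so the orientation test returns a tiny
# positive value and keeps the collinear point as a "vertex".
print(len(convex_hull(float_pts)))  # -> 4
print(len(convex_hull(exact_pts)))  # -> 3
```

The two hulls describe different combinatorial objects even though the input points are "the same", which is exactly the discrepancy the experiment exposes.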
Reproducible research results are vital for scientific quality assurance and for building a reliable foundation for sustainable research. The discussion of this issue accelerated when investigations of reproducibility showed that only few scientific publications across many research fields allow the published results to be reproduced. This reproducibility crisis is well known within the...