Table of Contents

Summer School on Effective HPC for Climate and Weather

News: April 15th: The registration for the 2021 Summer School on Effective HPC for Climate and Weather is now open!

Aim and Scope

Due to COVID-19, we will organize the summer school as virtual event again.

Making effective use of HPC environments becomes increasingly challenging for PhD students and young researchers. As their primary intent is to generate insight, they often struggle with the technical nature of the tools and environments that enable their computer-aided research: computation, integration, and analysis of relevant data.

The scope of the summer school is the training of young researchers and software engineers in methods, tools, and theoretical knowledge to make effective use of HPC environments and generate insights.

Date 23-27 August 2021
Venue Virtual event
Contact Julian Kunkel (University of Göttingen, GWDG)
Communication Mailing List

While the school aims to prepare the attendees for large scale simulation runs and data processing, it does also cover a representative selection of modern concepts such as machine learning, domain-specific languages, containerisation, and analysis of climate/weather data using Python 1).

We will also provide an outlook of challenges and strategies for HPC for climate and weather. Additionally, we aim to foster networking among scientists bringing together users of specific models and tools and enabling them to exchange their knowledge.

A certificate of attendance will be provided to the attendees.

The summer school will also support the mission of the European Network for Earth System modelling (ENES).

The ESiWACE project funds this summer school.

Summer School Programme

The ESiWACE Summer School is structured along with topical sessions in the morning/afternoon.

A topical session typically consists of an academic lecture and it may contain hands-on/lab practicals, group work, and discussion. Experts in the respective field will organise each of these sessions.

The hands-on/lab practicals are recorded, and they will provide an introduction/walk-through to the topic. We will have dedicated slots to allow independent/self-paced learning, and participants can decide what and when they want to engage. Additional support will be provided by the Mailing List, in which participants may post questions and cooperate with other students and the organisers. The Q&A session on Friday will offer the participants the opportunity to contact the organiser of each hands-on session and ask questions regarding the pre-recorded material.

The hands-on tutorials/lab practicals work as follows:

The session will offer the participants the opportunity to contact the organisers of each hands-on session — to ask questions regarding the topic and particularly regarding the tutorial and exercises.

Detailed information will be announced to registered attendees at the beginning of August.

Topics

The topics covered in the summer school are as follows:

Extreme-Scale Computation

This session will introduce the concept of extreme-scale computing with an explanation of the trends in the computer architectures that provide the underlying computing power. In particular, the increasing use of parallelism and heterogeneity in these architectures will be discussed.

A high-level overview will then be given of the performance, portability and productivity (3P's) requirements that Weather and Climate models have in order to run successfully on these computer architectures. It will be shown how current approaches can struggle to meet all three of these requirements.

Lastly, a relatively new, Domain-Specific Language (DSL), approach to programming Weather and Climate models will be introduced with examples from two existing DSLs - DAWN and PSyclone. It will be shown that the DSL approach offers the possibility of supporting all three of the above requirements, by separating the implementation of the science code from its parallelisation and optimisation on the underlying computer architecture.

Learning Objectives

Parallel Programming in Practice

In this session, we will provide a global overview of how the main concepts of parallel programming are implemented in weather and climate codes. We will detail the different parallel programming models for distributed and shared memory systems and describe the resulting scalability of commonly-used algorithms implementing those models. Particular attention will be devoted to specific features that may inhibit the scaling and performance of weather and climate codes. This analysis will be done at the level of the code routine itself but also in the more general context of code coupling, the latter being a specific implementation of coarse grain parallelism.

Learning Objectives

Modern Storage

Learning Objectives

Input/Output and Middleware

Climate and weather research is typically data-intensive and applications must utilise input/output efficiently. Often, a user struggles to assess observed performance leading to superflux attempts to tune the application and optimise performance in a wrong layer of the stack. The content of this session is twofold. Firstly, we discuss storage layers focusing on the NetCDF middleware and provide a performance model that aids users to identify inefficient I/O. Secondly, we introduce the NetCDF Climate and Forecast (CF) conventions that are often used as a standard to exchange data.

Learning Objectives

Machine Learning

(1) Predicting weather and climate require modelling the Earth System – a huge system that consists of many individual components that show chaotic behaviour and for which conventional tools often struggle to provide satisfying results. (2) A huge amount of data of the Earth System is available from both observations and modelling. (3) Machine learning methods allow learning complex non-linear behaviour from data if enough data is available and to apply the learned tools efficiently on modern supercomputers. If you combine (1), (2) and (3), it is easy to see that there are a large number of potential application areas for machine learning in weather and climate science that are currently explored. However, whether these approaches will succeed is still unclear as there are also a number of challenges for the application of machine learning tools in weather predictions. This talk will provide an introduction to machine learning, outline how to apply machine learning in Earth System modelling, show examples for the application of machine learning throughout the weather and climate modelling workflow, and discuss the challenges that will need to be tackled.

Learning Objectives

ECMWF - Virtual Visit

Learn about the performance and specifications of the ECMWF High-Performance Computing Facilities, and the way this supercomputer is used for operations, storage and research by ECMWF and its 34 Member & Co-operating States. The presentation will include a video tour of the computing facilities currently located in our HQ in Reading and a preview of what the new data centre will look like when it opens in Bologna (Italy) next year.

Learn about ECMWF Forecasting products and activities. A member of the ECMWF Forecasting team will introduce you to the maps, charts and plots that are produced daily in the “Weather Room” for weather prediction and analysis.

High-Performance Data Analytics and Visualisation

Analysis and visualisation of scientific data, such as those in the field of climate and weather, requires solution capable of effectively and efficiently handling massive data. In this session, we will discuss some of the main challenges concerning scientific data management and in particular those related to data analytics and visualisation. Software solutions for high-performance data analytics and visualisation, as well as examples of applications of these systems for real use cases in the climate and weather domain, will be presented. The lab tutorial will provide a more practical introduction about some tools and modules for data analysis and how to apply these on climate data, as well as a walk-through of the VMI for the virtual lab.

Learning Objectives

Performance Analysis

Learning Objectives

Containers

This session will present an introduction to an end-to-end scientific computing workflow utilising Docker containers. Attendees will learn about the fundamentals of containerisation and the advantages it brings to scientific software. Participants will then familiarise with Docker technologies and tools, discovering how to manage and run containers on personal computers, and how to build applications of increasing complexity into portable container images. Particular emphasis will be given to software resources which enable highly-efficient scientific applications, like MPI libraries and the CUDA Toolkit. The final part of the lecture will briefly introduce HPC-focused tools capable of deploying containers on high-end computing systems starting from Docker images.

Learning Objectives

Agenda

We are currently preparing the agenda. The following shows the general (tentative) structure of the agenda.

The video playlist is available.

All times are in CEST.

Monday - Computing

Tuesday - Storage

Wednesday - Data Analytics

Thursday - Supporting Tools

Friday - Keynote and conclusions

Attendance

It is free to attend the virtual summer school, however, registration is required!

Important Dates

Registration

The registration is now closed.

Target Audience

The target audience for the summer school is Earth system scientists, including PhD students and young researchers, and software engineers in the domain. While each of our topics will be introduced, the attendees should have a basic understanding of:

For attendees without prior experience, the following links provide some references to cover significant aspects of the contents mentioned above.

Organisation

Organisers

Programme Committee

ESiWACE2

The Centre of Excellence in Simulation of Weather and Climate in Europe (ESiWACE) addresses world challenges pushing the limits of science. It benefits the broader community by providing services and training opportunities.

As part of the ESiWACE2 project, we are organising this summer school to bridge the gap between scientists and computational science and increase the effectiveness of young scientists. The main goal of this event is the training of representative scientists from different institutions with state-of-the-art concepts tailored to the domain, but that also stretches beyond climate and weather, allowing them to act as a multiplier and increase productivity overall.

ESiWACE is funded by the European Union's Horizon 2020 research and innovation programme under grant agreement No. 823988.

1)
A detailed list of topics can be found in the programme.