BoF: Analyzing Parallel I/O
Abstract
Parallel I/O performance can be a critical bottleneck for applications, yet users are often ill-equipped to identify and diagnose I/O performance issues. The increasingly complex hierarchies of storage hardware and software deployed on many systems only compound this problem. Tools that can effectively capture, analyze, and tune I/O behavior for these systems empower users to realize performance gains for many applications.
In this BoF, we form a community around best practices in analyzing parallel I/O and cover recent advances to help address the problem presented above, drawing on the expertise of users, I/O researchers, and administrators in attendance.
The primary objectives of this BoF are to: 1) highlight recent advances in tools and techniques for monitoring I/O activity in data centers, 2) discuss experiences with and limitations of current approaches, and 3) discuss and derive a roadmap for future I/O tools with the goal of capturing, assessing, predicting, and optimizing I/O.
The BoF is held in conjunction with the Supercomputing conference. The official announcement is listed here.
Date | Thursday, 21 November 2024
Time | 12:15pm - 1:15pm EST
Venue | Room B313B-B314
The BoF is powered by the Virtual Institute for I/O and DECICE.
Organization
The BoF is organized by:
- Shane Snyder (ANL, USA), ssnyder@mcs.anl.gov
- Jean Luca Bez (Lawrence Berkeley Lab, USA), jlbez@lbl.gov
- Julian Kunkel (Georg-August-Universität Göttingen/GWDG), julian.kunkel@gwdg.de
Agenda
The agenda is a series of short talks (8 minutes each), followed by a longer discussion.
- Welcome – Shane Snyder (Slides)
- Leveraging AI: Large Language Models in HPC I/O Optimization – Dong Dai (Slides)
- Analyzing I/O in Deep Learning Workloads – Hari Devarajan (LLNL) (Slides)
- Scaling Performance Analysis Tools to Hundreds of Lustre Filesystems – Ellis Wilson (Microsoft) (Slides)
- Data analysis news from GWDG and analysis tasks in the Scalable Storage Competition – Christian Boehme (GWDG) (Slides)