====== BoF: Analyzing Parallel I/O ======

===== Abstract =====

Parallel I/O performance can be a critical bottleneck for applications, yet users are often ill-equipped to identify and diagnose I/O performance issues. The increasingly complex hierarchies of storage hardware and software deployed on many systems only compound this problem. Tools that can effectively capture, analyze, and tune I/O behavior on these systems empower users to realize performance gains for many applications.

In this BoF, we form a community around best practices in analyzing parallel I/O and cover recent advances that help address the problem described above, drawing on the expertise of the users, I/O researchers, and administrators in attendance.

The primary objectives of this BoF are to: 1) highlight recent advances in tools and techniques for monitoring I/O activity in data centers, 2) discuss experiences with and limitations of current approaches, and 3) discuss and derive a roadmap for future I/O tools with the goal of capturing, assessing, predicting, and optimizing I/O.

The BoF is held in conjunction with the [[http://sc24.supercomputing.org/|Supercomputing conference]]. The official announcement is listed [[https://sc24.conference-program.com/presentation/?id=bof141&sess=sess640|here]].

|| Date || Thursday, 21 November 2024 ||
|| Time || 12:15pm - 1:15pm EST ||
|| Venue || Room B313B-B314 ||

The BoF is powered by the [[https://www.vi4io.org|Virtual Institute for I/O]] and [[http://www.decice.eu|DECICE]] ((The DECICE project received funding from the European Union's Horizon 2022 research and innovation programme under grant agreement No 101092582)).

{{:events:2017:vi4io.png?200&nolink|}} {{:research:projects:decice:decice-logo.png?200&nolink|}}

===== Organization =====

The BoF is organized by

  * Shane Snyder (ANL, USA), [[ssnyder@mcs.anl.gov]]
  * Jean Luca Bez (Lawrence Berkeley Lab, USA), [[jlbez@lbl.gov]]
  * [[about:people:julian_kunkel|Julian Kunkel]] (Georg-August-Universität Göttingen/GWDG), [[julian.kunkel@gwdg.de]]

===== Agenda =====

The session consists of a series of short (8-minute) talks followed by a longer discussion.

  * **Welcome** -- //Shane Snyder// \\ Slides
  * **Leveraging AI: Large Language Models in HPC I/O Optimization** -- //Dong Dai// \\ Slides
  * **Analyzing I/O in Deep Learning Workloads** -- //Hari Devarajan// (LLNL) \\ Slides
  * **Scaling Performance Analysis Tools to Hundreds of Lustre Filesystems** -- //Ellis Wilson// (Microsoft) \\ Slides
  * **Data analysis news from GWDG and analysis tasks in the Scalable Storage Competition** -- //Christian Boehme// (GWDG) \\ Slides