Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
events:2019:isc-ngi [2019-04-24 18:56]
Julian Kunkel
events:2019:isc-ngi [2019-07-19 17:19] (current)
Julian Kunkel [Agenda]
Line 1: Line 1:
 ====== BoF: Data-Centric Computing for the Next Generation ====== ====== BoF: Data-Centric Computing for the Next Generation ======
- 
-**This event has been approved recently; this is preliminary information.** 
  
 The efficient, convenient, and robust execution of data-driven workflows and enhanced data management are key for productivity in scientific computing and computer-aided RD&E. Big data tools integrate compute and storage capabilities into a holistic solution demonstrating the benefit of tight integrating while the HPC community still optimizes the compute and storage components independently from each other, and, moreover, independently from the needs of end-to-end user workflows that ultimately lead to insight. Even within a single data center, utilizing homogeneous storage and compute infrastructure efficiently is complex for experts. The efficient management of data and compute capabilities in a heterogeneous environment,​ however, is an unresolved question as the execution of individual tasks from workflows may benefit from alternative hardware architectures and infrastructures. The efficient, convenient, and robust execution of data-driven workflows and enhanced data management are key for productivity in scientific computing and computer-aided RD&E. Big data tools integrate compute and storage capabilities into a holistic solution demonstrating the benefit of tight integrating while the HPC community still optimizes the compute and storage components independently from each other, and, moreover, independently from the needs of end-to-end user workflows that ultimately lead to insight. Even within a single data center, utilizing homogeneous storage and compute infrastructure efficiently is complex for experts. The efficient management of data and compute capabilities in a heterogeneous environment,​ however, is an unresolved question as the execution of individual tasks from workflows may benefit from alternative hardware architectures and infrastructures.
Line 34: Line 32:
 ====== Agenda ====== ====== Agenda ======
  
-To be announced +  * **Welcome** -- //Julian Kunkel (University of Reading)// 
- +  * **High-Level Workflows -- Potential for Innovation? Peeking at the current IO stack.** -- //Julian Kunkel (University of Reading)// \\ {{ :​events:​2019:​bof19-data-centric-kunkel.pdf |Slides}} 
- +  * **Changing Your Archive From a Black Hole to a Gold Mine** -- //Jay Lofstead (Sandia National Laboratories)//​ \\ {{ :​events:​2019:​events:​2019:​bof19-data-centric-lofstead-archive.pdf |Slides}} \\ Archives have been a crucial part of long term data storage ensuring that future users can refer back to previous work to check new results integrity or to deal with compliance and legal requirements. However, making these archives more useful than write-once, read-never systems has been challenging. Tape speeds, when compared to other storage media, have high latency making any interactive exploration painful. POSIX style attributes and extended attributes can only offer so much additional information. We are exploring offering another layer on top of the archive that can make archive item selection more efficient and effective turning your data black hole into a gold mine. 
- +  * **Approaches to Programming Extremely Heterogenous Memory Systems** -- //Jeffrey Vetter (ORNL)// 
 +  * **The goldilocks node: getting the RAM just right** -- //Julian Kunkel (on behalf of the collaboration))//​ \\ {{ :​events:​2019:​bof19-data-centric-kove.pdf |Slides}} 
 +  * **NGI initiative: toward a bridge in the semantic gap** -- //​Jean-Thomas Acquaviva (DDN)// \\ {{ :​events:​2019:​bof19-data-centric-acquaviva.pdf |Slides}} \\ Most of the recent innovations in the storage area have been driven by the cloud emergence. This had led to a race toward genericity and ultimately to the now ubiquitous object interface. While this has been highly beneficial for the community the absence of semantic is a drawback for the HPC ecosystem. The NGI initiative is an effort coming from the weather forecast community aiming at bringing *more* semantic in the data format. We advocate to bring application closer to data and pave the way for on situ computing data format should be richer and more specific. 
 +  * **The community can make the difference** -- //Julian Kunkel (University of Reading)// 
 +  * //​Discussion//​