Seminar: Newest Trends in High-Performance Data Analytics

High-Performance Data Analytics is a vehicle to extract findings from large data sets. It is an indispensable tool in science and business but a rapidly changing field. As part of this seminar, you will create a presentation and report revolving around a selected hot topic in German or English. You will learn to research literature and may conduct small experiments to provide a holistic view of the selected topic. You will meet regularly with an assigned supervisor and work towards the presentation and report.

Contact Julian Kunkel, Jonathan Decker
Location Virtual
Time Thursday 16:15-17:45
Language English or German (individual presentation)
Module M.Inf.1237: Seminar Neueste Trends in High-Performance Data Analytics
SWS 2
Credits 5
Contact time 28 hours
Independent study 122 hours

As part of this seminar, you will create a presentation (and report) revolving around a research topic in German or English (your choice!). Therefore, you will meet regularly with an assigned supervisor and work towards the presentation and report.

This seminar is also available as a pro-seminar. As pro-seminar, the focus will be on learning presentation techniques while in the seminar your focus must be on presenting scientific facts and leading a scientific discussion. There are also two additional mandatory sessions for pro-seminar attendees (optional for seminar attendees).

The presentation time is 35 minutes (plus discussion). A short report accompanying the slides is expected (max 15 pages).

Please note that we plan to record sessions (lectures and seminar talks) with the intent of providing the recordings via BBB to other students but also to publish and link the recordings on YouTube for future terms. If you appear in any of the recordings via voice, camera or screen share, we need your consent to publish the recordings. See also this Slide.

  • Appraise research in the area of high-performance data analytics
  • Compose a presentation covering their selected topic in depth
  • Evaluate findings (tools or theory) of other researchers
  • Explain theory and application covering their topic

This is the list of topics that we will assign to students during the first meeting. You will have some room for developing the topic in the direction of your choice. Feel free to propose your own great topic.

  • In-Network Computing with the Kalray DPU - Security, Performance, …
  • Emerging trends in parallel file systems
  • Specific file systems, e.g., SeaweedFS, Ceph
  • Emerging trends in cloud storage
  • Efficient Workflow management
  • Kubernetes for HPC; Case Studies, e.g. CERN
  • Security in Cloud and HPC
  • Using the ARM Architecture for performant HPC
  • Modern Benchmarking Strategies of HPC Systems
  • RISC-V: State of the Union
  • Regression Testing for HPC
  • In-Network Computing and the NVIDIA DPU
  • Development in data lakes and data warehousing
  • Machine learning performance and behavior of HPC storage systems
  • Quantum neural networks: Libraries and applications
  • Computational Performance Characterization of GPU-accelerated Image Analysis
  • Using R for HPDA
  • RUST Programming for HPC application
  • Efficient streaming (with NetFlix)

The exam is conducted as part of the presentation (50% of the mark) and report (50%). The focus for pro-seminars lies in the effective presentation while the focus for seminars is the depth of the scientific topic (slightly different marking schemes).

  • 26.10.23 Preliminary discussion / VorbesprechungJulian Kunkel, Jonathan Decker
    If you cannot attend contact us asap!
    • Short introduction to the topics of the seminar.
    • Organizational matters: How to get good marks.
    • Assignment of topics to the participants on a first-come-first-served basis.
    • Talk: Professional presentation Slides
  • 02.11.23 How to create professional presentations and reports?Julian Kunkel, Jonathan Decker
    This session is mandatory for seminar attendees.
    • Introducing our report template and usage (very quick intro to LaTeX) Slides
    • Discussion of existing reports and presentations individually and in the group
  • 09.11.23
  • 16.11.23
  • 23.11.23 Student presentations
  • 30.11.23 Student presentations
  • 07.12.23 Student presentations
  • 14.12.23 Student presentations
    • Sheila Navarro - Machine learning performance and behavior of HPC storage systems
    • Tasmia Arooj - Development in data lakes and data warehousing
  • 21.12.23 Student presentations
    • Pratham Shrivastava - Security in Cloud and HPC
    • Michael Hubert Duah - Security in Cloud and HPC
  • 11.01.24 Student presentations
    • Johann Eilts - Emerging trends in cloud storage
    • Abdelllah Omar Adolf - Emerging trends in parallel file systems
  • 18.01.24 Student presentations
    • Zhuojing Huang - Quantum neural networks: Libraries and applications
    • Carolin Lafeld - Efficient Workflow management
  • 25.01.24 Student presentations
    • Christopher Lee Lübbers - Using R for HPDA
    • Ayan Gupta - Computational Performance Characterization of GPU-accelerated Image Analysis
  • 01.02.24 Student presentations
    • Nils Rosenboom - Modern Benchmarking Strategies of HPC Systems
    • Tasmia Arooj - Development in data lakes and data warehousing
  • 08.02.24 Student presentations
    • Abdelllah Omar Adolf - Emerging trends in parallel file systems
  • 31.03.23 Deadline for the submission of the report
Student Supervisor Topic
Sheila Navarro Patrick Höhn Machine learning performance and behavior of HPC storage systems
Tasmia Arooj Giorgi Mamulashvili Development in data lakes and data warehousing
Pratham Shrivastava Trevor Khwam Security in Cloud and HPC
Michael Hubert Duah Trevor Khwam Security in Cloud and HPC
Johann Eilts Chirag Mandal Emerging trends in cloud storage
Abdelllah Omar Adolf Patrick Höhn Emerging trends in parallel file systems
Zhuojing Huang Christian Boehme Quantum neural networks: Libraries and applications
Carolin Lafeld Aasish Kumar Sharma Efficient Workflow management
Christopher Lee Lübbers Matthias Eulert Using R for HPDA
Ayan Gupta Michael B. Khani Computational Performance Characterization of GPU-accelerated Image Analysis
Nils Rosenboom Narges Lux Modern Benchmarking Strategies of HPC Systems
  • Impressum
  • Privacy
  • teaching/autumn_term_2023/nthpda.txt
  • Last modified: 2024-01-12 11:14
  • by Jonathan Decker