Seminar: Newest Trends in High-Performance Data Analytics

High-Performance Data Analytics is a means of extracting findings from large data sets. It is an indispensable tool in science and business, and a rapidly changing field. As part of this seminar, you will create a presentation and report, in German or English, on a selected hot topic. You will learn to research literature and may conduct small experiments to provide a holistic view of the selected topic. You will meet regularly with an assigned supervisor and work towards the presentation and report.

Contact: Julian Kunkel, Jonathan Decker
Location: Virtual in BBB
Time: Thursday 16-18; first meeting: 2022-04-21
Language: English or German (individual presentation)
Module: M.Inf.1237: Seminar Neueste Trends in High-Performance Data Analytics
SWS: 2
Credits: 5
Contact time: 28 hours
Independent study: 122 hours

Module description

As part of this seminar, you will create a presentation (and report), in German or English (your choice!), on a research topic. To this end, you will meet regularly with an assigned supervisor and work towards the presentation and report.

This seminar is also available as a pro-seminar. In the pro-seminar, the focus is on learning presentation techniques, whereas in the seminar your focus must be on presenting scientific facts and leading a scientific discussion. There are also two additional sessions that are mandatory for pro-seminar attendees (optional for seminar attendees).

The presentation time is 35 minutes (plus discussion). A short report accompanying the slides is expected (at most 15 pages).

The students will be able to

  • Appraise research in the area of high-performance data analytics
  • Compose a presentation covering their selected topic in depth
  • Evaluate findings (tools or theory) of other researchers
  • Explain theory and application covering their topic

This is the list of topics that we will assign to students during the first meeting. You will have some room for developing the topic in the direction of your choice. Feel free to propose your own great topic.

  • GPU Computing with Triton
  • Seagate CORTX storage system
  • FPGA Computing with SciEngine
  • RISC-V: State of the union
  • Regression Testing for HPC
  • Global Optimization (of Clusters) with Genetic Algorithms
  • Julia Programming Language
  • Rust Programming for HPC Applications
  • KPI4DC - Key performance indicators for data centers
  • The HPC Community (for pro-seminar)
  • Benchmarking of HPC Systems
  • History and Development of System Architectures
  • Security in Cloud and HPC
  • DevOps strategies in HPC
  • InfiniBand DPU
  • oneAPI for heterogeneous computing (CPU, GPU, FPGA)
  • Convergence of HPC and High-Performance Data Analytics
  • Using Data Analytics in HPC Applications
  • GPU Computing with Python
  • What's new with Spark 3
  • What's new with TensorFlow
  • Development in data lakes and data warehousing
  • Trends in edge computing
  • Key-value stores for HPDA
  • Object storage systems
  • HPDA Benchmarks
  • Using R for HPDA

The examination consists of the presentation (50% of the mark) and the report (50%). For pro-seminars the focus lies on effective presentation, while for seminars it lies on the depth of the scientific topic (the marking schemes differ slightly).

  • 2022-04-21 Preliminary discussion / Vorbesprechung – Julian Kunkel
    Slides
    If you cannot attend, contact us as soon as possible!
    • Short introduction to the topics of the seminar.
    • Organizational matters: How to get good marks.
    • Assignment of topics to the participants on a first-come-first-served basis.
    • Talk: Professional presentation
  • 2022-05-05 How to create professional presentations and reports? – Julian Kunkel, Jonathan Decker
    This session is mandatory for pro-seminar attendees.
  • 2022-05-12
  • 2022-05-19
    • Real-Time data analysis in education – Lorenz Glißmann 1)
    • GPU Computing with Triton – Dimitris Oikonomou 2)
  • 2022-06-02
  • 2022-06-09
    • RISC-V: State of the union – Ilia Kurin 3)
  • 2022-06-16
    • GPU Computing with Python – Sören Metje 4)
    • oneAPI for heterogeneous computing – Vincenz Dumann 5)
  • 2022-06-23
    • Using R for HPDA – Celine Thorns 6)
    • Julia Programming Language – Anna Kahle 7)
  • 2022-06-30
    • Using Data Analytics in HPC Applications – Theint Hay Thi Maung 8)
    • KPI4DC - Key performance indicators for data centers – Tim van den Berg 9)
  • 2022-07-07
    • Security in Cloud and HPC – Nicolas Alqas-Alyas 10)
    • Rust Programming for HPC Applications – Singh 11)
  • 2022-07-14
    • What's new with Spark 3 – Abdul Rafay 12)
  • 2022-07-21
  • 2022-09-30 Deadline for the submission of the report

1) Supervisor: Julian Kunkel
2) Supervisor: Jonathan Decker
3) Supervisor: Christian Köhler
4) Supervisor: Tino Meiselt
5) Supervisor: Vanessa End
6) Supervisor: Hauke Gronenberg
7) Supervisor: Marcus Merz
8) Supervisor: Nils Kanning
9) Supervisor: Laura Endter
10) Supervisor: Tim Ehlers
11) Supervisor: Christian Boehme
12) Supervisor: Patrick Michaelis
  • Last modified: 2022-06-16 16:30 by Julian Kunkel