Seminar with Practical: Scalable Computing Systems and Applications in AI, Big Data and HPC

Contact Julian Kunkel, Jonathan Decker
Location Virtual
Time Thursday 14:15-15:45
Language English or German (individual presentation)
Module M.Inf.1238: Scalable Computing Systems and Applications in AI, BigData and HPC
SWS 3
Credits 5
Contact time 42 hours
Independent study 108 hours

As part of this seminar, you will create a presentation, work on a small-scale practical project and write a report revolving around a research topic in German or English (your choice!). Therefore, you will meet regularly with an assigned supervisor and work towards the presentation, practical project and report. You will first select a topic and a use case related to the overall topic of the course. Then, during the term you will prepare a presentation to introduce the topic and the state of the art. Next, you will realize a small-scale project by practically working on your topic. This includes evaluating performance and scalability, as well as analyzing and quantifying the contribution of your topic or tool. Finally, you present your results in another presentation.

The presentation time is 25 minutes (plus discussion) for each presentation. A short report describing your work in the practical project is expected (max 15 pages).

Please note that we plan to record sessions (lectures and seminar talks) with the intent of providing the recordings via BBB to other students but also to publish and link the recordings on YouTube for future terms. If you appear in any of the recordings via voice, camera or screen share, we need your consent to publish the recordings. See also this Slide.

  • No skills/knowledge is required
  • Understanding of Linux basics and having used Linux before and being able to operate a Bash shell is beneficial
  • Describe approaches for the development of scalable systems and applications
  • Sketch efficient algorithms and concepts
  • Analyze and summarize state-of-the-art concepts, tools and research papers
  • Deliver a technical presentation for a professional audience
  • Explore and apply concepts or tools to improve scalability for a selected use case
  • Quantify efficiency and scalability of selected use cases

This is the list of topics that we will assign to students during the first meeting. You will have some room for developing the topic into the direction of your choice. Feel free to propose your own great topic.

  • Understanding GPU performance e.g. using MLCommons ML Benchmarks
  • Usage of data lakes and/or data warehouses
  • The compute continuum - IoT, edge and HPC computing
  • Use cases for integration of edge and IoT with HPC simulations
  • AI and HPDA use cases for critical infrastructure from the medical and energy domains
  • Data management concepts in HPC - potential of data lakes and data warehousing
  • Scalable quantum computer simulation on HPC systems
  • Seagate CORTX storage system
  • FPGA Computing with SciEngine
  • RISC-V: State of the union
  • Regression Testing for HPC
  • Global Optimization (of Clusters) with Genetic Algorithms
  • Julia Programming Language for deep learning
  • RUST Programming for HPC application
  • Sustainability for data centers
  • The HPC Community
  • Benchmarking of HPC Systems
  • History and Development of System Architectures
  • Security in Cloud and HPC
  • DevOps strategies in HPC
  • Infiniband DPU
  • Convergence of HPC and High-Performance Data Analytics
  • Using Data Analytics in HPC Applications
  • GPU Computing with Python
  • Parallelization with Dask + Xarray
  • What's new in the Kubernetes ecosystem (SEDNA, Volcana, …)
  • What's new with Spark
  • What's new with Pytorch/Tensorflow
  • Containers in HPC
  • Webassembly for Function-as-a-service
  • Function-as-a-service in HPC
  • Key-value stores for HPDA
  • Object storage systems
  • HPDA Benchmarks
  • Performance Analysis using Scalasca and Vampir
  • Data Streaming and Workflows using Apache Airflow

The exam is conducted as part of the final presentation (30% of the mark) and the report (70%). The (ungraded) project topic presentation should cover 15 min, the final presentation should cover 25 min and the report should be 10 to 15 pages (not counting cover, toc, appendix).

  • 11.04.24 Introduction & Scientific Presentation – Julian Kunkel, Jonathan Decker
    If you cannot attend contact us asap!
  • 18.04.24 LaTeX Crash Course & Scientific WritingJulian Kunkel, Jonathan Decker
  • 19.04.24 You have submitted your selected topic by email to jonathan.decker@uni-goettingen.de
  • 25.04.24 Effective Literature Search & Discussion of example reportsJulian Kunkel, Jonathan Decker
    • Talk: Effective Literature Search Slides
    • Discussion of example reports from previous semesters
  • 26.04.24 You have been assigned a supervisor and presentation date
  • 02.05.24 Extra presentation
    • Claas Kochanke - Performance Analysis using Scalasca and Vampir
  • 16.05.24 Student topic presentation
    • Constantin Dalinghaus - What's new with Pytorch
    • Asmus Barth - Seagate CORTX storage system
  • 23.05.24 Student topic presentation
    • Nikita Holstein - AI and HPDA use cases for critical infrastructure from the medical and energy domains
    • Davide Mattioli - Julia Programming Language for deep learning
  • 30.05.24 Student topic presentation
    • Emmanuel Tchoumkeu Ngatat - Parallelization with Dask + Xarray
    • Ughur Mammadzada - Scalable quantum computer simulation on HPC systems
    • Sadaf Shafi - GPU Computing with Python
  • 06.06.24 Student topic presentation
    • Henrik Jonathan Seeliger - RUST Programming for HPC application
    • Pranay Bhatia - What's new in the Kubernetes ecosystem (SEDNA, Volcana, …)
    • Yuvraj Singh - What is new in Tensorflow
  • 13.06.24 Student presentations
    • Constantin Dalinghaus - What's new with Pytorch
    • Davide Mattioli - Julia Programming Language for deep learning
  • 20.06.24 Student presentations
    • Asmus Barth - Seagate CORTX storage system
    • Nikita Holstein - AI and HPDA use cases for critical infrastructure from the medical and energy domains
  • 27.06.24 Student presentations
    • Emmanuel Tchoumkeu Ngatat - Parallelization with Dask + Xarray
    • Ughur Mammadzada - Scalable quantum computer simulation on HPC systems
  • 04.07.24 Student presentations
    • Sadaf Shafi - GPU Computing with Python
    • Henrik Jonathan Seeliger - RUST Programming for HPC application
  • 11.07.24 Student presentations
    • Pranay Bhatia - What's new in the Kubernetes ecosystem (SEDNA, Volcana, …)
    • Yuvraj Singh - What is new in Tensorflow
  • 30.09.24 Deadline for the submission of the report
Student Supervisor Topic
Yuvraj Singh Ali Doost Hosseini What is new in Tensorflow
Asmus Barth Patrick Höhn Seagate CORTX storage system
Constantin Dalinghaus Chirag Mandal What's new with Pytorch
Henrik Jonathan Seeliger Artur Wachtel RUST Programming for HPC application
Sadaf Shafi Michael B.Khani GPU Computing with Python
Ughur Mammadzada Tino Meisel Scalable quantum computer simulation on HPC systems
Nikita Holstein Sadgeh Keshtkar AI and HPDA use cases for critical infrastructure from the medical and energy domains
Davide Mattioli Tino Meisel Julia Programming Language for deep learning
Emmanuel Tchoumkeu Ngatat Patrick Höhn Parallelization with Dask + Xarray
Pranay Bhatia Jonathan Decker What's new in the Kubernetes ecosystem (SEDNA, Volcana, …)
  • Impressum
  • Privacy
  • teaching/summer_term_2024/scap.txt
  • Last modified: 2024-04-26 17:29
  • by 127.0.0.1