====== Seminar with Practical: Scalable Computing Systems and Applications in AI, Big Data and HPC ====== ===== Key information ===== || Contact || [[about:people:julian_kunkel|Julian Kunkel]], [[about:people:jonathan_decker|Jonathan Decker]] || || Location || [[https://meet.gwdg.de/b/jul-cal-vop-2bi|Virtual]] || || Time || Thursday 14:15-15:45 || || Language || English or German (individual presentation) || || Module || M.Inf.1238: Scalable Computing Systems and Applications in AI, BigData and HPC || || SWS || 3 || || Credits || 5 || || Contact time || 42 hours || || Independent study || 108 hours || As part of this seminar, you will create a presentation, work on a small-scale practical project and write a report revolving around a research topic in German or English (your choice!). Therefore, you will meet regularly with an assigned supervisor and work towards the presentation, practical project and report. You will first select a topic and a use case related to the overall topic of the course. Then, during the term you will prepare a presentation to introduce the topic and the state of the art. Next, you will realize a small-scale project by practically working on your topic. This includes evaluating performance and scalability, as well as analyzing and quantifying the contribution of your topic or tool. Finally, you present your results in another presentation. The presentation time is 25 minutes (plus discussion) for each presentation. A short report describing your work in the practical project is expected (max 15 pages). Please note that we plan to record sessions (lectures and seminar talks) with the intent of providing the recordings via BBB to other students but also to publish and link the recordings on YouTube for future terms. If you appear in any of the recordings via voice, camera or screen share, we need your consent to publish the recordings. See also this {{ :teaching:templates:dataprivacy_student_notice_slide.pdf |Slide}}. ==== Required Prior Knowledge ==== * No skills/knowledge is required * Understanding of Linux basics and having used Linux before and being able to operate a Bash shell is beneficial * We will provide a short crash course at the beginning of the course and link supplementary training material ===== Learning Objectives ===== * Describe approaches for the development of scalable systems and applications * Sketch efficient algorithms and concepts * Analyze and summarize state-of-the-art concepts, tools and research papers * Deliver a technical presentation for a professional audience * Explore and apply concepts or tools to improve scalability for a selected use case * Quantify efficiency and scalability of selected use cases ===== Topics ===== This is the list of topics that we will assign to students during the first meeting. You will have some room for developing the topic into the direction of your choice. Feel free to propose your own great topic. * Retrieval Augmented Generation (RAG) State-of-the-art and use cases * LLM Open Source Agents * LLM Benchmarking Frameworks and their limitations * LLM Inference Optimization techniques * LLM Compression and Quantization Techniques * LLM Trustworthiness and Fact Validation * Impact of GIL-less Cpython on performance and compatiblity * Parallelization of async Python with trio-parallel * Machine Learning for Predictive Maintenance on a HPC Single Node * Understanding GPU performance, e.g., using MLCommons ML Benchmarks * Confidential Computing (HPC/Cloud) * Python Performance Optimization leveraging Native Implementations (Numba/CPython/PyO3/Nukita/transpyle) * AI for monitoring * Neuromorphic Computing * Effective intrusion detection systems (IDS) Strategies in HPC Environments * The compute continuum - IoT, edge and HPC computing * Use cases for integration of edge and IoT with HPC simulations * AI and HPDA use cases for critical infrastructure from the medical and energy domains * Data management concepts in HPC - potential of data lakes and data warehousing * Scalable quantum computer simulation on HPC systems * FPGA Computing with SciEngine * RISC-V: State of the union * Regression Testing for HPC * Global Optimization (of Clusters) with Genetic Algorithms * RUST Programming for HPC application * Benchmarking of HPC Systems * Security in Cloud and HPC * Infiniband DPU * DevOps strategies in HPC * What's new in the Kubernetes ecosystem * Containers in HPC * Function-as-a-service in HPC * Object storage systems * Encryption tools * Scalable databases with e.g., Elasticsearch, Postgres * Kernel compilation and configuration * Berkeley Packet Filters (eBPF) * Forensic tools * Distributed computing paradigms in Cloud and HPC * Detection of AI generated content * Service Discovery and Traffic Management in Cloud Applications ===== Examination ===== The exam is conducted as part of the final presentation (30% of the mark) and the report (70%). ===== Agenda ===== * 24.10.24 **Introduction & Scientific Presentation** -- Julian Kunkel, Jonathan Decker \\ If you cannot attend contact us asap! * Introduction of the course format and requirements {{ :teaching:autumn_term_2024:scap-welcome.pdf |Slides}} * Assignment of topics to the participants on a first-come-first-served basis * Talk: Scientific Presentation {{ :teaching:autumn_term_2024:scientific-presentation.pdf |Slides}} * Recording: https://youtu.be/NrahVjkUFls?si=SLzX8dxGpfO3SdgW * 31.10.24 **Holiday** * 01.11.24 **You have submitted your selected topic by email to jonathan.decker@uni-goettingen.de** * 07.11.24 **LaTeX Crash Course & Scientific Writing** -- //Julian Kunkel, Jonathan Decker// * Introduction to LaTeX {{ :teaching:autumn_term_2024:latex-intro.pdf |Slides}} * Showcasing our LaTeX templates https://hps.vi4io.org/teaching/ressources/start#templates * Talk: Scientific Writing {{ :teaching:autumn_term_2024:scientific-writing.pdf |Slides}} * 08.11.24 **You have been assigned a supervisor and presentation date** * 14.11.24 **Effective Literature Search & Discussion of example reports** -- //Julian Kunkel, Jonathan Decker// * Talk: Effective Literature Search {{ :teaching:autumn_term_2024:scientific-literature.pdf |Slides}} * Discussion of example reports from previous semesters * 21.11.24 **Student topic presentations** * 28.11.24 **Student topic presentations** * 05.12.24 **Student topic presentations** * Frederik Hennecke: Impact of GIL-less Cpython on performance and compatiblity * Anila Ghazanfar: Security in Cloud and HPC * 12.12.24 **Student topic presentations** * Pablo Jahnen: AI and HPDA use cases for critical infrastructure from the medical and energy domains * 19.12.24 **Student topic presentations** * Maximilian Schlensog: Object Storage Systems * 09.01.25 **Student result presentations** * 16.01.25 **Student result presentations** * Frederik Hennecke: Impact of GIL-less Cpython on performance and compatibility * 23.01.25 **Student result presentations** * Maximilian Schlensog: Object Storage Systems * 30.01.25 **Student result presentations** * Anila Ghazanfar: Security in Cloud and HPC * Pablo Jahnen: AI and HPDA use cases for critical infrastructure from the medical and energy domains * 06.02.25 **Student result presentations** * Backup * 31.03.25 **Deadline for the submission of the report** ===== Topic Distribution ===== || **Student** || **Supervisor** || **Topic** || **Submissions** || || Your Name || Your Supervisor || Your Topic || {{ :teaching:autumn_term_2024:stud:report.pdf |Report}} || || Frederik Hennecke || Patrick Höhn || Impact of GIL-less Cpython on performance and compatiblity || || || Maximilian Schlensog || Michael B.Khani || Object Storage Systems || || || Anila Ghazanfar || Hendrik Nolte || Security in Cloud and HPC || || || Pablo Jahnen || Narges Lux || AI and HPDA use cases for critical infrastructure from the medical and energy domains or AI for monitoring || ||