Jakob is a research assistant in the group of Julian Kunkel. He has a Master's degree in computer science from University of Göttingen.
WebGPU is an emerging graphics API for modern GPU use in the browser. In this thesis the potential of it for AI inference directly in the browser is explored.
Apache TVM is a machine learning compiler framework. In this work, the performance of it is evaluated with respect to alternative options.
The defacto standard API for LLM inference tasks is the OpenAI API. However, it is not optimal with regard to performance and other characteristics. In this work, other existing and novel protocol designs for common AI inference tasks are explored and evaluated.
All publications as BibTex