Aasish Kumar Sharma

aasish_kumar_sharma.jpg

Aasish Kumar Sharma is a researcher/scientific employee at Göttingen University employed under Professor Dr. Julian Kunkel and is focusing in high-performance computing performance optimization. His work includes developing efficient task scheduling algorithms for HPC systems, published in leading journals. Aasish is particularly interested in scalable solutions for heterogeneous architectures and has collected some publications and received the NHR Research Scholarship Award for his contributions. He is interested in optimization applying different smart algorithms, and emerging technologies like Artificial Intelligence/ Machine Learning (AI/ML) and Quantum Computing (QC) techniques. His previous work includes data engineering and big data analysis, and SQL query optimization while working as a Database Administrator. He is also a Microsoft Certified Trainer for Microsoft SQL Query Optimization for year 2025.
ResearchGate

  • High-Performance Computing
  • Data Analytics (SQL)
  • Emerging Technologies: AI/ML, Quantum Computing
  • Optimization (remember, resources are limited) and Ethics (responsible AI)

Graph Transformer Architectures for Workflow Scheduling: Beyond Standard GNNsApply

Graph transformer models, which combine attention mechanisms with graph structure, consistently outperform standard message-passing GNNs on combinatorial optimization tasks in recent benchmarks (2024-2026). Applied to workflow scheduling, graph transformers can capture global task-dependency patterns and long-range interactions that locality-limited GNNs miss, a limitation documented in the Grapheon RL benchmark at large scales (rnc5000, DOI 10.5281/zenodo.20432418). This thesis replaces the GNN backbone of the Grapheon RL architecture with a graph transformer encoder (e.g., GraphGPS or Exphormer) and evaluates whether attention-based representations improve scheduling quality on rnc300-rnc1000 workflow instances from the published STG dataset. The student will compare objective gap, inference speed, and training convergence against the published GNN-RL baselines under the same homogeneous and heterogeneous system configurations. The thesis contributes a systematically evaluated architectural extension to the open benchmark.

Carbon-Aware Multi-Objective Scheduling for HPC Workflows Using Reinforcement LearningApply

Data centers in Europe are now subject to the EU Energy Efficiency Directive (2023/1791) and are under growing pressure from funders and institutions to report and minimize operational carbon emissions. HPC schedulers that factor in carbon intensity of the grid alongside performance are emerging as a key tool, but dedicated benchmarks combining workflow-level quality metrics with carbon cost remain rare. This thesis extends the Grapheon RL framework to a three-objective scheduler minimizing makespan, energy consumption, and carbon cost simultaneously. The student will integrate synthetic carbon intensity traces (modeled after real-world grid data from the European Energy Exchange or carbon-aware open APIs), reformulate the RL reward to include a carbon penalty term, and retrain and evaluate on the published STG workflow dataset. The evaluation reports Pareto frontier trade-offs and compares carbon savings against performance-only Grapheon RL and HEFT baselines. The thesis outcome serves as a reproducible carbon-aware scheduling baseline for the group's benchmark.

Transfer Learning Evaluation of GNN-RL Schedulers Across Workflow FamiliesApply

A practical requirement for production GNN-RL schedulers is the ability to generalize beyond the workflow family used for training. The Grapheon RL model (DOI 10.1109/COMPSAC65507.2025.00341) is trained on Standard Task Graph (STG) instances. It is an open question how well it transfers to other workflow families such as Pegasus CyberShake, Montage, or synthetic BLAST pipelines, which differ in graph structure, depth, parallelism ratio, and task heterogeneity. This BSc thesis systematically evaluates Grapheon RL transfer to at least two non-STG workflow families without any fine-tuning, with lightweight fine-tuning (10-50 additional episodes), and with full retraining. The student will report normalized objective gap, schedule feasibility, and inference speed under the homogeneous 3-node configuration from the benchmark. The thesis produces a transfer learning guide: which workflow structural properties predict successful generalization, and how many fine-tuning samples are sufficient for a new family.

Online Adaptive Scheduling with Continual Reinforcement Learning for Shifting HPC WorkloadsApply

Production HPC clusters experience workload distribution drift over time: new workflow types appear, system configurations change, and peak load periods vary by season or funding cycle. A GNN-RL scheduler trained on a fixed dataset degrades as the deployment distribution shifts away from training, requiring costly full retraining. Continual reinforcement learning methods (Elastic Weight Consolidation, PackNet, progressive networks) address this stability-plasticity dilemma by enabling an agent to learn new tasks without catastrophic forgetting. This thesis implements an online continual learning wrapper for the Grapheon RL scheduler that updates model weights on a rolling window of recent scheduling decisions from the GWDG SCC cluster workload logs. The student will evaluate forgetting rate on previous workflow sizes, adaptation speed to new workflow families, and compute cost per update step. A comparison with periodic full retraining quantifies the practical case for online adaptation. Access to anonymized SCC job logs is available through the GWDG HPS group.

Knowledge Graph Modeling of HPC Resources and Workflow DependenciesApply

Machine-readable modeling of HPC system resources and workflow task dependencies is a prerequisite for both exact optimization and learning-based scheduling. Current approaches use flat JSON or ad-hoc formats that lack semantic expressiveness. Knowledge graphs and linked-data ontologies offer a structured alternative that enables reasoning, constraint checking, and integration with LLM-based planners via graph query languages (SPARQL) or schema retrieval. This thesis designs and implements a knowledge graph schema for heterogeneous HPC systems and Standard Task Graph workflows, building on the node and workflow JSON definitions from the Stage 2 benchmark (DOI 10.5281/zenodo.20432418). The student will populate a reference knowledge graph instance, evaluate its expressiveness against the scheduling constraints used in Grapheon RL, and demonstrate two end-use cases: constraint validation at submission time and semantic query for suitable node allocation. The outcome is a reusable schema with open-source tooling for KG population from existing JSON workflow descriptions.

  • Performance Analysis of Convolutional Neural Network Applying Quantum Annealing, Aasish Kumar Sharma (Master's Thesis), Advisors: Sanjeeb Prasad Pandey, 2020-12-30, BibTeX URL
  • An Empirical Evaluation of Quantum-Inspired QUBO Methods for Heterogeneous HPC Workflow Mapping and Scheduling (Aasish Kumar Sharma, Christian Boehme, Julian Martin Kunkel), In Accepted (Coming Soon), ISC High Performance 2026, IEEE Computer Society (Piscataway, New Jersey, USA), ISC High Performance 2026, ISC 26, Hamburg, Germany, 2026 BibTeX
  • Enabling Kubernetes Workload Execution on Rootless HPC Systems with KSI: A Slurm Integration Framework (Jonathan Decker, Mojtaba Akbari, Ali Doosthosseini, Sören Metje, Aasish Kumar Sharma, Julian Kunkel), In International Journal on Advances in Intelligent Systems (18), 2025-12 BibTeX URL
  • Poster: Optimizing Workload in Heterogeneous HPC Workflows with Constraints (Aasish Kumar Sharma, Christian Boehme, Patrick Gelß, Julian Kunkel), ISC-HPC, Hamburg,Germany, 2025-06-11 BibTeX PDF
  • A Review of Tools and Techniques for Optimization of Workload Mapping and Scheduling in Heterogeneous HPC System (Aasish Kumar Sharma, Julian Kunkel), In ArXiv (1), pp. 12, 2025-05-16 BibTeX URL DOI
  • AI Work Quantization Model: Closed-System AI Computational Effort Metric (Aasish Kumar Sharma, Michael Bidollahkhani, Julian Martin Kunkel), In arXiv preprint arXiv:2503.14515, 2025-03-12 BibTeX URL
  • Performance Analysis of Convolutional Neural Network By Applying Unconstrained Binary Quadratic Programming (Aasish Kumar Sharma, Sanjeeb Prashad Pandey, Julian Martin Kunkel), In 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC), IEEE COMPSAC Proceedings (49), pp. 483-488, IEEE Computer Society (Piscataway, New Jersey, USA), IEEE Computer Society, COMPSAC, 2025 BibTeX URL DOI
  • Grapheon RL: A Graph Neural Network and Reinforcement Learning Framework for Constraint and Data-Aware Workflow Mapping and Scheduling in Heterogeneous HPC Systems (Aasish Kumar Sharma, Julian Martin Kunkel), In 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC), IEEE COMPSAC Proceedings (49), pp. 489-494, IEEE Computer Society (Piscataway, New Jersey, USA), IEEE Computer Society, COMPSAC, 2025 BibTeX URL DOI
  • Workflow-Driven Modeling for the Compute Continuum: An Optimization Approach to Automated System and Workload Scheduling (Aasish Kumar Sharma, Christian Boehme, Patrick Gelß, Ramin Yahyapour, Julian Martin Kunkel), In 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC), IEEE COMPSAC Proceedings (49), pp. 2170-2177, IEEE Computer Society (Piscataway, New Jersey, USA), IEEE Computer Society, COMPSAC, 2025 BibTeX URL DOI
  • Ethical AI: Towards Defining a Collective Evaluation Framework (Aasish Kumar Sharma), Paper Presentation, HammerHAI Ethical Board for AI Factories Meeting 2026, 2026-03-10 Presentation

All publications as BibTex

  • about/people/aasish_kumar_sharma.txt
  • Last modified: 2023-08-28 10:40
  • by 127.0.0.1