Thesis Projects
Information about the Diploma/Master thesis process
Open Theses
Large language models (LLMs) are increasingly used in biomedical applications, including literature mining (PMID: 40188094), drug discovery (PMID: 38730226; 41362614; https://arxiv.org/abs/2510.27130), clinical decision support (PMID: 40753316), and patient data analysis (PMID: 41034564). Hybrid approaches combining LLMs with structured knowledge bases and retrieval-augmented generation (RAG) improve performance and interpretability (PMID: 38830083; https://www.biorxiv.org/content/10.1101/2025.05.08.652829v2). However, LLM-based systems ...
Running Theses
Large Language Models (LLMs) are increasingly used to support data wrangling, but their integration into interactive transformation workflows raises new challenges for auditability, reproducibility, and accountability. When users approve, reject, or refine LLM-generated suggestions, conventional data lineage systems often fail to capture why a change occurred, who was responsible for it, and which transformation produced ...
Knowledge-augmented multiple-choice question answering (MCQA) aims to improve robustness and factual grounding by integrating external structured knowledge (e.g., knowledge graphs) into language-model-based decision making. Current high-performing systems typically retrieve a local subgraph relevant to a question and candidate answers, then combine pretrained language representations with explicit graph reasoning modules.
This thesis investigates an alternative representation path: ...
This master’s thesis examines the applicability of automatic ontology generation and ontology-based data integration to the configuration of co-simulation scenarios. To study power systems through simulations, it is useful to model sub-domains with separate simulators, which are then combined through co-simulation into complex simulation scenarios. However, what is gained through focused modelling of ...
Research software is among the least discoverable scholarly outputs. While standards like CodeMeta and CFF enable structured software metadata at the repository level, they require active curation by maintainers and see inconsistent adoption. On the publication side, only select publishers such as Schloss Dagstuhl’s DROPS platform provide citable software artifacts, again contingent on explicit author ...
Keeping a knowledge graph up to date as its source data evolves is harder than building one from scratch. New records appear, existing records are corrected, and metadata is enriched over time. Each type of change (a corrected DOI, an added co-author, a retracted publication) carries different semantic implications and may require a different update ...
View all running theses
Completed Theses
Federated Machine Learning Architecture for an MDF Production Industry Use Case
Data-driven quality assurance in grinding manufacturing technology
Technical support is an essential aspect of various industries, e.g., to provide help with maintaining machinery and IT systems. However, diagnosing error messages and faults in complex technologies can be a time-consuming and challenging task. The maintainer has to search through the long documentation booklets for the technology in order to find a solution or ...
The innovative integration of Mixed Reality and Large Language Models can lead to highly interactive instructional MR agents. Utilized as automated instructors, these MR agents have the potential to significantly enhance traditional instruction manuals by providing visual guidance. For instance, they can illustrate the next required actions in practical tasks such as tightening screws in ...
View all completed theses