Categories
Pages
-

DBIS

Thesis Projects

Information about Diploma/Master thesis process

Open Theses

  • Comparative analyses of hybrid LLMs with Knowledge base integration and RAGs in biomedical domain


    Large language models (LLMs) are increasingly used in biomedical applications, including literature mining (PMID: 40188094), drug discovery (PMID: 38730226; 41362614; https://arxiv.org/abs/2510.27130), clinical decision support (PMID: 40753316), and patient data analysis (PMID: 41034564). Hybrid approaches combining LLMs with structured knowledge bases and retrieval-augmented generation (RAG) improve performance and interpretability (PMID: 38830083; https://www.biorxiv.org/content/10.1101/2025.05.08.652829v2) . However, LLM-based systems ...

Running Theses

  • Traceability Framework for Human–LLM-Assisted Tabular Data Transformations


    Large Language Models (LLMs) are increasingly used to support data wrangling, but their integration into interactive transformation workflows raises new challenges for auditability, reproducibility, and accountability. When users approve, reject, or refine LLM-generated suggestions, conventional data lineage systems often fail to capture why a change occurred, who was responsible for it, and which transformation produced ...
  • Training a Tiny LLM with Block Attention Residuals on CommonsenseQA


    Knowledge-augmented multiple-choice question answering (MCQA) aims to improve robustness and factual grounding by integrating external structured knowledge (e.g., knowledge graphs) into language-model-based decision making. Current high-performing systems typically retrieve a local subgraph relevant to a question and candidate answers, then combine pretrained language representations with explicit graph reasoning modules. This thesis investigates an alternative representation path: ...
  • Master’s Thesis: Automatic Generation of Ontological Representations for Co-Simulation Configurations in Power Engineering


    AbstractThis master’s thesis aims at examining the applicability of automatic ontology generation and ontology-based data integration to the configuration of co-simulation scenarios. To study power systems through simulations, it is conducive to model sub-domains through separate simulators, which are combined through co-simulations to comprise complex simulation scenarios. However, what is gained through focused modelling of ...
  • Ontology-Grounded Extraction of Research SoftwareMentions from Scientific Publications


    Research software is among the least discoverable scholarly outputs. While standards like CodeMeta and CFF enable structured software metadata at the repository level, they require active curation by maintainers and see inconsistent adoption. On the publication side, only select publishers such as Schloss Dagstuhl’s DROPS platform provide citable software artifacts, again contingent on explicit author ...
  • Incremental Knowledge Graph Ingestion with Change Detection and Provenance Tracking


    Keeping a knowledge graph up to date as its source data evolves is harder than building one from scratch. New records appear, existing records are corrected, and metadata is enriched over time. Each type of change a corrected DOI, an added co-author, a retracted publication carries different semantic implications and may require a different update ...

View all running theses

Completed Theses

View all completed theses