Categories
Pages
-

DBIS

Thesis Projects

Information about Diploma/Master thesis process

Open Theses

    Running Theses

    • Ontology-Based Data Augmentation with LLMs for Narrative Classification


      Narrative Classification identifies stories via NLP but often lacks generalizability. While LLMs augment other text tasks, their narrative application remains exploratory. This thesis investigates whether an ontology-based LLM-agent framework incorporating specific data characteristics improves synthetic training data quality.
    • Traceability Framework for Human–LLM-Assisted Tabular Data Transformations


      Large Language Models (LLMs) are increasingly used to support data wrangling, but their integration into interactive transformation workflows raises new challenges for auditability, reproducibility, and accountability. When users approve, reject, or refine LLM-generated suggestions, conventional data lineage systems often fail to capture why a change occurred, who was responsible for it, and which transformation produced ...
    • Training a Tiny LLM with Block Attention Residuals on CommonsenseQA


      Knowledge-augmented multiple-choice question answering (MCQA) aims to improve robustness and factual grounding by integrating external structured knowledge (e.g., knowledge graphs) into language-model-based decision making. Current high-performing systems typically retrieve a local subgraph relevant to a question and candidate answers, then combine pretrained language representations with explicit graph reasoning modules. This thesis investigates an alternative representation path: ...
    • Master’s Thesis: Automatic Generation of Ontological Representations for Co-Simulation Configurations in Power Engineering


      AbstractThis master’s thesis aims at examining the applicability of automatic ontology generation and ontology-based data integration to the configuration of co-simulation scenarios. To study power systems through simulations, it is conducive to model sub-domains through separate simulators, which are combined through co-simulations to comprise complex simulation scenarios. However, what is gained through focused modelling of ...
    • Ontology-Grounded Extraction of Research SoftwareMentions from Scientific Publications


      Research software is among the least discoverable scholarly outputs. While standards like CodeMeta and CFF enable structured software metadata at the repository level, they require active curation by maintainers and see inconsistent adoption. On the publication side, only select publishers such as Schloss Dagstuhl’s DROPS platform provide citable software artifacts, again contingent on explicit author ...

    View all running theses

    Completed Theses

    View all completed theses