Thesis Projects
Information about Diploma/Master thesis process
Open Theses
Users are increasingly required to give away private Email addresses in order be reachable by service providers, e.g., to be receive invoices, digital receipts, or newsletters. While this is especially true for digital services, also physical interactions increasingly shift toward involving digital information exchanges. Most notably, paper-based receipts are being replaced by digital equivalents.
However, digital ...
Large language models (LLMs) are increasingly used in biomedical applications, including literature mining (PMID: 40188094), drug discovery (PMID: 38730226; 41362614; https://arxiv.org/abs/2510.27130), clinical decision support (PMID: 40753316), and patient data analysis (PMID: 41034564). Hybrid approaches combining LLMs with structured knowledge bases and retrieval-augmented generation (RAG) improve performance and interpretability (PMID: 38830083; https://www.biorxiv.org/content/10.1101/2025.05.08.652829v2) . However, LLM-based systems ...
This thesis focuses on developing a Transcriptome-Language Model (TLM) to effectively bridge the modality gap between transcriptomic data and natural language text. You will explore advanced models for transcriptomic data representation, and cross-modal learning techniques aligning transcriptomic and textual modalities. This model will be evaluated in tasks such as zero-shot cell property classification and text ...
Running Theses
A Rule-Based Agent for Semantic Matching Graph Visualization
Power grids are increasingly operated through tightly interconnected IT/OT infrastructures, which raises the attack surface and makes smaller operators with limited resources particularly vulnerable to security-relevant incidents. This thesis develops and evaluates a reproducible, resource-efficient analysis algorithm that captures essential system, process, role, location, and information-flow data to derive protection needs and criticality, and to ...
Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation :
Lack of access to existing codebases
Limited knowledge of project-specific packages, dependencies, and interfaces
Difficulty maintaining consistency with established code patterns and architectures
To address these challenges, Retrieval Augmented Generation (RAG) approaches have emerged, ...
This thesis investigates how Large Language Models (LLMs) can be equipped with a deeper, architecture-level understanding of tabular data, going beyond “tables-as-serialized-text” toward tables-as-structured objects that expose row/column topology, header semantics, cell neighborhoods, and inter-cell dependencies to the model in a principled way . The target setting is Semantic Table Interpretation (STI) as studied in the SemTab challenge, focusing on three ...
This thesis investigates how to formally represent early-stage data science requirements and how to support the automation of early-stage data science through an LLM-based agent.
View all running theses
Completed Theses
Federated Machine Learning Architecture for an MDF Production Industry Use Case
Data-driven quality assurance in grinding manufacturing technology
Technical support is an essential aspect of various industries, e.g., to provide help with maintaining machinery and IT systems. However, diagnosing error messages and faults in complex technologies can be a time-consuming and challenging task. The maintainer has to search through the long documentation booklets for the technology in order to find a solution or ...
The innovative integration of Mixed Reality and Large Language Models can lead to highly interactive instructional MR agents. Utilized as automated instructors, these MR agents have the potential to significantly enhance traditional instruction manuals by providing visual guidance. For instance, they can illustrate the next required actions in practical tasks such as tightening screws in ...
View all completed theses