Categories
Pages
-

DBIS

Enhancing LLM-based Cybersecurity Playbook Transformation Using Process Similarities

June 24th, 2025

Thesis Type
  • Master
Student
Hans Vrapi
Status
Running
Proposal on
15/07/2025 12:00 pm
Proposal room
Seminar room I5 6202
Supervisor(s)
Stefan Decker
Advisor(s)
Mehdi Akbari G.
Contact
mehdi.akbari.gurabi@fit.fraunhofer.de

Short Description:
This thesis aims to develop a framework for evaluating how accurately Large Language Model (LLM) pipelines convert legacy community playbooks into CACAO-based cybersecurity playbooks. The main focus is on ensuring semantic fidelity: transforming both original and translated workflows into a unified BPMN (or less preferred Petri-nets) representation, then applying process-mining metrics to compute similarity scores and generate detailed discrepancy reports and feed back to the LLM for automatic refinements. By capturing both control-flow and data-flow within BPMN, the framework intends to quantify and refine LLM translation accuracy, enabling standardized cyber incident response procedures without compromising workflow logic. The research emphasizes the use of process mining techniques and similarity checks to evaluate and optimize playbook workflows. Moving beyond traditional graph edit distance, the study seeks to identify and apply superior metrics for process similarity analysis, providing enhanced insights into the workflows and ensuring semantic validation of translated playbooks. The goal is to introduce a novel evaluation framework for the LLM-generated workflows of playbooks.

Relevant links and literatures:

Research Questions:

  • How can diverse cybersecurity playbooks be transformed into a standardized BPMN workflow while preserving their essential procedural logic and data dependencies?
  • Which process mining and similarity techniques effectively evaluate the fidelity of LLM-translated playbooks, ensuring minimal loss or alteration of critical workflow elements?
  • In what ways can detected discrepancies be utilized to refine translation strategies and enhancing the performance and reliability of future LLM-based playbook conversions?

Prerequisites:

Knowledge in the domain of Generative AI and their application, in addition to basic knowledge in the domains of cyber incident response and process mining.