Categories
Pages
-

DBIS

Data Stream Synopses for Layered Cloud-Edge Infrastructures

January 29th, 2025

Thesis Type
  • Master
Status
Open
Presentation room
Seminar room I5 6202
Supervisor(s)
Sandra Geisler
Advisor(s)
Liam Tirpitz
Contact
tirpitz@dbis.rwth-aachen.de

Data stream synopses are compact representations of large, potentially infinite data streams that enable efficient analysis and processing without storing the entire stream.
They consist of a corresponding data structure and an algorithm updating it.
Synposes can also be used to reduce the volume of a data stream, for example on constrained network links. Synopses can provide a tradeoff between the data volume and the accuracy of the resulting data stream.

In Cloud-Edge Settings, for example in Distributed Data Stream Processing on the Edge (DSPoE), it is especially important to reduce data volumes on the edge. Here, multiple, distributed data sources contribute to results of global stream queries and reduced result accuracy may be acceptable to reduce overall data volumes.

 

In the context of this thesis, you will explore how tradeoffs between accuracy and data volume can be configured at runtime through suitable data stream synopses in distributed edge environments.

Towards that goal, you will:

  • Research existing approaches and frameworks for data stream synopses, such as Condor[1]
  • Evaluate the suitability of existing approaches for hierarchical and distributed stream summarization
  • Evaluate the suitability of existing approaches for dynamic accuracy tradeoffs at runtime
  • Implement distributed, hierarchical stream synopses

 

Interested? Questions? Contact Us!

Liam Tirpitz, M.Sc. – tirpitz@dbis.rwth-aachen.de – Tel: +49 241 80-21542

[1] https://doi.org/10.14778/3467861.3467871