Data Lake

Research field: Big Data & Model Management
Status: completed
Data lakes have been proposed as a way to deal with the heterogeneity of big data: they are intended to provide a storage system for any kind of raw data. Metadata is of particular importance in such a system, because it captures information about the structure and semantics of the data. The group has been working on enhancing the data lake system Constance, which is based on a modular architecture with components for schema matching, schema mapping, query rewriting, and wrapping of data sources. The system is applied in other research projects, e.g., mi-Mappa, HUMIT, and charMAnt, as the basic platform for data integration.

