

SmartDataLake · Sustainable Data Lakes for Extreme-Scale Analytics

Programme: H2020 · Status: CLOSED · Start: 1 January 2019 · End: 31 December 2021 · EU funding: €3,945,450 · Call: H2020-ICT-2018-20

Data lakes are raw data ecosystems, where large amounts of diverse data are retained and coexist. They facilitate self-service analytics for flexible, fast, ad hoc decision making. SmartDataLake enables extreme-scale analytics over sustainable big data lakes.

It provides an adaptive, scalable and elastic data lake management system that offers: (a) data virtualization for abstracting and optimizing access and queries over heterogeneous data, (b) data synopses for approximate query answering and analytics to enable interactive response times, and (c) automated placement of data in different storage tiers based on data characteristics and access patterns to reduce costs.

The data lake’s contents are modelled and organised as a heterogeneous information network, containing multiple types of entities and relations. Efficient and scalable algorithms are provided for: (a) similarity search and exploration for discovering relevant information, (b) entity resolution and ranking for identifying and selecting important and representative entities across sources, (c) link prediction and clustering for unveiling hidden associations and patterns among entities, and (d) change detection and incremental update of analysis results to enable faster analysis of new data.

Finally, interactive and scalable visual analytics are provided to include and empower the data scientist in the knowledge extraction loop. This includes functionalities for: (a) visually exploring and tuning the space of features, models and parameters, and (b) enabling large-scale visualizations of spatial, temporal and network data.

The results of the project are evaluated in real-world use cases from the business intelligence domain, including scenarios for portfolio recommendation, production planning and pricing, and investment decision making. SmartDataLake will foster innovation and enable European SMEs to capitalize on the value of their own data lakes.

Consortium · 8 organisations

Coordinator

ATHINA-EREVNITIKO KENTRO KAINOTOMIAS STIS TECHNOLOGIES TIS PLIROFORIAS, TON EPIKOINONION KAI TIS GNOSIS · EL · €853,125

Participants

SPAZIODATI SRL · IT · €302,500
SYNYO GmbH · AT · €322,500
SPRING TECHNO GMBH & CO KG · DE · €300,938
ECOLE POLYTECHNIQUE FEDERALE DE LAUSANNE · CH · €760,000
RAW LABS SA · CH · €411,750
UNIVERSITAT KONSTANZ · DE · €425,000
TECHNISCHE UNIVERSITEIT EINDHOVEN · NL · €569,638




Source: CORDIS, Publications Office of the European Union. Global Research Partnerships surfaces open EU research data to help you find collaborators; we are not affiliated with the European Union.