An approach to extracting complex knowledge patterns among concepts belonging to structured, semi-structured and unstructured sources in a data lake / Lo Giudice, Paolo; Musarella, Lorenzo; Sofo, Giuseppe; Ursino, Domenico. - In: Information Sciences. - ISSN 0020-0255. - 478 (2019), pp. 606-626. [10.1016/j.ins.2018.11.052]

An approach to extracting complex knowledge patterns among concepts belonging to structured, semi-structured and unstructured sources in a data lake

Paolo Lo Giudice; Lorenzo Musarella; Domenico Ursino
2019-01-01

Abstract

In this paper, we propose a new network-based model to uniformly represent the structured, semi-structured and unstructured sources of a data lake, which is one of the newest and most successful architectures proposed for managing big data. Then, we present a new approach to, at least partially, "structuring" unstructured sources. Finally, with the support of these two tools, we define a new approach to extracting complex knowledge patterns from the data stored in a data lake. © 2018 Elsevier Inc. All rights reserved.

Keywords

Data lakes
Complex knowledge patterns
Network-based conceptual model
Structuring unstructured sources
Synonymies
Shortest paths

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.12318/133751