In this paper, we propose a new network-based model to uniformly represent the structured, semi-structured and unstructured sources of a data lake, which is one of the newest and most successful architectures proposed for managing big data. Then, we present a new approach to, at least partially, "structuring" unstructured sources. Finally, with the support of these two tools, we define a new approach to extracting complex knowledge patterns from the data stored in a data lake. (C) 2018 Elsevier Inc. All rights reserved.
An approach to extracting complex knowledge patterns among concepts belonging to structured, semi-structured and unstructured sources in a data lake / LO GIUDICE, Paolo; Musarella, Lorenzo; Sofo, Giuseppe; Ursino, Domenico. - In: INFORMATION SCIENCES. - ISSN 0020-0255. - 478:(2019), pp. 606-626. [10.1016/j.ins.2018.11.052]
An approach to extracting complex knowledge patterns among concepts belonging to structured, semi-structured and unstructured sources in a data lake
Paolo Lo Giudice;Lorenzo Musarella;Domenico Ursino
2019-01-01
Abstract
In this paper, we propose a new network-based model to uniformly represent the structured, semi-structured and unstructured sources of a data lake, which is one of the newest and most successful architectures proposed for managing big data. Then, we present a new approach to, at least partially, "structuring" unstructured sources. Finally, with the support of these two tools, we define a new approach to extracting complex knowledge patterns from the data stored in a data lake. (C) 2018 Elsevier Inc. All rights reserved.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.