Document details

Modelling and implementing big data warehouses for decision support

Author(s): Santos, Maribel Yasmina ; Martinho, Bruno ; Costa, Carlos

Date: 2017

Persistent ID: http://hdl.handle.net/1822/45327

Origin: RepositóriUM - Universidade do Minho

Project/scholarship: info:eu-repo/grantAgreement/FCT/5876/147280/PT;

Subject(s): Big data; Data model; Data warehouse; Hive; NoSQL; Social Sciences


Description

In the era of Big Data, many NoSQL databases emerged for the storage and later processing of vast volumes of data, using data structures that can follow columnar, key-value, document or graph formats. For analytical contexts, requiring a Big Data Warehouse, Hive is used as the driving force, allowing the analysis of vast amounts of data. Data models in Hive are usually defined taking into consideration the queries that need to be answered. In this work, a set of rules is presented for the transformation of multidimensional data models into Hive tables, making available data at different levels of detail. These several levels are suited for answering different queries, depending on the analytical needs. After the identification of the Hive tables, this paper summarizes a demonstration case in which the implementation of a specific Big Data architecture shows how the evolution from a traditional Data Warehouse to a Big Data Warehouse is possible.

This work has been supported by COMPETE: POCI-01-0145-FEDER-007043 and FCT (Fun- dação para a Ciência e Tecnologia) within the Project Scope: UID/CEC/00319/2013. This work has been funded by the SusCity project (MITP-TB/CS/0026/2013) and by Portugal Incentive System for Research and Technological Development, Project in co-promotion no 002814/ 2015 (iFACTORY 2015-2018).

info:eu-repo/semantics/publishedVersion

Document Type Journal article
Language English
Contributor(s) Universidade do Minho
CC Licence
facebook logo  linkedin logo  twitter logo 
mendeley logo

Related documents