Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

Hive has long been one of the industry-leading systems for Data Warehousing in Big Data contexts, mainly organizing data into databases, tables, partitions and buckets, stored on top of an unstructured distributed file system like HDFS. Some studies were conducted for understanding the ways of optim...

Bibliographic Details
Main Author: Costa, Eduarda (author)
Other Authors: Costa, Carlos A. P. (author), Santos, Maribel Yasmina (author)
Format: article
Language: eng
Published: 2019
Online Access: http://hdl.handle.net/1822/66781
Country: Portugal
OAI: oai:repositorium.sdum.uminho.pt:1822/66781