Evaluating partitioning and bucketing strategies for Hive-based Big Data Warehousing systems

Hive has long been one of the industry-leading systems for Data Warehousing in Big Data contexts, mainly organizing data into databases, tables, partitions and buckets, stored on top of an unstructured distributed file system like HDFS. Some studies were conducted for understanding the ways of optim...

ver descrição completa

Detalhes bibliográficos
Autor principal: Costa, Eduarda (author)
Outros Autores: Costa, Carlos A. P. (author), Santos, Maribel Yasmina (author)
Formato: article
Idioma:eng
Publicado em: 2019
Assuntos:
Texto completo:http://hdl.handle.net/1822/66781
País:Portugal
Oai:oai:repositorium.sdum.uminho.pt:1822/66781