Partitioning and bucketing in Hive-based big data warehouses

Hive is a tool that allows the implementation of Data Warehouses for Big Data contexts, organizing data into tables, partitions and buckets. Some studies have been conducted to understand ways of optimizing the performance of data storage and processing techniques/technologies for Big Data Warehouse...

Bibliographic Details
Main Author: Costa, Eduarda (author)
Other Authors: Costa, Carlos A. (author), Santos, Maribel Yasmina (author)
Format: conferencePaper
Language: eng
Published: 2018
Online Access: http://hdl.handle.net/1822/55212
Country: Portugal
OAI: oai:repositorium.sdum.uminho.pt:1822/55212