Partitioning and bucketing in Hive-based big data warehouses

Hive is a tool that allows the implementation of Data Warehouses for Big Data contexts, organizing data into tables, partitions and buckets. Some studies have been conducted to understand ways of optimizing the performance of data storage and processing techniques/technologies for Big Data Warehouse...

Bibliographic details
Main author: Costa, Eduarda (author)
Other authors: Costa, Carlos A. (author), Santos, Maribel Yasmina (author)
Format: conferencePaper
Language: English
Published in: 2018
Full text: http://hdl.handle.net/1822/55212
Country: Portugal
OAI: oai:repositorium.sdum.uminho.pt:1822/55212