Energy monitoring as an essential building block towards sustainable ultrascale systems

An ultrascale system (USS) joins parallel and distributed computing systems that will be two to three orders of magnitude larger than today's infrastructure regarding scale, performance, the number of components and their complexity. For such systems to become a reality, however, advances must...

Full description

Bibliographic Details
Main Author: Jorge Barbosa (author)
Other Authors: Francisco Almeida (author), Marcos D. Assunção (author), Vivente Blanco (author), Ivona Brandic (author), Georges da Costa (author), Manuel F. Dolz (author), Anne C. Elster (author), Mateusz Jarus (author), Helen D. Karatza (author), Laurent Lefèvre (author), Ilias Mavridis (author), Ariel Oleksiak (author), Anne-Cécile Orgerie (author), Jean-Marc Pierson (author)
Format: article
Language:eng
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10216/116003
Country:Portugal
Oai:oai:repositorio-aberto.up.pt:10216/116003
Description
Summary:An ultrascale system (USS) joins parallel and distributed computing systems that will be two to three orders of magnitude larger than today's infrastructure regarding scale, performance, the number of components and their complexity. For such systems to become a reality, however, advances must be made in high performance computing (HPC), large-scale distributed systems, and big data solutions, also tackling challenges such as improving the energy efficiency of the IT infrastructure. Monitoring the power consumed by underlying IT resources is essential towards optimising the manner IT resources are used and hence improve the sustainability of such systems. Nevertheless, monitoring the energy consumed by USSs is a challenging endeavour as the system can comprise thousands of heterogeneous server resources spanning multiple data centres. Moreover, the amount of monitoring data, its gathering, and processing, should never become a bottleneck nor profoundly impact the energy efficiency of the overall system. This work surveys state of the art on energy monitoring of large-scale systems and methodologies for monitoring the power consumed by large systems and discusses some of the challenges to be addressed towards monitoring and improving the energy efficiency of USSs. Next, we present efforts made on designing monitoring solutions. Finally, we discuss potential gaps in existing solutions when tackling emerging large-scale monitoring scenarios and present some directions for future research on the topic.