search-icon
Workshop
:
HPCViz: Monitoring Health Status of High Performance Computing Systems
Event Type
Workshop
Registration Categories
W
Tags
HPC Center Planning and Operations
Heterogeneous Systems
Scientific Computing
State of the Practice
Visualization
Datacenter
monitoring
TimeMonday, November 12th10:30am - 10:50am
LocationD170
DescriptionThis paper introduces HPCViz, a visual analytic tool for tracking and monitoring system events through a RESTful interface. The goals of this tool are: 1) to monitor a set of system events from multiple hosts and racks in real time statistics, 2) to support system administrators in alarming and detecting unusual signature-based patterns exhibited by health records of hosts in a complex system, and 3) to help in performing system debugging with a visual layout for both computing resource allocations and health monitoring map that mimics the actual system. A case study was conducted in a Redfish environment with a sample of 10 racks and 467 hosts. The result of the case study shows that the visualization tool offers excellent support for system analysists to profile and observe system behavior and further identify the traces of issues occurred.
Archive
Back To Top Button