CView and NWPerf for Supercomputer Performance Collection and Display.
TimeSunday, November 11th9:30am - 10am
DescriptionCView is a full 3D interactive graphics tool for displaying collected System and Application performance data. Data is collected using various open source tools common on large clusters such as ganglia, collectl, or collectd. When collecting data on large scale systems with thousands of nodes to tens of thousands of nodes, this data set becomes quite large over time. For example, on The Cascade System at the Environmental Molecular Sciences Laboratory at PNNL one days collection of performance data stored as raw floats is about 2.5 Gigabytes of data. The CView Graphics environment allows a user to easily visualize such data-sets. This allows System Administrators to 'see' how the system is performing in near-time data streams. System Administrators and System Users use it for visualizing the performance of the whole systems or just the nodes involved in a specific job run. The Systems at PNNL use the NWPerf software, backed by a Ceph Object store to route and store the performance data our clusters. By storing the data this method a user can display data from any point in time and investigate various metrics at the system or job level.