DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

<span class="var-sub_title">DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access</span> SC18 Proceedings

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

Authors: Pak Markthub (Tokyo Institute of Technology), Mehmet E. Belviranli (Oak Ridge National Laboratory), Seyong Lee (Oak Ridge National Laboratory), Jeffrey S. Vetter (Oak Ridge National Laboratory), Satoshi Matsuoka (RIKEN, Tokyo Institute of Technology)

Abstract: Heterogeneous computing with accelerators is growing in importance in high performance computing (HPC). Recently, application datasets have expanded beyond the memory capacity of these accelerators, and often beyond the capacity of their hosts. Meanwhile, nonvolatile memory (NVM) storage has emerged as a pervasive component in HPC systems because NVM provides massive amounts of memory capacity at affordable cost. Currently, for accelerator applications to use NVM, they must manually orchestrate data movement across multiple memories and this approach only performs well for applications with simple access behaviors. To address this issue, we developed DRAGON, a solution that enables all classes of GP-GPU applications to transparently compute on terabyte datasets residing in NVM. DRAGON leverages the page-faulting mechanism on the recent NVIDIA GPUs by extending capabilities of CUDA Unified Memory (UM). Our experimental results show that DRAGON transparently expands memory capacity and obtain additional speedups via automated I/O and data transfer overlapping.

Presentation: file

Back to Technical Papers Archive Listing