Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU). Xinsheng Qin (University of Washington), Randall LeVeque (University of Washington), Michael Motley (University of Washington)
AI Matrix – Synthetic Benchmarks for DNN. Wei Wei (Alibaba Inc), Lingjie Xu (Alibaba Inc), Lingling Jin (Alibaba Inc), Wei Zhang (Alibaba Inc), Tianjun Zhang (University of California, Berkeley)
An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level. Debzani Deb (Winston-Salem State University), Muztaba Fuad (Winston-Salem State University), Keith Irwin (Winston-Salem State University)
Applying the Execution-Cache-Memory Model: Current State of Practice. Georg Hager (University of Erlangen-Nuremberg, Erlangen Regional Computing Center), Jan Eitzinger (University of Erlangen-Nuremberg, Erlangen Regional Computing Center), Julian Hornich (University of Erlangen-Nuremberg, Erlangen Regional Computing Center), Francesco Cremonesi (Swiss Federal Institute of Technology in Lausanne), Christie L. Alappat (University of Erlangen-Nuremberg, Erlangen Regional Computing Center), Thoams Roehl (University of Erlangen-Nuremberg), Gerhard Wellein (University of Erlangen-Nuremberg, Erlangen Regional Computing Center)
Automatic Generation of Mixed-Precision Programs. Logan Moody (Lawrence Livermore National Laboratory, James Madison University), Nathan Pinnow (Lawrence Livermore National Laboratory, Western Washington University), Michael O. Lam (James Madison University, Lawrence Livermore National Laboratory), Harshitha Menon (Lawrence Livermore National Laboratory), Markus Schordan (Lawrence Livermore National Laboratory), G. Scott Lloyd (Lawrence Livermore National Laboratory), Tanzima Islam (Western Washington University)
Binarized ImageNet Inference in 29us. Tong Geng (Boston University, Pacific Northwest National Laboratory), Ang Li (Pacific Northwest National Laboratory), Tianqi Wang (Boston University), Shuaiwen Leon Song (Pacific Northwest National Laboratory), Martin Herbordt (Boston University)
Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures. Tobias Klöffel (University of Erlangen-Nuremberg), Bernd Meyer (University of Erlangen-Nuremberg), Gerald Mathias (Leibniz Supercomputing Centre)
Capsule Networks for Protein Structure Classification. Dan A. Rosa de Jesus (Lawrence Berkeley National Laboratory, University of Puerto Rico at Mayaguez), Julian Cuevas Paniagua (Lawrence Berkeley National Laboratory, University of Puerto Rico at Mayaguez), Wilson Rivera (Lawrence Berkeley National Laboratory, University of Puerto Rico at Mayaguez), Silvia Crivelli (Lawrence Berkeley National Laboratory)
Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance. Zhi Qiao (University of North Texas), Song Fu (University of North Texas), Hsing-bung Chen (Los Alamos National Laboratory), Bradley Settlemyer (Los Alamos National Laboratory)
A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms. Marziyeh Nourian (North Carolina State University), Hancheng Wu (North Carolina State University), Michela Becchi (North Carolina State University)
Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA. Hancheng Wu (North Carolina State University), John Ravi (North Carolina State University), Michela Becchi (North Carolina State University)
Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images. Chaitanya Kolluru (Case Western Reserve University), David Prabhu (Case Western Reserve University), Yanzan Gharaibeh (Case Western Reserve University), David Wilson (Case Western Reserve University), Sanjaya Gajurel (Case Western Reserve University)
Cross-Layer Group Regularization for Deep Neural Network Pruning. Shuang Gao (Nvidia Corporation), Xin Liu (Nvidia Corporation)
DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping. Ahmed Al-Jarro (Fujitsu Laboratories Ltd), Serban Georgescu (Fujitsu Laboratories Ltd), Yasumoto Tomita (Fujitsu Laboratories Ltd), Kouta Nakashima (Fujitsu Laboratories Ltd)
Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations. Aurélien Cavelan (University of Basel), Florina M. Ciorba (University of Basel), Ruben M. Cabezón (University of Basel)
Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis. Kohei Murotani (Railway Technical Research Institute, Japan), Koji Nakade (Railway Technical Research Institute, Japan), Yasushi Kamata (Railway Technical Research Institute, Japan), Daisuke Takahashi (Railway Technical Research Institute, Japan)
Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems. Wei Zhang (Texas Tech University), Houjun Tang (Lawrence Berkeley National Laboratory), Suren Byna (Lawrence Berkeley National Laboratory), Yong Chen (Texas Tech University)
Distributed Fast Boundary Element Methods. Michal Merta (Technical University of Ostrava, Czech Republic), Jan Zapletal (Technical University of Ostrava, Czech Republic), Michal Kravcenko (Technical University of Ostrava, Czech Republic)
An Efficient SIMD Implementation of Pseudo-Verlet Lists for Neighbor Interactions in Particle-Based Codes. James Willis (Durham University, Institute for Computational Cosmology), Matthieu Schaller (Leiden Observatory), Pedro Gonnet (Google LLC)
Enabling Data Analytics Workflows Using Node-Local Storage. Tu Mai Anh Do (University of Southern California, Information Sciences Institute), Ming Jiang (Lawrence Livermore National Laboratory), Brian Gallagher (Lawrence Livermore National Laboratory), Albert Chu (Lawrence Livermore National Laboratory), Cyrus Harrison (Lawrence Livermore National Laboratory), Karan Vahi (University of Southern California, Information Sciences Institute), Ewa Deelman (University of Southern California, Information Sciences Institute)
Enabling High-Level Graph Processing via Dynamic Tasking. Maurizio Drocco (Pacific Northwest National Laboratory), Vito Giovanni Castellana (Pacific Northwest National Laboratory), Marco Minutoli (Pacific Northwest National Laboratory), Antonino Tumeo (Pacific Northwest National Laboratory), John Feo (Pacific Northwest National Laboratory)
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities. Norm Buchanan (Colorado State University), Steven Calvez (Colorado State University), Pengfei Ding (Fermi National Accelerator Laboratory), Derek Doyle (Colorado State University), Alex Himmel (Fermi National Accelerator Laboratory), Burt Holzman (Fermi National Accelerator Laboratory), Jim Kowalkowski (Fermi National Accelerator Laboratory), Andrew Norman (Fermi National Accelerator Laboratory), Alex Sousa (University of Cincinnati), Marc Paterno (Fermi National Accelerator Laboratory), Saba Sehrish (Fermi National Accelerator Laboratory), Brandon White (Fermi National Accelerator Laboratory), Christopher Green (Fermi National Accelerator Laboratory)
Enabling Reproducible Microbiome Science through Decentralized Provenance Tracking in QIIME 2. Ahmad Turan Naimey (Northern Arizona University, Pathogen and Microbiome Institute), Christopher Keefe (Northern Arizona University, Pathogen and Microbiome Institute)
Energy Efficiency of Reconfigurable Caches on FPGAs. Tianqi Wang (Boston University), Ang Li (Pacific Northwest National Laboratory), Tong Geng (Boston University), Martin Herbordt (Boston University)
Estimating Molecular Dynamics Chemical Shift with GPUs. Eric F. Wright (University of Delaware), Mauricio H. Ferrato (University of Delaware)
Exploring Application Performance on Fat-Tree Networks in the Presence of Congestion. Philip A. Taffet (Rice University, Lawrence Livermore National Laboratory), Sanil Rao (University of Virginia, Lawrence Livermore National Laboratory), Ian Karlin (Lawrence Livermore National Laboratory)
Fast and Accurate Training of an AI Radiologist. Lucas A. Wilson (Dell EMC), Vineet Gundecha (Dell EMC), Srinivas Varadharajan (Dell EMC), Alex Filby (Dell EMC), Pei Yang (Dell EMC), Quy Ta (Dell EMC), Valeriu Codreanu (SURFsara), Damian Podareanu (SURFsara), Vikram Saletore (Intel Corporation)
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures. Haidong Lan (Shandong University), Jintao Meng (Tencent Holdings Ltd), Christian Hundt (Johannes Gutenberg University Mainz), Bertil Schmidt (Johannes Gutenberg University Mainz), Minwen Deng (Tencent Holdings Ltd), Weiguo Liu (Shandong University), Yanjie Wei (Shenzhen Institutes of Advanced Technology), Shengzhong Feng (Shenzhen Institutes of Advanced Technology)
Floating-Point Autotuner for CPU-Based Mixed-Precision Applications. Ruidong Gu (North Carolina State University), Paul A. Beata (North Carolina State University), Michela Becchi (North Carolina State University)
FlowOS-RM: Disaggregated Resource Management System. Ryousei Takano (National Institute of Advanced Industrial Science and Technology (AIST)), Kuniyasu Suzaki (National Institute of Advanced Industrial Science and Technology (AIST)), Hidetaka Koie (National Institute of Advanced Industrial Science and Technology (AIST))
Full State Quantum Circuit Simulation by Using Lossy Data Compression. Xin-Chuan Wu (University of Chicago, Argonne National Laboratory), Sheng Di (Argonne National Laboratory), Franck Cappello (Argonne National Laboratory), Hal Finkel (Argonne National Laboratory), Yuri Alexeev (Argonne National Laboratory), Frederic T. Chong (University of Chicago)
GPGPU Performance Estimation with Core and Memory Frequency Scaling. Qiang Wang (Hong Kong Baptist University), Xiaowen Chu (Hong Kong Baptist University)
GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne. Samuel Antao (IBM), Charles Moulinec (Science and Technology Facilities Council, UK), Yvan Fournier (EDF Research and Development), Robert Sawko (IBM), Malgorzata Zimon (IBM), Christopher Thompson (IBM), Alex Skillen (Science and Technology Facilities Council, UK), Juan Uribe (EDF Research and Development), David Emerson (Science and Technology Facilities Council, UK)
GPU-Accelerated Interpolation for 3D Image Registration. Naveen Himthani (University of Texas, Institute for Computational Engineering and Sciences), Andreas Mang (University of Houston), Amir Gholami (University of California, Berkeley), George Biros (University of Texas, Institute for Computational Engineering and Sciences)
Hardware Acceleration of CNNs with Coherent FPGAs. Md Syadus Sefat (Texas State University), Semih Aslan (Texas State University), Apan Qasem (Texas State University)
Hermes: a Multi-Tiered Distributed I/O Buffering System for HDF5. Hariharan Devarajan (Illinois Institute of Technology, HDF Group)
High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem. Apan Qasem (Texas State University), Clara Novoa (Texas State University), Chandra Kolla (Texas State University), Samantha Coyle (Texas State University)
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments. Jorji Nonaka (Riken Center for Computational Science), Kenji Ono (Kyushu University, RIKEN), Naohisa Sakamoto (Kobe University, RIKEN), Kengo Hayashi (Kobe University, RIKEN), Tomohiro Kawanabe (Riken Center for Computational Science), Fumiyoshi Shoji (Riken Center for Computational Science), Masahiro Fujita (LTE Inc), Kentaro Oku (Kashika Inc), Kazuma Hatta (Imagica Digitalscape)
HPC-as-a-Service for Life Sciences. Vaclav Svaton (Technical University of Ostrava, Czech Republic), Jan Martinovic (Technical University of Ostrava, Czech Republic), Nina Jeliazkova (IDEAconsult Ltd, Bulgaria), Vladimir Chupakhin (Janssen Pharmaceutika NV), Pavel Tomancak (Max Planck Institute of Molecular Cell Biology and Genetics), Petr Vojta (Palacký University Olomouc, Czech Republic)
Implementing Efficient Data Compression and Encryption in a Persistent Key-Value Store for HPC. Jungwon Kim (Oak Ridge National Laboratory), Jeffrey S. Vetter (Oak Ridge National Laboratory)
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation. Sihuan Li (University of California, Riverside), Sheng Di (Argonne National Laboratory), Xin Liang (University of California, Riverside), Zizhong Chen (University of California, Riverside), Franck Cappello (Argonne National Laboratory)
Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator. Philip C. Roth (Oak Ridge National Laboratory), Sophie Blondel (University of Tennessee), David E. Bernholdt (Oak Ridge National Laboratory), Brian D. Wirth (University of Tennessee)
Interactive HPC Deep Learning with Jupyter Notebooks. Wahid Bhimji (Lawrence Berkeley National Laboratory), Steven Farrell (Lawrence Berkeley National Laboratory), Oliver Evans (Lawrence Berkeley National Laboratory), Matthew Henderson (Lawrence Berkeley National Laboratory), Shreyas Cholia (Lawrence Berkeley National Laboratory), Aaron Vose (Cray Inc), Mr Prabhat (Lawrence Berkeley National Laboratory), Rollin Thomas (Lawrence Berkeley National Laboratory), Richard Shane Canon (Lawrence Berkeley National Laboratory)
Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler. Georg Zitzlsberer (IT4Innovations, Czech Republic; Technical University of Ostrava, Czech Republic), Branislav Jansik (IT4Innovations, Czech Republic; Technical University of Ostrava, Czech Republic), Jan Martinovic (IT4Innovations, Czech Republic; Technical University of Ostrava, Czech Republic)
Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems. Igor Sfiligoi (General Atomics), Jeff Candy (General Atomics), Emily Belli (General Atomics)
Large Scale Computation of Quantiles Using MELISSA. Alejandro Ribes (EDF Research and Development), Théophile Terraz (French Institute for Research in Computer Science and Automation (INRIA)), Yvan Fournier (EDF Research and Development), Bertrand Iooss (EDF Research and Development), Bruno Raffin (French Institute for Research in Computer Science and Automation (INRIA))
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC. Bohumir Jelinek (Mississippi State University), George Mason (Mississippi State University), John Peters (Mississippi State University), Daniel Johnson (Mississippi State University), Marcus Brumfield (Mississippi State University), Alex Carrillo (US Army Engineer Research and Development Center), Clay Goodman (Mississippi State University), Farshid Vahedifard (Mississippi State University)
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning. Kenji Tanaka (Japan Telegraph and Telephone Corporation), Yuki Arikawa (Japan Telegraph and Telephone Corporation), Kenji Kawai (Japan Telegraph and Telephone Corporation), Junichi Kato (Japan Telegraph and Telephone Corporation), Tsuyoshi Ito (Japan Telegraph and Telephone Corporation), Huy Cu Ngo (Japan Telegraph and Telephone Corporation), Kazutaka Morita (Japan Telegraph and Telephone Corporation), Fumiaki Miura (Japan Telegraph and Telephone Corporation), Takeshi Sakamoto (Japan Telegraph and Telephone Corporation), Satoshi Shigematsu (Japan Telegraph and Telephone Corporation)
A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems. Mulya Agung (Tohoku University), Muhammad Alfian Amrizal (Tohoku University), Ryusuke Egawa (Tohoku University), Hiroyuki Takizawa (Tohoku University)
A Low-Communicaton Method to Solve Poisson's Equation on Locally-Structured Grids. Brian Van Straalen (Lawrence Berkeley National Laboratory), Peter McCorquodale (Lawrence Berkeley National Laboratory), Phil Colella (Lawrence Berkeley National Laboratory), Christos Kavouklis (Lawrence Livermore National Laboratory)
Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling. Changnian Han (Stony Brook University), Prachi Gupta (Stony Brook University), Peng Zhang (Stony Brook University), Danny Bluestein (Stony Brook University), Yuefan Deng (Stony Brook University)
Making Sense of Scientific Simulation Ensembles. Mai Dahshan (Virginia Tech), Nicholas Polys (Virginia Tech)
A Massively Parallel Evolutionary Markov Chain Monte Carlo Algorithm for Sampling Complicated Multimodal State SpacesState. Wendy K. Tam Cho (University of Illinois), Yan Liu (University of Illinois)
Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation. Mikito Furuichi (Japan Agency for Marine-Earth Science and Technology), Daisuke Nishiura (Japan Agency for Marine-Earth Science and Technology), Takane Hori (Japan Agency for Marine-Earth Science and Technology)
MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines. Ahmad Abdelfattah (University of Tennessee), Jack Dongarra (University of Tennessee), Stanimire Tomov (University of Tennessee), Ichitaro Yamazaki (University of Tennessee), Azzam Haidar (Nvidia Corporation)
MGRIT Preconditioned Krylov Subspace Method. Ryo Yoda (Kogakuin University), Akihiro Fujii (Kogakuin University), Teruo Tanaka (Kogakuin University)
MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines. Abdul Dakkak (University of Illinois), Cheng Li (University of Illinois), Wen-mei Hwu (University of Illinois), Jinjun Xiong (IBM)
MPI/OpenMP parallelization of the Fragment Molecular Orbitals Method in GAMESS. Vladimir Mironov (Lomonosov Moscow State University), Yuri Alexeev (Argonne National Laboratory), Dmitri Fedorov (National Institute of Advanced Industrial Science and Technology (AIST))
Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems. Yue Zhu (Florida State University), Fahim Chowdhury (Florida State University), Huansong Fu (Florida State University), Adam Moody (Lawrence Livermore National Laboratory), Kathryn Mohror (Lawrence Livermore National Laboratory), Kento Sato (Lawrence Livermore National Laboratory), Weikuan Yu (Florida State University)
Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers. Takateru Yamagishi (Research Organization for Information Science and Technology, Japan), Yoshimasa Matsumura (University of Tokyo), Hiroyasu Hasumi (University of Tokyo)
The NAStJA Framework: Non-Collective Scalable Global Communications. Marco Berghoff (Karlsruhe Institute of Technology), Ivan Kondov (Karlsruhe Institute of Technology)
OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing. Seyong Lee (Oak Ridge National Laboratory), Jacob Lambert (University of Oregon), Jungwon Kim (Oak Ridge National Laboratory), Jeffrey S. Vetter (Oak Ridge National Laboratory), Allen D. Malony (University of Oregon)
OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine. Bryce Kroencke (American River College), Shawn Shacterman (University of California, Berkeley), Nicholas Pavini (American River College), Benjamin Samudio (American River College, Sierra College), Silvia Crivelli (Lawrence Berkeley National Laboratory)
Optimization of Ultrasound Simulations on Multi-GPU Servers. Filip Vaverka (Brno University of Technology, Faculty of Information Technology), Matej Spetko (Brno University of Technology, Faculty of Information Technology), Bradley E. Treeby (University College London, Biomedical Ultrasound Group), Jiri Jaros (Brno University of Technology, Faculty of Information Technology)
Optimizing Next Generation Hydrodynamics Code for Exascale Systems. Dana Akhmetova (KTH Royal Institute of Technology), Sumathi Lakshmiranganatha (University of Wyoming), Diptajyoti Mukherjee (Allegheny College), Frederick Oullet (University of Florida), Patrick Payne (Los Alamos National Laboratory), Nicholas Stegmeier (South Dakota State University), Christoph Junghans (Los Alamos National Laboratory), Robert Pavel (Los Alamos National Laboratory), Vinay Ramakrishnaiah (Los Alamos National Laboratory)
Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU. Yaoguang Zhai (University of California, San Diego), Nathaniel Danandeh (University of California, San Diego), Zhenye Tan (University of California, San Diego; Tongji University), Sicun Gao (University of California, San Diego), Francesco Paesani (University of California, San Diego), Andreas W. Goetz (San Diego Supercomputer Center)
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks. Yidong Xia (Idaho National Laboratory), Ansel Blumers (Brown University, Idaho National Laboratory), Zhen Li (Brown University), Lixiang Luo (IBM), Jan Goral (University of Utah), Matthew Andrew (Carl Zeiss X-ray Microscopy Inc), Joshua Kane (Idaho National Laboratory), Yu-Hang Tang (Lawrence Berkeley National Laboratory)
Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining. Xuewen Cui (Virginia Tech), Thomas R. W. Scogland (Lawrence Livermore National Laboratory), Bronis R. de Supinski (Lawrence Livermore National Laboratory), Wu Feng (Virginia Tech)
Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices. Takeshi Fukaya (Hokkaido University), Ramaseshan Kannan (Arup UK), Yuji Nakatsukasa (National Institute of Informatics, Japan), Yusaku Yamamoto (University of Electro-Communications, Japan), Yuka Yanagisawa (Waseda University)
Portable Parallel Performance via Multi-Dimensional Homomorphisms. Ari Rasch (University of Münster), Richard Schulze (University of Münster), Sergei Gorlatch (University of Münster)
Processing-in-Storage Architecture for Machine Learning and Bioinformatics. Roman Kaplan (Israel Institute of Technology), Leonid Yavits (Israel Institute of Technology), Ran Ginosar (Israel Institute of Technology)
Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems. Mehmet E. Belviranli (Oak Ridge National Laboratory), Seyong Lee (Oak Ridge National Laboratory), Jeffrey S. Vetter (Oak Ridge National Laboratory)
Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation. Rached Abdelkhalak (King Abdullah University of Science and Technology), Kadir Akbudak (King Abdullah University of Science and Technology), Vincent Etienne (Saudi Aramco), Thierry Tonellot (Saudi Aramco)
Refactoring and Optimizing Multiphysics Combustion Models for Data Parallelism. Christopher Stone (US Department of Defense HPC Modernization Program, Engility Corporation), Alexei Poludnenko (Texas A&M University), Brian Taylor (US Air Force Research Laboratory)
Reproducibility as Side Effect. Shu Wang (University of Chicago), Zhuo Zhen (University of Chicago), Jason Anderson (University of Chicago), Kate Keahey (Argonne National Laboratory, University of Chicago)
RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish. Elham Hojati (Texas Tech University), Yong Chen (Texas Tech University), Alan Sill (Texas Tech University), Jon Hass (Dell Inc)
SciGaP: Apache Airavata Hosted Science Gateways. Marlon Pierce (Indiana University), Suresh Marru (Indiana University), Eroma Abeysinghe (Indiana University), Sudhakar Pamidighantam (Indiana University), Marcus Christie (Indiana University), Dimuthu Upeksha (Indiana University)
Script of Scripts Polyglot Notebook and Workflow System. Gao Wang (University of Chicago), Man Chong Leong (Rice University), Bo Peng (University of Texas, MD Anderson Cancer Center)
Sol: Transparent Neural Network Acceleration Platform. Nicolas Weber (NEC Laboratories Europe, NEC Corporation)
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption. Zhimin Li (University of Utah), Harshitha Menon (Lawrence Livermore National Laboratory), Yarden Livnat (University of Utah), Kathryn Mohror (Lawrence Livermore National Laboratory), Valerio Pascucci (University of Utah)
Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations. Alexander Breuer (University of California, San Diego), Alexander Heinecke (Intel Corporation), Yifeng Cui (San Diego Supercomputer Center)
Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels. Michael Anderson (Intel Corporation), Evangelos Georganas (Intel Corporation), Sasikanth Avancha (Intel Corporation), Alexander Heinecke (Intel Corporation)
Top-Down Performance Analysis of Workflow Applications. Christian Herold (Technical University Dresden), Bill Williams (Technical University Dresden)
Toward Smoothing Data Movement Between RAM and Storage. Tariq Alturkestani (King Abdullah University of Science and Technology), Thierry Tonellot (Saudi Aramco), Vincent Etienne (Saudi Aramco), Hatem Ltaief (King Abdullah University of Science and Technology)
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter. Izaak B. Beekman (ParaTools Inc), Nicholas Chaimov (ParaTools Inc), Sameer Shende (ParaTools Inc, University of Oregon), Allen D. Malony (ParaTools Inc, University of Oregon), Nicholas Bisek (US Air Force Research Laboratory), Ryan Gosse (US Air Force Research Laboratory), Andrew Wissink (Create AV, US Army)
Understanding Potential Performance Issues Using Resource-Based alongside Time Models. Nan ding (Lawrence Berkeley National Laboratory), Victor W. Lee (Intel Corporation), Wei Xue (Tsinghua University), Weimin Zheng (Tsinghua University)
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes. Scott B. Baden (Lawrence Berkeley National Laboratory), Paul H. Hargrove (Lawrence Berkeley National Laboratory), Hadia Ahmed (Lawrence Berkeley National Laboratory), John Bachan (Lawrence Berkeley National Laboratory), Dan Bonachea (Lawrence Berkeley National Laboratory), Steven Hofmeyr (Lawrence Berkeley National Laboratory), Mathias Jacquelin (Lawrence Berkeley National Laboratory), Amir Kamil (Lawrence Berkeley National Laboratory), Brian van Straalen (Lawrence Berkeley National Laboratory)
Using Darshan and CODES to Evaluate Application I/O Performance. Harsh Khetawat (North Carolina State University), Christopher Zimmer (Oak Ridge National Laboratory), Frank Mueller (North Carolina State University), Sudharshan Vazhkudai (Oak Ridge National Laboratory), Scott Atchley (Oak Ridge National Laboratory)
Using Thrill to Process Scientific Data on HPC. Mariia Karabin (Clemson University, Los Alamos National Laboratory), Xinyu Chen (University of New Mexico), Supreeth Suresh (University of Wyoming), Ivo Jimenez (University of California, Santa Cruz), Li-Ta Lo (Los Alamos National Laboratory), Pascal Grosset (Los Alamos National Laboratory)
VeloC: Very Low Overhead Checkpointing System. Bogdan Nicolae (Argonne National Laboratory), Franck Cappello (Argonne National Laboratory), Adam Moody (Lawrence Livermore National Laboratory), Elsa Gonsiorowski (Lawrence Livermore National Laboratory), Kathryn Mohror (Lawrence Livermore National Laboratory)
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators. Maxence Thevenet (Lawrence Berkeley National Laboratory), Jean-Luc Vay (Lawrence Berkeley National Laboratory), Ann Almgren (Lawrence Berkeley National Laboratory), John Bell (Lawrence Berkeley National Laboratory), Remi Lehe (Lawrence Berkeley National Laboratory), Andrew Myers (Lawrence Berkeley National Laboratory), Jaehong Park (Lawrence Berkeley National Laboratory), Olga Shapoval (Lawrence Berkeley National Laboratory), Weiqun Zhang (Lawrence Berkeley National Laboratory), Lixin Ge (SLAC National Accelerator Laboratory), Mark Hogan (SLAC National Accelerator Laboratory), Cho Ng (SLAC National Accelerator Laboratory), Dave Grote (Lawrence Livermore National Laboratory)
Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?. Martin Kronbichler (Technical University Munich), Momme Allalen (Leibniz Supercomputing Centre), Martin Ohlerich (Leibniz Supercomputing Centre), Wolfgang A. Wall (Technical University Munich)
Workflow for Parallel Processing of Sequential Mesh Databases. Ondřej Meca (Technical University of Ostrava, Czech Republic), Lubomír Říha (Technical University of Ostrava, Czech Republic), Tomáš Brzobohatý (Technical University of Ostrava, Czech Republic)