Presenter Index - SC18

A

Tanuj K. Aasawat

University of British Columbia

Scale-Free Graph Processing on a NUMA Machine

Sandia National Laboratories

Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications

Ahmad Abdelfattah

University of Tennessee

MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines

Rached Abdelkhalak

King Abdullah University of Science and Technology

Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation

Lawrence Livermore National Laboratory

Data Analytics for System and Facility Energy Management

IBM Zurich Research Laboratory

Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration

Eroma Abeysinghe

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

University of Texas

Texas Advanced Computing Center

The First Water in the Universe

University of Queensland

Energy Efficiency Modeling of Parallel Applications

Jean-Thomas Acquaviva

DataDirect Networks

International HPC Certification Program

Toward a HPC Certification Program

Power Aware Heterogeneous Node Assembly

Adedoyin Adetokunbo

Los Alamos National Laboratory

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Md Musabbir Adnan

University of Tennessee

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

University of Illinois

Kennedy Award Presentation - Memory Consistency Models: They Are Broken and Why We Should Care

Netherlands eScience Center

Data Archiving and Networked Services (DANS)

Sustaining Research Software

Deborah Agarwal

Lawrence Berkeley National Laboratory

Dac-Man: Data Change Management for Scientific Datasets on HPC Systems

University of California, Berkeley

SaNSA - the Supercomputer and Node State Architecture

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

Boston University

Panel: Open-Source Hardware

RV128 Instruction Set Architecture

Tohoku University

A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems

Muhammed Abdullah Al Ahad

KTH Royal Institute of Technology

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

Lawrence Berkeley National Laboratory

From Message Passing to PGAS

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Lawrence Livermore National Laboratory

Panel Discussion

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Los Alamos National Laboratory

In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization

Stanford University

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems

James B. Aimone

Sandia National Laboratories

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Brown University

Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

Jonathan Ajo-Franklin

Lawrence Berkeley National Laboratory

Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction

King Abdullah University of Science and Technology

Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation

KTH Royal Institute of Technology

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

Southeastern Universities Research Association (SURA)

Hot Topics Discussion I: Thriving at Work

Reda Al-Bahrani

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Fujitsu Laboratories of Europe Ltd.

DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping

University of Huddersfield

Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

Convergence between HPC and Big Data: The Day After Tomorrow

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities

Interactivity in HPC

Christie L. Alappat

University of Erlangen-Nuremberg

Erlangen Regional Computing Center

Applying the Execution-Cache-Memory Model: Current State of Practice

Recursive Algebraic Coloring Engine

Mohammed Alawad

Oak Ridge National Laboratory

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

Johannes Albert-von der Gönna

Leibniz Supercomputing Centre

Spack Community BoF

Nia Alexandrova

Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education

Fifth SC Workshop on Best Practices for HPC Training and Education

Vassil Alexandrov

Barcelona Supercomputing Center

Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Argonne National Laboratory

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

MPI/OpenMP parallelization of the Fragment Molecular Orbitals Method in GAMESS

Full State Quantum Circuit Simulation by Using Lossy Data Compression

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

Community Detection Across Emerging Quantum Architectures

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Hybrid Quantum-Classical Computing Architectures

Texas Tech University

Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios

Japan Atomic Energy Agency

Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers

Leibniz Supercomputing Centre

Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?

Mentor, a Siemens Business

Session 4: Using OpenACC

OpenCAPI Consortium

OpenCAPI: High Performance, Host-Agnostic, Coherent Accelerator Architecture and Ecosystem

Seeking Quantum Supremacy with Numerical Simulation

Lawrence Berkeley National Laboratory

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Mitchell Aloserij

University of Amsterdam

Tracking Network Flows with P4

San Diego Supercomputer Center

Data Science and HPC Education and Outreach

National Center for Atmospheric Research

Hybrid Theorem Proving as a Lightweight Method for Verifying Numerical Software

Tariq Alturkestani

King Abdullah University of Science and Technology

Toward Smoothing Data Movement Between RAM and Storage

Georgia Institute of Technology

School of Computational Science and Engineering

Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting

Parallel and Scalable Combinatorial String and Graph Algorithms on Distributed Memory Systems

Lluc Alvarez Marti

Barcelona Supercomputing Center

Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs

Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs

OpenMP: What’s Inside the Black Box?

University of Alberta

OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries

Barcelona Supercomputing Center

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Abdelhalim Amer

Argonne National Laboratory

Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading

MPICH: A High Performance Open-Source MPI Implementation

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

Christopher Amos

Baylor College of Medicine

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Muhammad Alfian Amrizal

Tohoku University

A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems

Jefferson Amstutz

Intel Corporation

libIS: A Lightweight Library for Flexible In Transit Visualization

George Amvrosiadis

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Rachana Ananthakrishnan

University of Chicago

National Research Infrastructure: Collaborative Session

National Center for Atmospheric Research

HPCSYSPROS18: Keynote

University of Chicago

Reproducibility as Side Effect

Kristin Anderson

Colder Products Company

The Case for Thermoplastic Quick Disconnects in Liquid Cooling

Michael Anderson

Intel Corporation

Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels

Puppet in HPC: Building on 10 Years of Practice

Georgios Andreadis

Delft University of Technology

Vrije University Amsterdam

A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments

Carl Zeiss X-ray Microscopy Inc

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Samuel Freitas Antao

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

Dartmouth Medical School

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Gabriel Antoniu

French Institute for Research in Computer Science and Automation (INRIA)

Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage

Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud

Lawrence Berkeley National Laboratory

Convergence between HPC and Big Data: The Day After Tomorrow

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Karlsruhe Institute of Technology

High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

Toshikazu Aoyama

NEC Corporation

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

David Appelhans

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

University of Washington

The Human Side of Data Science

Appentra Solutions

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Allison Armstrong

Igneous Systems Inc

Data Protection Solutions for ML/AI

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

University of California, Berkeley

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Dorian C. Arnold

Emory University

Invited Talk Session 6

Texas Advanced Computing Center

University of Texas

Toward Developing a Repository of Logical Errors Observed in Parallel Code for Teaching Code Correctness

Richard B. Arthur

General Electric Company

Computationally-Accelerated Engineering at GE: Physics + Deep Learning

Preferred Networks

Large Scale Deep Learning in PFN: from 15-Min Imagenet to PFDet

King Abdullah University of Science and Technology

Distributed Memory Fast Fourier Transforms in the Exascale Era

Rizwan A. Ashraf

Oak Ridge National Laboratory

Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer

University of Manchester

First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture

Texas State University

Hardware Acceleration of CNNs with Coherent FPGAs

Oak Ridge National Laboratory

GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Using Darshan and CODES to Evaluate Application I/O Performance

Boston University

Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance

University of Manchester

First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture

Nvidia Corporation

OpenMP GPU Offload in Flang and LLVM

SLURM User Group Meeting

French Institute for Research in Computer Science and Automation (INRIA)

University of Bordeaux

Scheduling for In-machine Analytics: Data Size Is Important

Lawrence Berkeley National Laboratory

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

Hewlett Packard Enterprise

Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Sasikanth Avancha

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels

University of Central Florida

Exploring Allocation Policies in Disaggregated Non-Volatile Memories

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Compiler and Runtime Based Parallelization and Optimization for GPUs

Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs

OpenMP: What’s Inside the Black Box?

B

Lawrence Berkeley National Laboratory

Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Lawrence Berkeley National Laboratory

From Message Passing to PGAS

Doomsday: Predicting Which Node Will Fail When on Supercomputers

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Georgia Institute of Technology

Convergence between HPC and Big Data: The Day After Tomorrow

17th Graph500 List

Technical University Munich

Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes

Barcelona Supercomputing Center

Big Data and Exascale Computing (BDEC2) Application Roundtable

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Non-Volatile Memory

European Open File System Association (EOFS)

LUSTRE Community BOF: Lustre in HPC and Emerging Data Markets: Roadmap, Features and Challenges

Stanford University

Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud

Beihang University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Anna Maria Bailey

Lawrence Livermore National Laboratory

Energy Efficient HPC Working Group

Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Energy Efficiency Considerations for HPC Procurements

High Performance Computing (HPC) Data Center Planning and TCO: A Case Study and Roadmap

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

INDIS Morning Keynote

Stephen J. Bailey

Lawrence Berkeley National Laboratory

Optimizing Python Data Processing for the DESI Experiment on the NERSC Cori Supercomputer

Intel Corporation

OpenHPC Community BoF

National Center for Atmospheric Research

Students@SC: Careers in Industry, Research Labs, and Academia

Panel Discussion

A Statistical Analysis of Compressed Climate Model Data

Biology Applications

Zachary K. Baker

Los Alamos National Laboratory

Accelerating the Signal Alignment Process in Time-Evolving Geometries Using Python

University of South Carolina

Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

Argonne National Laboratory

Characterization of MPI Usage on a Production Supercomputer

Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading

Runtime for Exascale and Beyond: Convergence or Divergence?

Navigating the SC Conference Technical Program Submission Process

MPICH: A High Performance Open-Source MPI Implementation

Advanced MPI Programming

Prasanna Balaprakash

Argonne National Laboratory

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

California Institute of Technology

Division of Physics, Mathematics and Astronomy

SDN for End-to-End Networked Science at the Exascale (SENSE)

Renaissance Computing Institute (RenCI)

Introduction - Innovating the Network for Data Intensive Science (INDIS)

Innovating the Network for Data Intensive Science (INDIS)

Marc Gamell Balmana

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

Illinois Institute of Technology

Student Cluster Competition Team Panel Presentation

Gábor Dániel Balogh

Pázmány Péter Catholic University, Hungary

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

Fabio Francisco Banchelli

Barcelona Supercomputing Center

Filling the Gap between Education and Industry: Evidence-Based Methods for Introducing Undergraduate Students to HPC

OpenMP: What’s Inside the Black Box?

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Purushotham V. Bangalore

University of Alabama, Birmingham

Introduction - Workshop on Exascale MPI (ExaMPI)

Workshop on Exascale MPI (ExaMPI)

XiDian University

Optimizing the Throughput of Storm-Based Stream Processing in Clouds

Ingrid Barcena Roig

KU Leuven, Belgium

The Business of HPC: TCO, Funding Models, Metrics, Value, and More

Procurement and Commissioning of HPC Systems

National Energy Research Scientific Computing Center (NERSC)

Lawrence Berkeley National Laboratory

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Deep Learning at Scale

Jaydeep Bardhan

GlaxoSmithKline

Career Development Panel

Md Abdullah Shahneous Bari

Stony Brook University

Is Data Placement Optimization Still Relevant on Newer GPUs?

Oak Ridge National Laboratory

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

The HPC Best Practices Webinar Series

Large Scale System Deployments

Pacific Northwest National Laboratory

Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations

Nationwide Children's Hospital

Introduction - Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Fourth Computational Approaches for Cancer Workshop (CAFCW18)

University of Texas

Texas Advanced Computing Center

The New NSF-Funded Resource: Frontera - Towards a Leadership Class Computing Facility

French Institute for Research in Computer Science and Automation (INRIA)

PARCOACH Extension for a Full-Interprocedural Collectives Verification

Andrea Bartolini

University of Bologna

DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Data Analytics for System and Facility Energy Management

Elisabeth Baseman

Los Alamos National Laboratory

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

Achim Basermann

German Aerospace Center

HPC Software Infrastructures at German Aerospace Center

HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making

Reservoir Labs Inc

Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke

University of Chicago

The Power of Storytelling: Exposing User Experiences and Lessons Learned to Inspire and Instruct Technology Adoption

Lawrence Livermore National Laboratory

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Lawrence Berkeley National Laboratory

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Energy Efficient HPC Working Group

Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

The Green 500: Trends in Energy Efficient Supercomputing

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Colorado School of Mines

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Gregory H. Bauer

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

Nvidia Corporation

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

North Carolina State University

Hybrid Theorem Proving as a Lightweight Method for Verifying Numerical Software

Leonardo Bautista-Gomez

Barcelona Supercomputing Center

Toward Ad Hoc Recovery For Soft Errors

Approximating a Multi-Grid Solver

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Ohio State University

Cooperative Rendezvous Protocols for Improved Performance and Overlap

Alexandre Bayen

University of California, Berkeley

High Performance Computing in Dynamic Traffic Simulation

Lawrence Livermore National Laboratory

Students@SC: Making the Best of Your HPC Education

Neelima Bayyapu

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

Julia Bazińska

University of Warsaw

Panel 4: Student Spotlight Presentation

Jonathan C. Beard

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

North Carolina State University

Floating-Point Autotuner for CPU-Based Mixed-Precision Applications

North Carolina State University

Efficient Deployment of Irregular Computations on Multi- and Many-Core Architectures

A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms

Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA

Floating-Point Autotuner for CPU-Based Mixed-Precision Applications

Gregory B. Becker

Lawrence Livermore National Laboratory

Managing HPC Software Complexity with Spack

Argonne National Laboratory

Artificial Intelligence at the Edge: How the Internet of Things and HPC Connect in the Computing Continuum

Fluminense Federal University, Brazil

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

Izaak B. Beekman

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

University of California, Santa Cruz

Geomancy: Automated Data Placement Optimization

Viking Enterprise Solutions

Cassandra in Dockers Deployment Using an NVMe Fabric

Oak Ridge National Laboratory

Spack Community BoF

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

General Atomics

Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems

Vicenç Beltran

Barcelona Supercomputing Center

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Mehmet E. Belviranli

Oak Ridge National Laboratory

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems

Deep500: An HPC Deep Learning Benchmark and Competition

DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control

Lawrence Livermore National Laboratory

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

IBM Research, UK

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

DataDirect Networks

The IO-500 and the Virtual Institute of I/O

Advanced Micro Devices Inc

Unified Communication X (UCX) Community

Florian Berberich

Partnership for Advanced Computing in Europe (PRACE)

Big Data Challenge - How to Engage with Large Scale Facilities?

Alexandre Bergel

University of Chile

Visual Analytics Challenges in Analyzing Calling Context Trees

Karlsruhe Institute of Technology

The NAStJA Framework: Non-Collective Scalable Global Communications

Non-Collective Scalable Global Network Based on Local Communications

Forschungszentrum Juelich

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

University of Texas

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

David E. Bernholdt

Oak Ridge National Laboratory

Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator

Software Engineering and Reuse in Computational Science and Engineering

The HPC Best Practices Webinar Series

Better Scientific Software

Oak Ridge National Laboratory

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Lawrence Berkeley National Laboratory

Python-Based In Situ Analysis and Visualization

SENSEI Cross-Platform View of In Situ Analytics

Blair Bethwaite

New Zealand eScience Infrastructure

Cloud Infrastructure Solutions To Run HPC Workloads

Abhinav Bhatele

Lawrence Livermore National Laboratory

Visual Analytics Challenges in Analyzing Calling Context Trees

Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

Panel Discussion

Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)

Students@SC: Careers in Industry, Research Labs, and Academia

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Sanjukta Bhowmick

University of Nebraska, Omaha

Doctoral Showcase III

Doctoral Showcase I

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

KTH Royal Institute of Technology

Distributed L-Shaped Algorithms in Julia

University of Texas

Institute for Computational Engineering and Sciences

Arctic Ocean-Sea Ice Interactions

University of California, Santa Barbara

Visualizing Outbursts of Massive Stars

Simon J. L. Billinge

Columbia University

Reproducibility for Streaming Analysis

Jay Jay Billings

Oak Ridge National Laboratory

Software Engineers: Careers in Research

Los Alamos National Laboratory

Effective Performance Portability

University of Texas

Distributed-Memory Hierarchical Compression of Dense SPD Matrices

Approximating for Faster, Better and Cheaper Scientific Computing

GPU-Accelerated Interpolation for 3D Image Registration

Prentice Bisbal

Princeton Plasma Physics Laboratory

Training Computational Scientists to Build and Package Code

US Air Force Research Laboratory

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

Los Alamos National Laboratory

A Flexible System For In Situ Triggers

In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization

US Department of Energy Office of Advanced Scientific Computing Research

Keynote: Perspectives on In Situ

Perspectives on Data Reduction from ASCR

University of Colorado

Stateless Provisioning: Modern Practice in HPC

Datacenter and Cooling Technologies

Robert Blackmore

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

University of California, Santa Barbara

Visualizing Outbursts of Massive Stars

Los Alamos National Laboratory

SaNSA - the Supercomputer and Node State Architecture

Improving Application Resilience by Extending Error Correction with Contextual Information

Arthur S. Bland

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Alexander Blass

University of Twente

Volume Renderings of Sheared Thermal Convection

University of Tennessee

Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator

Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

Danny Bluestein

Stony Brook University

Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling

Brown University

Idaho National Laboratory

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Matthias A. Blumrich

Nvidia Corporation

Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

Lawrence Livermore National Laboratory

Visual Analytics Challenges in Analyzing Calling Context Trees

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

Students@SC: HPC Research

University of Illinois

Charm++ and AMPI: Adaptive and Asynchronous Parallel Programming

National Institute of Standards and Technology

Federated Cloud: An Evolutionary Path from Grid Computing

Asia Supercomputer Community (ASC)

Panel 4: Asia Supercomputing Community: Profound Inspiration through Strong Competition

University of Tsukuba

Welcome and Introduction

Panel 3: Japan's HPC Program for System Development and Deployment toward Exascale

Benchmarking Scientific Reconfigurable / FPGA Computing

2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

Matthias Bollhöfer

Braunschweig University of Technology

Distributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures

Lawrence Berkeley National Laboratory

GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Northeastern University

Student Cluster Competition Team Panel Presentation

R. Christopher Bording

Introduction - 5th International Workshop on HPC User Support Tools: HUST-18

5th International Workshop on HPC User Support Tools: HUST-18

San Diego Supercomputer Center

University of California, San Diego

Computational Cosmology and Astrophysics on Adaptive Meshes Using Charm++

Andrea Borghesi

University of Bologna

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

University of Tennessee

Open MPI State of the Union 2018

Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice

Nader Boushehrinejadmoradi

Rutgers University

A Parallelism Profiler with What-If Analyses for OpenMP Programs

Aurelien Bouteiller

University of Tennessee

Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice

Gen-Z Consortium

The Data-Centric Future and Gen-Z's Next Generation Interconnect

United Technologies Corporation - Pratt & Whitney Division

High Performance Computing in the Cloud at United Technologies

Briana Bradshaw

University of Texas

Texas Advanced Computing Center

Arctic Ocean-Sea Ice Interactions

Sandia National Laboratories

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins

Leibniz Supercomputing Centre

OpenHPC Community BoF

Purdue University

Upcoming Events in the HPC Systems Professionals Community

Maximilian H. Bremer

University of Texas at Austin

Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations

Peer-Timo Bremer

Lawrence Livermore National Laboratory

A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing

University of Notre Dame

Compliant Cloud+Campus Hybrid HPC Infrastructure

Argonne National Laboratory

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

CANDLE Framework for Large Scale Deep Learning

Alexander Breuer

University of California, San Diego

Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations

US Department of Defense HPC Modernization Program

Deep Learning Evolutionary Optimization for Regression of Rotorcraft Vibrational Spectra

Georgia Institute of Technology

Upcoming Events in the HPC Systems Professionals Community

Workloads and Benchmarks for System Acquisition

Sandia National Laboratories

ExaMPI Invited Talk

Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing

MCHPC’18: Workshop on Memory Centric High Performance Computing

Extending On-Premise HPC to the Cloud

US Army Engineer Research and Development Center

Workloads and Benchmarks for System Acquisition

University of Illinois, Chicago

SAGE2 10th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration

University of Edinburgh

Panel: Open-Source Software

Driving Asynchronous Distributed Tasks with Events

HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making

Strategies for Inclusive and Scalable HPC Outreach and Education

Student Cluster Competition Team Panel Presentation

Marcus Brumfield

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Texas A&M University

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Atomic Energy and Alternative Energies Commission (CEA)

PARCOACH Extension for a Full-Interprocedural Collectives Verification

Oklahoma State University

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Univa Corporation

Enabling HPC and Deep Learning Workloads at Extreme Scale in the Cloud

Erik Brynjolfsson

Massachusetts Institute of Technology

Keynote: Explore How to Deploy the Unruly Power of Machine, Platform, and Crowd

Tomáš Brzobohatý

Technical University of Ostrava, Czech Republic

Workflow for Parallel Processing of Sequential Mesh Databases

Colorado State University

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Middle Tennessee State University

Energy-Aware Workflow Scheduling and Optimization in Clouds Using Bat Algorithm

Pittsburgh Supercomputing Center

Cloud Infrastructure Solutions To Run HPC Workloads

Reuben Budiardja

Oak Ridge National Laboratory

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

Rice University

A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

Linear Algebra Is the Right Way to Think About Graphs

HPC Graph Toolkits and the GraphBLAS Forum

Lawrence Berkeley National Laboratory

GraphBLAS Forum and Its Relevant Software Zoo

Peachy Introduction

Nvidia Corporation

Deep Learning by Doing: Nvidia Deep Learning Institute

Jeffery Bunting

NVXL Technology Inc

NVXL Acceleration Platform for Polymorphic Acceleration

University of Bologna

On Adam-Trained Models and a Parallel Method to Improve the Generalization Performance

Panel Discussion

Rogue Wave Software Inc

Advanced Technologies and Techniques for Debugging HPC Applications

Martin Burtscher

Texas State University

Computing a Movie of Zooming into a Fractal

PARLOT: Efficient Whole-Program Call Tracing for HPC Applications

Anastasiia Butko

Lawrence Berkeley National Laboratory

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

Gregory F. Butler

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

A Year in the Life of a Parallel File System

Anycast: Rootless Broadcasting with MPI

Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems

C

Hector Carrillo Cabada

University of New Mexico

Effective Performance Portability

Ruben M. Cabezón

University of Basel

Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs

Colorado State University

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Lawrence Berkeley National Laboratory

Python-Based In Situ Analysis and Visualization

US Department of Defense HPC Modernization Program

General Atomics

Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems

Richard Shane Canon

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Containers in HPC

Container Computing for HPC and Scientific Workflows

Matteo Cantiello

Flatiron Institute

Visualizing Outbursts of Massive Stars

University of Central Missouri

Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity

New Jersey Institute of Technology

Optimizing the Throughput of Storm-Based Stream Processing in Clouds

Franck Cappello

Argonne National Laboratory

Reconfigurable Computing for HPC: Will It Make It this Time?

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation

Full State Quantum Circuit Simulation by Using Lossy Data Compression

VeloC: Very Low Overhead Checkpointing System

Benchmarking Scientific Reconfigurable / FPGA Computing

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Compression for Scientific Data

Atomic Energy and Alternative Energies Commission (CEA)

PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows

Columbia University

Panel: Open-Source Hardware

How System-Level Design Can Benefit the Progress of Open-Source Hardware

Los Alamos National Laboratory

Performance Portability Challenges for Fortran Applications

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Toward Understanding I/O Behavior in HPC Workflows

A Year in the Life of a Parallel File System

Enabling Data Services for HPC

Analyzing Parallel I/O

Christopher D. Carothers

Rensselaer Polytechnic Institute

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps

Alexandra Carpen-Amarie

Fraunhofer Institute for Industrial Mathematics

Algorithm Selection of MPI Collectives Using Machine Learning Techniques

Patrick Carribault

Atomic Energy and Alternative Energies Commission (CEA)

PARCOACH Extension for a Full-Interprocedural Collectives Verification

US Army Engineer Research and Development Center

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Caroline Weilhamer

Indiana University

INDIS Showcases Panel: NRE and XNET and Architecture

University of Alabama

Software Engineering and Reuse in Computational Science and Engineering

University of Hawaii at Manoa

WRENCH: A Framework for Simulating Workflow Management Systems

SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation

Marc Casas Guix

Barcelona Supercomputing Center

Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs

Approximating a Multi-Grid Solver

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Intel Corporation

PMIx: Enabling Workflow Orchestration

Vito Giovanni Castellana

Pacific Northwest National Laboratory

Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Enabling High-Level Graph Processing via Dynamic Tasking

IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Bryan Catanzaro

Nvidia Corporation

Applying Deep Learning

Carlo Cavazzoni

Data Analytics for System and Facility Energy Management

Aurélien Cavelan

University of Basel

Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

Mohamad Chaarawi

Intel Corporation

Evaluation of HPC Application I/O on Object Storage Systems

Nicholas Chaimov

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

Venkatesan Chakaravarthy

High-Performance Dense Tucker Decomposition on GPU Clusters

Ohio State University

Cooperative Rendezvous Protocols for Improved Performance and Overlap

High Performance Middlewares for Next Generation Architectures: Challenges and Solutions

InfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing HEC Systems, and Usage

InfiniBand, Omni-Path, and High-Speed Ethernet for Beginners

Dhruva Chakravorty

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Bradford L. Chamberlain

Panel Discussion

Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

Chris Chambreau

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Lawrence Berkeley National Laboratory

Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations

Sunita Chandrasekaran

University of Delaware

5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks

Swiss Army Programming: Performance and Portability from Modern Tools

Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap

Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Chia Cheng Chang

Lawrence Berkeley National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

University of Texas

Evaluating and Accelerating High-Fidelity Error Injection for HPC

NVXL Technology Inc

NVXL Acceleration Platform for Polymorphic Acceleration

Barbara Chapman

Stony Brook University

Swiss Army Programming: Performance and Portability from Modern Tools

Is Data Placement Optimization Still Relevant on Newer GPUs?

University of Delaware

University of Tennessee

Introduction of Practical Approaches to Data Analytics for HPC with Spark

Prasanth Chatarasi

Georgia Institute of Technology

A Preliminary Study of Compiler Transformations for Graph Applications on the Emu System

Samit Chaudhuri

NVXL Technology Inc

NVXL Acceleration Platform for Polymorphic Acceleration

Thomas Cheatham

University of Utah

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

University of Central Arkansas

Eight Years Analysis of Adopting PDC in Data Structures at UCA

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Tsinghua University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Hsing-bung Chen

Los Alamos National Laboratory

Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance

Sandia National Laboratories

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

International Center for Advanced Internet Research (iCAIR)

Northwestern University

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Georgia Institute of Technology

A Unified Runtime for PGAS and Event-Driven Programming

Tsinghua University

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

University of Kentucky

Deep Learning by Doing: Nvidia Deep Learning Institute

Southern University of Science and Technology, China

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

University of New Mexico

Using Thrill to Process Scientific Data on HPC

Texas Tech University

Welcome, Workshop Goals, and Opening Remarks

Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish

Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems

Simulating Data Centers with Redfish-Enabled Equipment

HPCViz: Monitoring Health Status of High Performance Computing Systems

Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation

Hong Kong University of Science and Technology

Accelerating 2D FFT: Exploit GPU Tensor Cores through Mixed-Precision

National Tsing Hua University, Taiwan

Student Cluster Competition Team Panel Presentation

George Mason University

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Purdue University

Student Cluster Competition Team Panel Presentation

Nathanael Cheriere

Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage

Institute of Computational Mathematics and Mathematical Geophysics SB RAS

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

University of Toronto

ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

Ron Chi-Lung Chiang

University of St. Thomas

Contention-Aware Container Placement Strategy for Docker Swarm

KTH Royal Institute of Technology

Characterizing Deep-Learning I/O Workloads in TensorFlow

University of Alberta

OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries

University of Pittsburgh

Supporting Thorough Artifact Evaluation with Occam

J. Taylor Childers

Argonne National Laboratory

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

University of Oregon

A Flexible System For In Situ Triggers

Wendy K. Tam Cho

University of Illinois

A Massively Parallel Evolutionary Markov Chain Monte Carlo Algorithm for Sampling Complicated Multimodal State Spaces

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

High-Performance Dense Tucker Decomposition on GPU Clusters

Oak Ridge National Laboratory

Feature-Relevant Data Reduction for In Situ Workflows

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Frederic T. Chong

University of Chicago

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

Full State Quantum Circuit Simulation by Using Lossy Data Compression

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Hybrid Quantum-Classical Computing Architectures

Krzysztof Choromanski

Adaptive Anonymization of Data with b-Edge Covers

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

Georgia Institute of Technology

Accelerating Quantum Chemistry with Vectorized and Batched Integrals

Fahim Chowdhury

Florida State University

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

Blair Christian

Oak Ridge National Laboratory

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

Marcus Christie

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

Lawrence Livermore National Laboratory

Enabling Data Analytics Workflows Using Node-Local Storage

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Hong Kong Baptist University

GPGPU Performance Estimation with Core and Memory Frequency Scaling

Neil P. Chue Hong

Software Sustainability Institute

University of Edinburgh

Sustaining Research Software

Sudheer Chunduri

Argonne National Laboratory

Characterization of MPI Usage on a Production Supercomputer

A Cost-Effective Flexible System Optimized for DNN and ML

Vladimir Chupakhin

Janssen Pharmaceutika NV

HPC-as-a-Service for Life Sciences

Jet Propulsion Laboratory

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Florina M. Ciorba

University of Basel

Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations

Antonio Cisternino

University of Pisa

Applications of Deep Learning in Industry and Research

Nvidia Corporation

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

French Institute for Research in Computer Science and Automation (INRIA)

University of Strasbourg

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Beverly Clayton

Pittsburgh Supercomputing Center

SC: The Conference

Igneous Systems Inc

Data Protection Solutions for ML/AI

Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)

University of Missouri, St Louis

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Douglas D. Cline

Lockheed Martin Aeronautics Company

Challenges and Solutions in the Industrial Application of HPC at Lockheed Martin

Texas Advanced Computing Center

Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

Valeriu Codreanu

Fast and Accurate Training of an AI Radiologist

Large Minibatch Training on Supercomputers with Improved Accuracy and Reduced Time to Train

Henrique Colao Zanúz

Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG

In-Transit Molecular Dynamics Analysis with Apache Flink

Maureen Colbert

Dartmouth Medical School

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Lawrence Berkeley National Laboratory

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

A Low-Communication Method to Solve Poisson's Equation on Locally-Structured Grids

Oak Ridge National Laboratory

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

Amazon Web Services

The Difference Between HPC on Premises and in the Cloud

What Would You Do with a Million Cores of Compute Capacity?

Nicholson Collier

Argonne National Laboratory

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Appentra Solutions

Developing Workplace Resilience and Managing Stress

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

Introduction - Women in HPC: Diversifying the HPC Community

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.

Women in HPC: Diversifying the HPC Community

Reservoir Labs Inc

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference

On Adam-Trained Models and a Parallel Method to Improve the Generalization Performance

Colorado College

Building a Low Budget Cluster Through Hardware Reuse

Giuseppe Congiu

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

Lawrence Berkeley National Laboratory

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Sandia National Laboratories

Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications

New Mexico State University

Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications

Julita Corbalan

Barcelona Supercomputing Center

Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa

Boston University

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance

Alexandru Costan

IRISA, INSA Rennes

Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud

J. Eric Coulter

Indiana University

Programmable Education Infrastructure: Cloud Resources as HPC Education Environments

University College London

Personalized Medicine and HPC

Intel Corporation

OpenMP® 5.0 Is Here: Find Out All the Things You Need to Know About It!

LLVM in HPC: What's New?

Texas State University

High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem

Charles D. Cranor

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Francesco Cremonesi

Swiss Federal Institute of Technology in Lausanne

Applying the Execution-Cache-Memory Model: Current State of Practice

Daniel Crichton

Jet Propulsion Laboratory

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Silvia Crivelli

Lawrence Berkeley National Laboratory

OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine

Capsule Networks for Protein Structure Classification

Clara E. Cromey

University of Arizona

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

Julian Cuevas Paniagua

Lawrence Berkeley National Laboratory

University of Puerto Rico at Mayaguez

Capsule Networks for Protein Structure Classification

Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining

San Diego Supercomputer Center

Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations

Christine Cuicchi

US Department of Defense

Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.

Volunteer Opportunities for SC Conference Planning

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

University of California, Berkeley

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Massimiliano Culpo

Swiss Federal Institute of Technology in Lausanne

Managing HPC Software Complexity with Spack

Sandia National Laboratories

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Stony Brook University

OpenSHMEM in the Era of Exascale

Maciej Cytowski

Pawsey Supercomputing Centre

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

HPC Education and Training: An Australian Perspective

D

Barcelona Supercomputing Center

Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa

University of Texas

Hot Topics Discussion I: Thriving at Work

Developing Workplace Resilience and Managing Stress

Making Sense of Scientific Simulation Ensembles

University of North Carolina, Charlotte

Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

University of Illinois

MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines

Lawrence Berkeley National Laboratory

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

Laboratory for Physical Sciences at University of Maryland

Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Nathaniel Danandeh

University of California, San Diego

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

University of Illinois

Fast and Generic Concurrent Message-Passing

Texas Tech University

Visualizing Multidimensional Health Status of Data Centers

HPCViz: Monitoring Health Status of High Performance Computing Systems

Frederica Darema

United States Air Force

Federated Cloud: An Evolutionary Path from Grid Computing

Energy Sciences Network (ESnet)

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

North Carolina State University

Doomsday: Predicting Which Node Will Fail When on Supercomputers

Holistic Root Cause Analysis of Node Failures in Production HPC

University of Warwick

Optimizing Machine Learning on Apache Spark in HPC Environments

Bigstream Networks

Accelerating Intelligence

Joshua H. Davis

University of Delaware

Studying the Impact of Power Capping on MapReduce-Based, Data-Intensive Mini-Applications on Intel KNL and KNM Architectures

Rutgers University

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Johannes de Fine Licht

Productive Parallel Programming for FPGA with High-Level Synthesis

Maarten V. de Hoop

Rice University

Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver

Lawrence Berkeley National Laboratory

Quantum Computing for Scientific Applications

University of Amsterdam

Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners

Mix-and-Match: A Model-Driven Runtime Optimization Strategy for BFS on GPUs

University of Grenoble

CPU Overheating Characterization in HPC Systems: a Case Study

Bronis R. de Supinski

Lawrence Livermore National Laboratory

Energy Efficiency Modeling of Parallel Applications

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining

Mastering Tasking with OpenMP

Advanced OpenMP: Host Performance and 5.0 Features

Resource Management and Interference

Winston-Salem State University

Introduction - Workshop on Education for High Performance Computing (EduHPC)

An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level

Nathan Debardeleben

Los Alamos National Laboratory

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

SaNSA - the Supercomputer and Node State Architecture

Improving Application Resilience by Extending Error Correction with Contextual Information

Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Tivan: A Scalable Data Collection and Analytics Cluster

Bert Debusschere

Sandia National Laboratories

Presenting / Communication

CRE218 – Plenary II

University of Southern California, Information Sciences Institute

Enabling Data Analytics Workflows Using Node-Local Storage

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Davide Del Vento

National Center for Atmospheric Research

AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries

Robert L. DeLeon

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

University of Amsterdam

Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners

Fermi National Accelerator Laboratory

Computing Division

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Gökalp Demirci

University of Chicago

A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints

University of California, Berkeley

Correctness of Floating Point Programs - Exception Handling and Reproducibility

Tencent Holdings Ltd

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Stony Brook University

Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling

University of Erlangen-Nuremberg

Student Cluster Competition Team Panel Presentation

Nvidia Corporation

Light-Weight Protocols for Wire-Speed Ordering

Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage

Oak Ridge National Laboratory

Clacc: Translating OpenACC to OpenMP in Clang

Clemson University

Using CloudLab as a Scalable Platform for Teaching Cluster Computing

Energy Efficiency Modeling of Parallel Applications

General Motors Company

Lawrence Berkeley National Laboratory

“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Exascale Deep Learning for Climate Analytics

A Case Study for Performance Portability Using OpenMP 4.5

TAE Technologies

Data Fusion for Nuclear Fusion – Using HPC To Put a Star in a Bottle

Data Reduction Challenges in Coordinated Simulation and Experimental Fusion Science

Texas State University

PARLOT: Efficient Whole-Program Call Tracing for HPC Applications

Hariharan Devarajan

Illinois Institute of Technology

Hermes: a Multi-Tiered Distributed I/O Buffering System for HDF5

University of Texas

Data Science and HPC Education and Outreach

TCHPC Career Panel

Salvatore Di Girolamo

SimFS: A Simulation Data Virtualizing File System Interface

Diana Di Luccio

Parthenope University of Naples

DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything

Oxford Thermofluids Institute

University of Oxford

Software Prefetching for Unstructured Mesh Applications

Argonne National Laboratory

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation

Full State Quantum Circuit Simulation by Using Lossy Data Compression

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Gerrett Diamond

Rensselaer Polytechnic Institute

Dynamic Load Balancing of Plasma and Flow Simulations

University of Warwick

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Louisiana State University

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)

Asynchronous Execution of Python Code on Task Based Runtime Systems

Carnegie Mellon University

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

University of Central Missouri

Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity

Lawrence Berkeley National Laboratory

Understanding Potential Performance Issues Using Resource-Based alongside Time Models

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

University of Queensland

Energy Efficiency Modeling of Parallel Applications

Alexander Ditter

University of Erlangen-Nuremberg

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration

University of Southern California, Information Sciences Institute

Enabling Data Analytics Workflows Using Node-Local Storage

Lawrence Berkeley National Laboratory

P3HPC Session 1 Panel Discussion

Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Tokyo Institute of Technology

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

University of Massachusetts, Boston

Student Cluster Competition Team Panel Presentation

Lawrence Berkeley National Laboratory

Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction

University of Tennessee

Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers

Approximating for Faster, Better and Cheaper Scientific Computing

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines

Batched, Reproducible, and Reduced Precision BLAS

HPCG Benchmark Update

TOP500 Supercomputers

Big Data and Exascale Computing (BDEC2) Application Roundtable

Invited Talk Session 5

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Lawrence Berkeley National Laboratory

Closing Remarks

Opening Remarks

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

4th Workshop for Open Source Supercomputing (OpenSuCo)

University of Texas

Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins

Matthieu Dorier

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Red Oak Consulting

OpenHPC Community BoF

Colorado State University

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Erik W. Draeger

Lawrence Livermore National Laboratory

Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries

Physics and Tensor Applications

Purdue University

Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps

Maurizio Drocco

Pacific Northwest National Laboratory

Enabling High-Level Graph Processing via Dynamic Tasking

University of Illinois

Lawrence Livermore National Laboratory

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Kristof Du Bois

Intel Corporation

Many-Core Graph Workload Analysis

Rutgers University

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Argonne National Laboratory

An Application Perspective on Programming Models for the Future

Keynote: Better Scientific Software (BSSw)

Panel Discussion

Sustaining Research Software

Better Scientific Software

University of Versailles

Design of Data Management for Multi-SPMD Workflow Programming Model

University of Washington

Pacific Northwest National Laboratory

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Earl P.N. Duque

Intelligent Light

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

Los Alamos National Laboratory

A Flexible System For In Situ Triggers

In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization

E

Jonathan Eastep

Intel Corporation

Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack

Nvidia Corporation

Light-Weight Protocols for Wire-Speed Ordering

University of Tennessee

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

Maui High Performance Computing Center

OpenMP Common Core: a “Hands-On” Exploration

Stratos Efstathiadis

New York University

Third Annual Meeting of the SIGHPC - Big Data Chapter

Aryan Eftekhari

University of Lugano

Distributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

Tohoku University

A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems

University of Erlangen-Nuremberg

Erlangen Regional Computing Center

Applying the Execution-Cache-Memory Model: Current State of Practice

Barcelona Supercomputing Center

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Samer El Haj Mahmoud

Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective

Kaoutar El Maghraoui

Developing Workplace Resilience and Managing Stress

Tarek El-Ghazawi

George Washington University

Productive Data Locality Optimizations in Distributed Memory

Mohamad S. El-Zein

Deere & Company

Emergence of Tools - a Competitive Advantage at John Deere

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

University of Illinois

Scalable Non-Blocking Krylov Solvers for Extreme-Scale Computing

Sally Ellingson

University of Kentucky

Building Lasting and Effective Mentoring Relationships

Sandia National Laboratories

Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing

Purdue University

Best Practices from Organizations on Improving Workplace Diversity

Daniel Ellsworth

Colorado College

Building a Low Budget Cluster Through Hardware Reuse

Lawrence Livermore National Laboratory

Is Data Placement Optimization Still Relevant on Newer GPUs?

Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling

Science and Technology Facilities Council, UK

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

University of Queensland

Energy Efficiency Modeling of Parallel Applications

Christian Engelmann

Oak Ridge National Laboratory

A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform

Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer

Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

University of Texas

Evaluating and Accelerating High-Fidelity Error Injection for HPC

Steven Eschrich

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

University of New Mexico

PDC Curriculum Update

Vincent Etienne

Toward Smoothing Data Movement Between RAM and Storage

Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation

Sandia National Laboratories

Verifying Qthreads: Is Model Checking Viable for User Level Tasking Runtimes?

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Intel Corporation

Many-Core Graph Workload Analysis

Matthew A. Ezell

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

F

Rice University

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Kjiersten Fagnan

Lawrence Berkeley National Laboratory

US Department of Energy Joint Genome Institute

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Managing the Convergence of HPC and AI

Lawrence Berkeley National Laboratory

Efficient Application of Low Mach Number Hydrodynamics Code to Stellar Flows

Alessandro Fanfarillo

National Center for Atmospheric Research

AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries

Northwest University, China

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Amin Farmahini-Farahani

Advanced Micro Devices Inc

Challenges of High-Capacity DRAM Stacks and Potential Directions

Muhammad Nufail Farooqi

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Deep Learning at NERSC: Usability, Capability, and Everything in Between

Deep Learning at Scale

Massimiliano Fatica

Nvidia Corporation

Exascale Deep Learning for Climate Analytics

Farzad Fatollahi-Fard

Lawrence Berkeley National Laboratory

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

4th Workshop for Open Source Supercomputing (OpenSuCo)

Considering the Development Workflow to Achieve Reproducibility with Variation

Swiss National Supercomputing Centre

Volume Renderings of Sheared Thermal Convection

National Institute of Advanced Industrial Science and Technology (AIST)

MPI/OpenMP Parallelization of the Fragment Molecular Orbitals Method in GAMESS

Carleton College

A Statistical Analysis of Compressed Climate Model Data

Pacific Northwest National Laboratory

CView and NWPerf for Supercomputer Performance Collection and Display

Shengzhong Feng

Shenzhen Institutes of Advanced Technology

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining

The Green 500: Trends in Energy Efficient Supercomputing

Pacific Northwest National Laboratory

Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Enabling High-Level Graph Processing via Dynamic Tasking

HPC Graph Toolkits and the GraphBLAS Forum

IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Purdue University

Adaptive Anonymization of Data with b-Edge Covers

Michael Ferguson

Chapel Aggregation Library (CAL)

Hewlett Packard Enterprise

HPC in Space: An Update on Spaceborne Computer after 1+ Year on the ISS

Milinda Fernando

University of Utah

Dendro-GR: Massively Parallel Simulations of Binary Black Hole Intermediate-Mass-Ratio Inspirals

Mauricio H. Ferrato

University of Delaware

Estimating Molecular Dynamics Chemical Shift with GPUs

Rafael Ferreira da Silva

University of Southern California

Introduction - WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale Science

WRENCH: A Framework for Simulating Workflow Management Systems

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Kurt B. Ferreira

Sandia National Laboratories

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

Mentor-Protégé Informational Session

Building Lasting and Effective Mentoring Relationships

Argonne National Laboratory

University of Chicago

Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

libIS: A Lightweight Library for Flexible In Transit Visualization

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

François Févotte

EDF Research and Development

Debugging and Optimization of HPC Programs in Mixed Precision with the Verrou Tool

University of Münster

Unified Cross-Platform Profiling of Parallel C++ Applications

Chris Fietkiewicz

Case Western Reserve University

Potential Influence of Prior Experience in an Undergraduate-Graduate Level HPC Course

Fast and Accurate Training of an AI Radiologist

Weronika Filinger

University of Edinburgh

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

International HPC Certification Program

Strategies for Inclusive and Scalable HPC Outreach and Education

Toward a HPC Certification Program

The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training

Women in HPC: Diversifying the HPC Community

Salvatore Filippone

Cranfield University

Panel Discussion

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

Argonne National Laboratory

Workshop Lunch (on your own)

Workshop Afternoon Break

Workshop Morning Break

LLVM-HPC2018: Final Discussion

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

Introduction - LLVM-HPC2018: The Fifth Workshop on the LLVM Compiler Infrastructure in HPC

Full State Quantum Circuit Simulation by Using Lossy Data Compression

LLVM in HPC: What's New?

Distributed and Heterogeneous Programming in C++ for HPC 2018

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Hybrid Quantum-Classical Computing Architectures

User-Directed Loop-Transformations in Clang

Indiana University

Programmable Education Infrastructure: Cloud Resources as HPC Education Environments

University of Chicago

The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects

Justin Fletcher

US Air Force Research Laboratory

Technical University of Valencia

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Georgia Institute of Technology

Parallel and Scalable Combinatorial String and Graph Algorithms on Distributed Memory Systems

Fernanda Foertter

Nvidia Corporation

Approximating for Faster, Better and Cheaper Scientific Computing

University of Kassel

Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS

Félix-Antoine Fortin

Laval University

Panel: Interactivity in Supercomputing

Evaluation of HPC Application I/O on Object Storage Systems

University of Texas

Texas Advanced Computing Center

Arctic Ocean-Sea Ice Interactions

Argonne National Laboratory

Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

Introduction - Deep Learning on Supercomputers - Welcome

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

University of California, Davis

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

EDF Research and Development

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

Large Scale Computation of Quantiles Using MELISSA

Indiana University

Big Data and Exascale Computing (BDEC2) Application Roundtable

Massachusetts Institute of Technology

Feature-Relevant Data Reduction for In Situ Workflows

Manaurae Francisquez

Princeton Plasma Physics Laboratory

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

Arrell Food Institute

Global Food Security

HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges

Melyssa Fratkin

University of Texas

Achieving Performance on Large-Scale Intel Xeon-Based Systems

Lawrence Berkeley National Laboratory

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Joshua B. Fryman

Intel Corporation

Many-Core Graph Workload Analysis

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Florida State University

Enabling Efficient Data Infrastructure and Analytics on HPC Systems

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

University of North Texas

Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance

Winston-Salem State University

An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level

University of California, Irvine

Using Integrated Processor Graphics to Accelerate Concurrent Data and Index Structures

Kogakuin University

MGRIT Preconditioned Krylov Subspace Method

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Masahiro Fujita

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Hokkaido University

Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices

Ceph Applications in HPC Environments

University of Manchester

Brain-Inspired Massively-Parallel Computing

Thomas R. Furlani

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

cTuning Foundation

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Mikito Furuichi

Japan Agency for Marine-Earth Science and Technology

Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation

G

Joerg Gablonsky

The Enterprise HPC Service at Boeing

University of Illinois

The First Water in the Universe

Sanjaya Gajurel

Case Western Reserve University

Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images

Krell Institute

How to Analyze the Performance of Parallel Codes 101

Engility Corporation

Managing Python in HPC Environments

Brian Gallagher

Lawrence Livermore National Laboratory

Enabling Data Analytics Workflows Using Node-Local Storage

Jean-Mathieu Gallard

Technical University Munich

Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes

Steven M. Gallo

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Lawrence Livermore National Laboratory

Lawrence Berkeley National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Lawrence Livermore National Laboratory

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Spack Community BoF

Managing HPC Software Complexity with Spack

Tsinghua University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Gregory R. Ganger

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Microsoft Corporation

Data and Storage

Wilfried N. Gansterer

University of Vienna

Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Nvidia Corporation

Cross-Layer Group Regularization for Deep Neural Network Pruning

University of California, San Diego

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

Intel Corporation

Function/Kernel Vectorization via Loop Vectorizer

Michael Garland

Nvidia Corporation

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs

Lawrence Livermore National Laboratory

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

Derek R. Gaston

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Rahulkumar Gayatri

Lawrence Berkeley National Laboratory

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

A Case Study for Performance Portability Using OpenMP 4.5

J. Michael Gaziano

Harvard Medical School

Complex Phenomics in the MVP

SLAC National Accelerator Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Assefaw Gebremedhin

Washington State University

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Boston University

Energy Efficiency of Reconfigurable Caches on FPGAs

Binarized ImageNet Inference in 29us

Lawrence Berkeley National Laboratory

National Energy Research Scientific Computing Center (NERSC)

Spectral Analysis: Building an LGBTQIA+ Community in Scientific Computing

Sandia National Laboratories

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Evangelos Georganas

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Extreme Scale De Novo Metagenome Assembly

Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels

The Next Wave of HPC in the Datacenter

Serban Georgescu

Fujitsu Laboratories of Europe Ltd.

DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping

Goethe University Frankfurt

Toward a HPC Certification Program

German Aerospace Center

HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Exascale Archiving - Challenges and Opportunities

University of Notre Dame

Sustaining Research Software

Introduction - WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale Science

Noushin Ghaffari

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

Tennessee Technological University

PDC Curriculum Update

Yanzan Gharaibeh

Case Western Reserve University

Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images

Texas Tech University

HPCViz: Monitoring Health Status of High Performance Computing Systems

University of California, Berkeley

GPU-Accelerated Interpolation for 3D Image Registration

Washington State University

Scalable Methods for Genome Assembly

Washington State University

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

Soumyadip Ghosh

University of Notre Dame

Event-Triggered Communication in Parallel Computing

Devarshi Ghoshal

Lawrence Berkeley National Laboratory

Dac-Man: Data Change Management for Scientific Datasets on HPC Systems

Lawrence Berkeley National Laboratory

Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks

Garth A. Gibson

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

Rice University

Hardware Transactional Persistent Memory

Boston University

SC: The Conference

Versity Software Inc

Exascale Archiving - Challenges and Opportunities

Mercedes Gimeno-Segovia

Quantum Communication Networks and Technologies

Israel Institute of Technology

In-Memory Accelerator Architectures for Machine Learning and Bioinformatics

Processing-in-Storage Architecture for Machine Learning and Bioinformatics

Nvidia Corporation

Swiss Army Programming: Performance and Portability from Modern Tools

Rensselaer Polytechnic Institute

Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps

Texas Tech University

Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs

Lewis & Clark College

Jupyter Notebooks and User-Friendly HPC Access

Madeleine Glick

Columbia University

Photonic Interconnects for Extreme Scale Computing

Next-Generation Networking

Texas A&M University

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

University of Alberta

OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries

William A. Goddard III

California Institute of Technology

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Michael Goesele

Graphics, Capture and Massively Parallel Computing

Technical University Darmstadt

A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs

Andreas W. Goetz

San Diego Supercomputer Center

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

French Institute for Research in Computer Science and Automation (INRIA)

University of Bordeaux

Scheduling for In-machine Analytics: Data Size Is Important

Lawrence Livermore National Laboratory

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

Opportunities for Extreme Heterogeneity in High Performance Architectures

Reconfigurable Computing for HPC: Will It Make It this Time?

Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing

MCHPC’18: Workshop on Memory Centric High Performance Computing

Daniel Goldberg

University of Edinburgh

A Study on Checkpoints Compression for Adjoint Computation

Robin Goldstone

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

File Systems: Data Movement and Provenance

Eugene Goltsman

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

University of California, Berkeley

High Performance Computing in Dynamic Traffic Simulation

Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners

An Efficient SIMD Implementation of Pseudo-Verlet Lists for Neighbor Interactions in Particle-Based Codes

Elsa Gonsiorowski

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Introduction - 5th International Workshop on HPC User Support Tools: HUST-18

Introduction - Women in HPC: Diversifying the HPC Community

VeloC: Very Low Overhead Checkpointing System

Jaime González Cuevas

Appentra Solutions

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle

Arturo Gonzalez-Escribano

University of Valladolid

Storms of High-Energy Particles: An Assignment for OpenMP, MPI, and CUDA/OpenCL

Massachusetts Green High Performance Computing Center

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Ganesh Gopalakrishnan

University of Utah

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

Making Formal Methods for HPC Disappear

Facilitating the Adoption of Correctness Tools in HPC Applications

PARLOT: Efficient Whole-Program Call Tracing for HPC Applications

University of Utah

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Sergei Gorlatch

University of Münster

Portable Parallel Performance via Multi-Dimensional Homomorphisms

Unified Cross-Platform Profiling of Parallel C++ Applications

US Air Force Research Laboratory

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

Karlsruhe Institute of Technology

Machine Learning-Aided Numerical Linear Algebra: Convolutional Neural Networks for the Efficient Preconditioner Generation

Duke University

Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries

National Oceanic and Atmospheric Administration

Purpose-Built HPC: Last Hope for Earth System Prediction?

High Performance Computing Center Stuttgart

Pros and Cons of HPCx benchmarks

Mellanox Technologies

Heterogeneous Systems and the Road to Exascale for HPC and AI

University of Stuttgart, Visualization Research Center

Visual Analytics Challenges in Analyzing Calling Context Trees

Oak Ridge National Laboratory

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Sandia National Laboratories

Introduction - Workshop on Exascale MPI (ExaMPI)

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Workshop on Exascale MPI (ExaMPI)

University of Texas, Dallas

NautDB: Toward a Hybrid Runtime for Processing Compiled Queries

Christopher Green

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Data-Parallel Python for High Energy Physics Analyses

Los Alamos National Laboratory

How to Analyze the Performance of Parallel Codes 101

Georgia Institute of Technology

A Fast and Simple Approach to Merge and Merge Sorting Using Wide Vector Instructions

Los Alamos National Laboratory

SaNSA - the Supercomputer and Node State Architecture

Tivan: A Scalable Data Collection and Analytics Cluster

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Los Alamos National Laboratory

Scaling Embedded In Situ Indexing with DeltaFS

Exascale Archiving - Challenges and Opportunities

Lawrence Livermore National Laboratory

Visual Analytics Challenges in Analyzing Calling Context Trees

Andrew Grimshaw

University of Virginia

Invited Talk: The Campus Compute Cooperative Project as an Alternative to Commercial Clouds

Federated Cloud: An Evolutionary Path from Grid Computing

Leopold Grinberg

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Lawrence Livermore National Laboratory

Flux: Overcoming Scheduling Challenges for Exascale Workflows

University of Illinois

SC: The Conference

Scalable Non-Blocking Krylov Solvers for Extreme-Scale Computing

Software Engineering and Reuse in Computational Science and Engineering

Advanced MPI Programming

Los Alamos National Laboratory

Using Thrill to Process Scientific Data on HPC

The BP Data Science Sandbox

A Unified Runtime for PGAS and Event-Driven Programming

A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java

Robert L. Grossman

University of Chicago

The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects

University of Amsterdam

Introduction - Innovating the Network for Data Intensive Science (INDIS)

Tracking Network Flows with P4

Innovating the Network for Data Intensive Science (INDIS)

Lawrence Livermore National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Applications of Deep Learning in Industry and Research

University of Toronto

InfiniBand In-Network Computing Technology and Roadmap

Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center

Thomas Grützmacher

Karlsruhe Institute of Technology

High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

North Carolina State University

Floating-Point Autotuner for CPU-Based Mixed-Precision Applications

Middle Tennessee State University

Energy-Aware Workflow Scheduling and Optimization in Clouds Using Bat Algorithm

Rice University

Dynamic Data Race Detection for OpenMP Programs

North Carolina State University

Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines

Kent State University

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Aditya Gudibanda

Reservoir Labs Inc

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference

Fluminense Federal University, Brazil

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

Shashank Gugnani

Ohio State University

Accelerating Big Data Processing in the Cloud with Scalable Communication and I/O Schemes

Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning

Vineet Gundecha

Fast and Accurate Training of an AI Radiologist

Lawrence Berkeley National Laboratory

Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Los Alamos National Laboratory

Scaling Embedded In Situ Indexing with DeltaFS

University of California, Merced

FlipTracker: Understanding Natural Error Resilience in HPC Applications

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

Lawrence Berkeley National Laboratory

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

PDC Curriculum Update

Stony Brook University

Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling

University of Notre Dame

Event-Triggered Communication in Parallel Computing

Sudhanva Gurumurthi

Advanced Micro Devices Inc

Challenges of High-Capacity DRAM Stacks and Potential Directions

National University of Singapore

Open-Source Supercomputing

Julian Gutierrez

Northeastern University

Employing Student Retention Strategies for an Introductory GPU Programming Course

Optimization of an Image Processing Algorithm: Histogram Equalization

Samuel K. Gutiérrez

Los Alamos National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Ethan D Gutmann

National Center for Atmospheric Research

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Gregory S. Gutmann

Tokyo Institute of Technology

Deep Learning by Doing: Nvidia Deep Learning Institute

Attila Gyulassy

University of Utah

A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing

H

University of Illinois

National Center for Supercomputing Applications

Programmable Interactive Visualization of a Core-Collapse Supernova Simulation

Oxford Thermofluids Institute

University of Oxford

Software Prefetching for Unstructured Mesh Applications

King Abdullah University of Science and Technology

Convergence between HPC and Big Data: The Day After Tomorrow

Intel Corporation

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

MCHPC'18 Morning Keynote: Converging Storage and Memory

University of Erlangen-Nuremberg

Erlangen Regional Computing Center

Applying the Execution-Cache-Memory Model: Current State of Practice

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

Node-Level Performance Engineering

Christoph Hagleitner

Scalable FPGA Deployments for HPC and DC Applications

Application Porting and Optimization on GPU-Accelerated POWER Architectures

University of Tennessee

Innovative Computing Laboratory

Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers

MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines

Princeton Plasma Physics Laboratory

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

Rafif Akila Hakim

Telkom University, Indonesia

Student Cluster Competition Team Panel Presentation

Mahantesh Halappanavar

Pacific Northwest National Laboratory

Adaptive Anonymization of Data with b-Edge Covers

HPC Graph Toolkits and the GraphBLAS Forum

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

University of British Columbia

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

University of Utah

A Renaissance for Domain-Specific Languages, Compilers and Code Generators for HPC and Big Data

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

Test of Time Award Presentation

Numerical Algorithms Group

The Business of HPC: TCO, Funding Models, Metrics, Value, and More

Procurement and Commissioning of HPC Systems

Kathleen E. Hamilton

Oak Ridge National Laboratory

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

University of Erlangen-Nuremberg

OoO Instruction Benchmarking Framework on the Back of Dragons

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

Dorit Hammerling

National Center for Atmospheric Research

A Statistical Analysis of Compressed Climate Model Data

Gregory Hammett

Princeton Plasma Physics Laboratory

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

Intel Corporation

Learning to Lead in HPC - Strategies to Start Your Leadership Journey

Mentor-Protégé Informational Session

Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability, and Productivity

Simon D. Hammond

Sandia National Laboratories

Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Exploring Allocation Policies in Disaggregated Non-Volatile Memories

Stony Brook University

Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling

Deep Learning at Scale on Nvidia V100 Accelerators

Monash University

Student Cluster Competition Team Panel Presentation

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Toshihiro Hanawa

University of Tokyo

Energy Efficiency Considerations for HPC Procurements

Indiana University

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Nanyang Technological University, Singapore

Student Cluster Competition Team Panel Presentation

Daniel Harborne

Cardiff University

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Lawrence Berkeley National Laboratory

Doomsday: Predicting Which Node Will Fail When on Supercomputers

GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Siva Kumar Sastry Hari

Nvidia Corporation

Optimizing Software-Directed Instruction Replication for GPU Error Detection

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Characterization of MPI Usage on a Production Supercomputer

Stephen Lien Harrell

Purdue University

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)

Effective Performance Portability

Plasma Meets Portability: A Journey to Performance Portability in a Particle-in-Cell Code

Christopher Harrison

University of Wisconsin

Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)

Lawrence Livermore National Laboratory

Enabling Data Analytics Workflows Using Node-Local Storage

A Flexible System For In Situ Triggers

Institute for Disease Modeling at Intellectual Ventures

HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges

Rebecca Hartman-Baker

Lawrence Berkeley National Laboratory

Students@SC: Making the Best of Your HPC Education

Women in HPC: the Importance of Male Allies

The HPC Best Practices Webinar Series

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Christine Harvey

MITRE Corporation

Volunteer Opportunities for SC Conference Planning

Niranjan Hasabnis

Intel Corporation

Auto-Tuning TensorFlow Threading Model for CPU Backend

Ohio State University

Cooperative Rendezvous Protocols for Improved Performance and Overlap

Designing Shared Address Space MPI Libraries in Many-Core Era

Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective

RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish

Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios

Hiroyasu Hasumi

University of Tokyo

Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers

Imagica Digitalscape

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

German Aerospace Center

Software Engineering and Reuse in Computational Science and Engineering

Akihiro Hayashi

Rice University

A Unified Runtime for PGAS and Event-Driven Programming

Kobe University

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Wuhan University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Carnegie Mellon University

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Lawrence Berkeley National Laboratory

OpenMP Common Core: a “Hands-On” Exploration

Ronnie Hedgepeth

US Department of Defense HPC Modernization Program

Patrick Heimbach

University of Texas

Institute for Computational Engineering and Sciences

Arctic Ocean-Sea Ice Interactions

Alexander Heinecke

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels

Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations

Imperial College, London

Large-Scale Clustering Using MPI-Based Canopy

Intel Corporation

Many-Core Graph Workload Analysis

Louisiana State University

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)

Matthew Henderson

Lawrence Berkeley National Laboratory

Interactive HPC Deep Learning with Jupyter Notebooks

Bruce Hendrickson

Lawrence Livermore National Laboratory

Students@SC Keynote: Livin’ on the Edge: Thoughts on Careers in High Performance Computing

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Robert Henschel

Indiana University

OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap

University of Edinburgh

The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training

Invited Talk: Resource Control at Facebook

University of Tennessee

Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice

Microsoft Corporation

HPC in the Cloud

Stephen Herbein

Lawrence Livermore National Laboratory

Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Introduction of Practical Approaches to Data Analytics for HPC with Spark

Martin Herbordt

Boston University

A Novel Approach to Supporting Communicators for In-Switch Processing of MPI Collectives

Energy Efficiency of Reconfigurable Caches on FPGAs

Binarized ImageNet Inference in 29us

Benchmarking Scientific Reconfigurable / FPGA Computing

SimBSP: Enabling RTL Simulation for Intel FPGA OpenCL Kernels

Atomic Weapons Establishment (AWE), UK

Performance Portability of an Unstructured Hydrodynamics Mini-Application

KTH Royal Institute of Technology

Characterizing Deep-Learning I/O Workloads in TensorFlow

Marc-André Hermanns

Forschungszentrum Juelich

Visual Analytics Challenges in Analyzing Calling Context Trees

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

Oscar Hernandez

Oak Ridge National Laboratory

OpenSHMEM in the Era of Exascale

Christian Herold

Technical University Dresden

Top-Down Performance Analysis of Workflow Applications

Michael A. Heroux

Sandia National Laboratories

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Software Engineering and Reuse in Computational Science and Engineering

HPCG Benchmark Update

Navigating the SC Conference Technical Program Submission Process

Better Scientific Software

Juelich Supercomputing Centre

Application Porting and Optimization on GPU-Accelerated POWER Architectures

Introduction - Innovating the Network for Data Intensive Science (INDIS)

University of Wisconsin

Secure Coding Practices and Automated Assessment Tools

Students@SC: Making the Best of Your HPC Education

Los Alamos National Laboratory

Energy Efficiency Considerations for HPC Procurements

University of Huddersfield

Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks

Nicholas Higham

University of Manchester

School of Mathematics

Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers

Christopher Hill

Massachusetts Institute of Technology

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

University of Amsterdam

Tracking Network Flows with P4

Elizabett Hillery

Purdue University

Best Practices from Organizations on Improving Workplace Diversity

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

University of Hamburg

Toward a HPC Certification Program

Naveen Himthani

University of Texas

Institute for Computational Engineering and Sciences

GPU-Accelerated Interpolation for 3D Image Registration

Oak Ridge National Laboratory

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

University of Tokyo

Pros and Cons of HPCx benchmarks

Jeffrey Hittinger

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Lawrence Berkeley National Laboratory

Carnegie Mellon University

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Los Alamos National Laboratory

The First Water in the Universe

Torsten Hoefler

Communication with the Reader

Reconfigurable Computing for HPC: Will It Make It this Time?

Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

High Level Programming Languages for Quantum Computation

Deep500: An HPC Deep Learning Benchmark and Competition

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Productive Parallel Programming for FPGA with High-Level Synthesis

Advanced MPI Programming

University of Chicago

A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints

Hybrid Quantum-Classical Computing Architectures

Johannes Hofmann

University of Erlangen-Nuremberg

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

SLAC National Accelerator Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Markus Höhnerbach

RWTH Aachen University

PotC: Many-Body Potential Implementations à La Carte

Texas Tech University

RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish

Jeffrey K. Hollingsworth

University of Maryland

Career Development Panel

Sandia National Laboratories

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

University of Edinburgh

Heterogeneous Systems and the Road to Exascale for HPC and AI

Introduction - Workshop on Exascale MPI (ExaMPI)

Workshop on Exascale MPI (ExaMPI)

University of Huddersfield

Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks

Carissa Holohan

Argonne National Laboratory

Hot Topics Discussion II: Thriving at Work

Spectral Analysis: Building an LGBTQIA+ Community in Scientific Computing

University of Colorado

Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Valentin Honore

University of Bordeaux

French Institute for Research in Computer Science and Automation (INRIA)

Scheduling for In-machine Analytics: Data Size Is Important

Hans-Christian Hoppe

Intel Corporation

Multi-Level Memory and Storage for HPC and Data Analytics

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Japan Agency for Marine-Earth Science and Technology

Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation

University of Erlangen-Nuremberg

Erlangen Regional Computing Center

Applying the Execution-Cache-Memory Model: Current State of Practice

Ghent University

Getting Scientific Software Installed

Northwest University, China

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Optimizing the Throughput of Storm-Based Stream Processing in Clouds

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

A Study on Checkpoints Compression for Adjoint Computation

Michael Houston

Nvidia Corporation

Introduction - Machine Learning in HPC Environments

Exascale Deep Learning for Climate Analytics

Argonne National Laboratory

A Study on Checkpoints Compression for Adjoint Computation

Stony Brook University

Los Alamos National Laboratory

Challenges of Performance Portability for Fortran Unstructured Mesh Codes

Effective Performance Portability

Performance Portability Challenges for Fortran Applications

George Washington University

TriCore: Parallel Triangle Counting on GPUs

George Washington University

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

TriCore: Parallel Triangle Counting on GPUs

Algorithms on Sparse Data

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Georgia Institute of Technology

Accelerating Quantum Chemistry with Vectorized and Batched Integrals

Texas A&M University

Incremental Static Race Detection in OpenMP Programs

University of Texas

OOOPS: An Innovative Tool for IO Workload Management on Supercomputers

Hong Kong University of Science and Technology

SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition

Nathanael Hübbe

University of Hamburg

Toward a HPC Certification Program

Leibniz Supercomputing Centre

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Energy Efficiency Considerations for HPC Procurements

French Institute for Research in Computer Science and Automation (INRIA)

PARCOACH Extension for a Full-Interprocedural Collectives Verification

Alexander Hück

Technical University Darmstadt

Scientific Computing

Compiler-Aided Type Tracking for Correctness Checking of MPI Applications

University of Oregon

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

Asynchronous Execution of Python Code on Task Based Runtime Systems

Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)

Enabling Data Services for HPC

Argonne National Laboratory

Keynote: Better Scientific Software (BSSw)

Sandia National Laboratories

Exploring Allocation Policies in Disaggregated Non-Volatile Memories

Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective

Oak Ridge National Laboratory

A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform

Arista Networks

The Difference Between HPC on Premises and in the Cloud

Christian Hundt

Johannes Gutenberg University Mainz

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Technical University Wien

Algorithm Selection of MPI Collectives Using Machine Learning Techniques

Keystone Initiative for Network Based Education and Research

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Intel Corporation

Many-Core Graph Workload Analysis

Micron Technology Inc

Accelerate Machine Learning with High Performance Memory

PMIx: Enabling Workflow Orchestration

University of Pittsburgh

Partial Redundancy in HPC Systems with Non-Uniform Node Reliabilities

Lance Hutchinson

Sandia National Laboratories

INDIS Showcases Panel: NRE and XNET and Architecture

University of Illinois

MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines

I

Roman Iakymchuk

KTH Royal Institute of Technology

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

Lawrence Berkeley National Laboratory

Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

Quantum Computing for Scientific Applications

Tsuyoshi Ichimura

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Yasuhiro Idomura

Japan Atomic Energy Agency

Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Mike Ignatowski

Advanced Micro Devices Inc

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

Challenges of High-Capacity DRAM Stacks and Potential Directions

NEC Corporation

Next Generation Vector Supercomputer

Aleksandar Ilic

INESC-ID, Portugal

Performance Tuning of Scientific Codes with the Roofline Model

Two Sigma Investments LP

A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java

Toshiyuki Imamura

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Japan Atomic Energy Agency

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Frank Indiviglio

National Oceanic and Atmospheric Administration

Managing Python in HPC Environments

Martins D. Innus

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Joseph A. Insley

Argonne National Laboratory

Northern Illinois University

Visualizing Outbursts of Massive Stars

libIS: A Lightweight Library for Flexible In Transit Visualization

Latchesar Ionkov

Los Alamos National Laboratory

Heterogeneous Memory and Arena-Based Heap Allocation

EDF Research and Development

Large Scale Computation of Quantiles Using MELISSA

Alexandru Iosup

Vrije University Amsterdam

Delft University of Technology

A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments

Winston-Salem State University

An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level

Katherine E. Isaacs

University of Arizona

Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)

Youhei Ishihara

Kyoto University

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Yutaka Ishikawa

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Western Washington University

Automatic Generation of Mixed-Precision Programs

Tohoku University

NEC Corporation

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Shintaro Iwasaki

University of Tokyo

Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading

Chander J. Iyer

Rensselaer Polytechnic Institute

Yahoo! Research

Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps

J

Christiane Jablonowski

University of Michigan

Parallel Computing 101

University of Maryland

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

MCHPC'18 Afternoon Keynote: All Tomorrow’s Memory Systems

Lawrence Livermore National Laboratory

Scalable Deep Ensemble Learning for Cancer Drug Discovery

Intel Corporation

Effective Performance Portability

Daniel Jacobson

Oak Ridge National Laboratory

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Mathias Jacquelin

Lawrence Berkeley National Laboratory

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

Lawrence Livermore National Laboratory

Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

Versailles Saint-Quentin-en-Yvelines University

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

Siddhartha Jana

Intel Corporation

HPC PowerStack: a community-wide open collaboration for enabling system-wide power efficiency

Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Branislav Jansik

IT4Innovations, Czech Republic

Technical University of Ostrava, Czech Republic

Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler

Brno University of Technology

Faculty of Information Technology

Optimization of Ultrasound Simulations on Multi-GPU Servers

Stephen A. Jarvis

University of Warwick

Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Optimizing Machine Learning on Apache Spark in HPC Environments

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Ali Javadi-Abhari

Quantum Computing for Scientific Applications

Nina Jeliazkova

IDEAconsult Ltd, Bulgaria

HPC-as-a-Service for Life Sciences

Bohumir Jelinek

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Pacific Northwest National Laboratory

Chapel Aggregation Library (CAL)

Michael Jennings

Los Alamos National Laboratory

Containers in HPC

Cloud Infrastructure Solutions To Run HPC Workloads

Grzegorz Jereczek

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Elizabeth Jessup

University of Colorado

Invited Talk Session 1

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

SLURM User Group Meeting

Brookhaven National Laboratory, Rutgers University

Clouds and Distributed Computing

George Washington University

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

Lawrence Livermore National Laboratory

Enabling Data Analytics Workflows Using Node-Local Storage

Nvidia Corporation

Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage

University of California, Santa Barbara

Visualizing Outbursts of Massive Stars

Los Alamos National Laboratory

Performance Portability Challenges for Fortran Applications

University of California, Santa Cruz

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Using Thrill to Process Scientific Data on HPC

Spotting Black Swans With Ease: The Case for a Practical Reproducibility Platform

Semantically Organized Containers for Reproducible Research

University of Queensland

Energy Efficiency Modeling of Parallel Applications

Reservoir Labs Inc

Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke

AI Matrix – Synthetic Benchmarks for DNN

Los Alamos National Laboratory

Improving Application Resilience by Extending Error Correction with Contextual Information

Lawrence Berkeley National Laboratory

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

Mikael Johansson

KTH Royal Institute of Technology

Distributed L-Shaped Algorithms in Julia

San Diego State University

Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression

University of Utah

The Age of Data - Visualizing the Revolution

Christopher Johnson

University of Utah

Learning to Lead in HPC - Strategies to Start Your Leadership Journey

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Australian National University

AIWC: OpenCL-Based Architecture Independent Workload Characterization

Centre for High Performance Computing, South Africa

Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Strategies for Inclusive and Scalable HPC Outreach and Education

J. Travis Johnston

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Introduction of Practical Approaches to Data Analytics for HPC with Spark

Kean University

The Wave Equation as a Motivating Example for High Performance Computing

Barcelona Supercomputing Center

Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa

Numerical Algorithms Group

The Business of HPC: TCO, Funding Models, Metrics, Value, and More

Procurement and Commissioning of HPC Systems

Catherine Jones

Science and Technology Facilities Council, UK

UK Research and Innovation

Software Engineers: Careers in Research

Matthew D. Jones

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Timothy M. Jones

University of Cambridge

Software Prefetching for Unstructured Mesh Applications

William M. Jones

Coastal Carolina University

Improving Application Resilience by Extending Error Correction with Contextual Information

Thomas Jefferson National Accelerator Facility

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Astrophysics Applications

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Guido Juckeland

Helmholtz-Zentrum Dresden-Rossendorf

5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks

Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

University of Notre Dame

Compliant Cloud+Campus Hybrid HPC Infrastructure

Massachusetts Institute of Technology

University of Chicago

Measuring Swampiness: Quantifying Chaos in Large Heterogeneous Data Repositories

Christoph Junghans

Los Alamos National Laboratory

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

University of Maryland

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

Eulerian Algorithms for the Discretization of Plasma Kinetic Equations

Yale University

US Department of Veterans Affairs

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

K

Mozhgan Kabiri Chimeh

University of Sheffield

FLAME GPU: Complex System Simulation Framework

Developing Workplace Resilience and Managing Stress

Northeastern University

PRISM: Predicting Resilience of GPU Applications Using Statistical Methods

Employing Student Retention Strategies for an Introductory GPU Programming Course

Optimization of an Image Processing Algorithm: Histogram Equalization

Asian Technology Information Program

Welcome and Introduction

Introduction - 2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Oak Ridge National Laboratory

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Louisiana State University

Runtime for Exascale and Beyond: Convergence or Divergence?

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)

Asynchronous Execution of Python Code on Task Based Runtime Systems

Dhiraj Kalamkar

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Laxmikant Kalé

University of Illinois

Charm++ and AMPI: Adaptive and Asynchronous Parallel Programming

Laxmikant (Sanjay) Kale

University of Illinois

Parallel Programming Models for the Extreme Scale Era

Exascale Challenges in Across-Node Parallelism for Languages and Runtimes

Runtime for Exascale and Beyond: Convergence or Divergence?

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Sergei V. Kalinin

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Kristy A. Kallback-Rose

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

Northeastern University

PRISM: Predicting Resilience of GPU Applications Using Statistical Methods

Ananth Kalyanaraman

Washington State University

Scalable Methods for Genome Assembly

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

Railway Technical Research Institute, Japan

Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis

Taming Datacenter Thermodynamics with Lenovo Neptune Technology

Lawrence Berkeley National Laboratory

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

Yoshito Kanamori

University of Alaska, Anchorage

Stochastic Computing on Quantum Gates

Idaho National Laboratory

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Northwestern University

Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies

Ramaseshan Kannan

Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices

Temple University

PDC Curriculum Update

Israel Institute of Technology

In-Memory Accelerator Architectures for Machine Learning and Bioinformatics

Processing-in-Storage Architecture for Machine Learning and Bioinformatics

Accelerating DNA Long Read Mapping with Emerging Technologies

Clemson University

Los Alamos National Laboratory

Using Thrill to Process Scientific Data on HPC

Vasileios Karakasis

Swiss National Supercomputing Centre

ReFrame: A Regression Testing and Continuous Integration Framework for HPC Systems

Georgia Institute of Technology

Modeling Single-Source Shortest Path Algorithm Dynamics to Control Performance and Power Tradeoffs

Sagar Karandikar

University of California, Berkeley

Panel: Open-Source Hardware

FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud

Deepthi Karkada

Intel Corporation

Training Speech Recognition Models on HPC Infrastructure

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Exploring Application Performance on Fat-Tree Networks in the Presence of Congestion

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Technical University of Denmark

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Thomas P. Karnowski

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

National Center for Supercomputing Applications

University of Illinois

Understanding Software Sustainability: Learning from Parsl and Other Projects

Sustaining Research Software

Software Engineering and Reuse in Computational Science and Engineering

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Christos Kavouklis

Lawrence Livermore National Laboratory

A Low-Communication Method to Solve Poisson's Equation on Locally-Structured Grids

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Tomohiro Kawanabe

Riken Center for Computational Science

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Engin Kayraklioglu

George Washington University

Productive Data Locality Optimizations in Distributed Memory

Yoshii Kazutomo

Argonne National Laboratory

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Argonne National Laboratory

Dynamically Negotiating Capacity Between On-Demand and Batch Clusters

INDIS Afternoon Keynote

Reproducibility as Side Effect

Stephen W. Keckler

Nvidia Corporation

Optimizing Software-Directed Instruction Replication for GPU Error Detection

Christopher Keefe

Northern Arizona University

Pathogen and Microbiome Institute

Enabling Reproducible Microbiome Science through Decentralized Provenance Tracking in QIIME 2

Kimberly Keeton

Hewlett Packard Enterprise

Panel Discussion

Wake Forest University

Student Cluster Competition Team Panel Presentation

Barcelona Supercomputing Center

Toward Ad Hoc Recovery For Soft Errors

University of Texas

Evaluating and Accelerating High-Fidelity Error Injection for HPC

Jet Propulsion Laboratory

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Keynote 2: HPC and AI as Drivers for Industrial Engagement

Icahn School of Medicine at Mount Sinai

Population Genetics and Computation in the Area of Precision Medicine

Los Alamos National Laboratory

Comparing Deep Learning with Quantum Inference on The D-Wave 2X

Reconfigurable Computing for HPC: Will It Make It this Time?

Oak Ridge National Laboratory

Characterization of the Impact of Soft Errors on Iterative Methods

Rajkumar Kettimuthu

Argonne National Laboratory

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Fraunhofer Institute for Industrial Mathematics

Introduction - Machine Learning in HPC Environments

Massachusetts Institute of Technology

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

Atomic Weapons Establishment (AWE), UK

Performance Portability of an Unstructured Hydrodynamics Mini-Application

King Abdullah University of Science and Technology

Panel 2: Arabia's Leap into the Cyber Era

Keynote 3: Hierarchical Algorithms on Hierarchical Architectures

National Institute of Standards and Technology

Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)

Computational Reproducibility at Exascale 2018 (CRE2018)

M. Garda Khadafi

Telkom University, Indonesia

Student Cluster Competition Team Panel Presentation

Pacific Northwest National Laboratory

Adaptive Anonymization of Data with b-Edge Covers

Texas Tech University

Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs

Alireza Kheirkhahan

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

North Carolina State University

Using Darshan and CODES to Evaluate Application I/O Performance

Duke University School of Medicine

The Role of Computing in Predictive and Precision Oncology

University of Michigan

Cloud Infrastructure Solutions To Run HPC Workloads

Introduction to Kubernetes

Brown University

Effective Performance Portability

Korea Advanced Institute of Science and Technology

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Oak Ridge National Laboratory

OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing

Implementing Efficient Data Compression and Encryption in a Persistent Key-Value Store for HPC

East Stroudsburg University of Pennsylvania

Introducing Three Basic Concepts in Parallel Computation to 1st Year Computer Science Students in a Simple and Effective Way

Yasuyuki Kimura

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Heather Kincaid

Jet Propulsion Laboratory

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Volodymyr Kindratenko

University of Illinois

National Center for Supercomputing Applications

Designing and Building Next-Generation Computer Systems for Deep Learning

Michael Kinsner

Intel Corporation

Reconfigurable Computing for HPC: Will It Make It this Time?

Energy Sciences Network (ESnet)

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Lara Kisielewska

The Difference Between HPC on Premises and in the Cloud

University of Delaware

Los Alamos National Laboratory

Plasma Meets Portability: A Journey to Performance Portability in a Particle-in-Cell Code

Effective Performance Portability

Oak Ridge National Laboratory

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Feature-Relevant Data Reduction for In Situ Workflows

Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

High Performance I/O Frameworks 101

Kerstin Kleese van Dam

Brookhaven National Laboratory

Career Development Panel

Intel Corporation

Mastering Tasking with OpenMP

Advanced OpenMP: Host Performance and 5.0 Features

OpenMP API Version 5.0 - Getting Ready for Exascale

OpenMP® 5.0 Is Here: Find Out All the Things You Need to Know About It!

Nvidia Corporation

OpenMP GPU Offload in Flang and LLVM

Tobias Klöffel

University of Erlangen-Nuremberg

Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures

Cornell University

Upcoming Events in the HPC Systems Professionals Community

Programmable Education Infrastructure: Cloud Resources as HPC Education Environments

Christian Kniep

Cloud Infrastructure Solutions To Run HPC Workloads

Christopher Knight

Argonne National Laboratory

Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations

Hiroaki Kobayashi

Tohoku University

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Energy Efficient HPC Working Group

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

University of Notre Dame

17th Graph500 List

Revisiting the 2008 ExaScale Computing Study and Venturing Predictions for 2028

National Institute of Advanced Industrial Science and Technology (AIST)

FlowOS-RM: Disaggregated Resource Management System

Texas State University

High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem

Sandia National Laboratories

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Chaitanya Kolluru

Case Western Reserve University

Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images

Yuri Kolomiyets

Corsa Technology Inc

100G SSL/TLS Decryption Is Indeed Possible for High Capacity Links

Kazuhiko Komatsu

Tohoku University

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Vamsee Reddy Kommareddy

University of Central Florida

Exploring Allocation Policies in Disaggregated Non-Volatile Memories

University of Tokyo

Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack

Karlsruhe Institute of Technology

The NAStJA Framework: Non-Collective Scalable Global Communications

Non-Collective Scalable Global Network Based on Local Communications

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

University of Hawaii

Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations

OpenMP Common Core: a “Hands-On” Exploration

Lawrence Livermore National Laboratory

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Micron Technology Inc

17th Graph500 List

Aalborg University, Copenhagen

DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything

Oak Ridge National Laboratory

Delivering on the Exascale Computing Project Mission for the US Department of Energy

Patricia Kovatch

Icahn School of Medicine at Mount Sinai

Panel Discussion on Current Trends, Needs, and Bottlenecks in Computational Human Phenomics

Introduction - Computational Phenomics @Scale: From Supercomputers to Bedside

Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Impacting Cancer with HPC: Opportunities and Challenges

James Kowalkowski

Fermi National Accelerator Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Data-Parallel Python for High Energy Physics Analyses

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

Anycast: Rootless Broadcasting with MPI

HDF5: I/O Middleware and Ecosystem for HPC and Experimental and Observational Sciences

Matthew S. Krafczyk

University of Illinois

Assessing Reproducibility: An Astrophysical Example of Computational Uncertainty in the HPC Context

William T. Kramer

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

Michal Kravcenko

Technical University of Ostrava, Czech Republic

Distributed Fast Boundary Element Methods

Nathaniel Kremer-Herman

University of Notre Dame

A Lightweight Model for Right-Sizing Master-Worker Applications

Reduction of Workflow Resource Consumption Using a Density-based Clustering Model

Christopher D. Krieger

Laboratory for Physical Sciences at University of Maryland

Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture

Aravind Krishnamoorthy

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Sriram Krishnamoorthy

Pacific Northwest National Laboratory

Characterization of the Impact of Soft Errors on Iterative Methods

HPC Software Verification in Action: A Case Study with Tensor Transposition

MPI Optimization and Characterization

Vandhana Krishnan

Stanford University

Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud

American River College

OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine

Martin Kronbichler

Technical University Munich

Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?

Cloud Infrastructure Solutions To Run HPC Workloads

Argonne National Laboratory

Argonne Leadership Computing Facility

User-Directed Loop-Transformations in Clang

Georgia Institute of Technology

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

Vladyslav Kucher

University of Münster

Unified Cross-Platform Profiling of Parallel C++ Applications

Andrey Kudryavtsev

Intel Corporation

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

Los Alamos National Laboratory

OpenSHMEM in the Era of Exascale

Unified Communication X (UCX) Community

Mohammad Amin Kuhail

University of Missouri, Kansas City

Lessons from Integrating Parallelism into Undergraduate Curriculum at UMKC

Nvidia Corporation

Programmable Interactive Visualization of a Core-Collapse Supernova Simulation

University of Hamburg

Toward a HPC Certification Program

Imperial College, London

A Study on Checkpoints Compression for Adjoint Computation

Institute of Computational Mathematics and Mathematical Geophysics SB RAS

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

Indian Institute of Tropical Meteorology

Visualization of Droplet Dynamics in Cloud Turbulence

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Argonne National Laboratory

Characterization of MPI Usage on a Production Supercomputer

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Manaschai Kunaseth

National Science and Technology Development Agency, Thailand

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Louisiana State University

Introducing Three Basic Concepts in Parallel Computation to 1st Year Computer Science Students in a Simple and Effective Way

University of Reading

International HPC Certification Program

The IO-500 and the Virtual Institute of I/O

Analyzing Parallel I/O

Toward a HPC Certification Program

Toward Understanding I/O Behavior in HPC Workflows

Stony Brook University

Feature-Relevant Data Reduction for In Situ Workflows

Lawrence Berkeley National Laboratory

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Exascale Deep Learning for Climate Analytics

Deep Learning at Scale

A Case Study for Performance Portability Using OpenMP 4.5

Gregory Kurtzer

The Difference Between HPC on Premises and in the Cloud

Missouri Western State University

CV Review and Career Development Panel

L

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Compiler and Runtime Based Parallelization and Optimization for GPUs

University of Melbourne

Toward a HPC Certification Program

Lawrence Livermore National Laboratory

FlipTracker: Understanding Natural Error Resilience in HPC Applications

Introduction - 2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

Sumathi Lakshmiranganatha

University of Wyoming

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

James Madison University

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Facilitating the Adoption of Correctness Tools in HPC Applications

Automatic Generation of Mixed-Precision Programs

University of Oregon

OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing

Environmental Systems Design (ESD)

High Performance Computing (HPC) Data Center Planning and TCO: A Case Study and Roadmap

Shandong University

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Sandy Landsberg

US Department of Defense HPC Modernization Program

What the Heck Is HEC?

Los Alamos National Laboratory

Heterogeneous Memory and Arena-Based Heap Allocation

Students@SC Keynote: The Computing Hidden in Everyday Things

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

The Power of Storytelling: Exposing User Experiences and Lessons Learned to Inspire and Instruct Technology Adoption

Nvidia Corporation

Session 3: Using OpenMP

Sandia National Laboratories

Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Energy Efficiency Considerations for HPC Procurements

Lawrence Livermore National Laboratory

A Flexible System For In Situ Triggers

Argonne National Laboratory

Hybrid Quantum-Classical Computing Architectures

Argonne National Laboratory

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Parallel-IO in Practice

Methodology for the Rapid Development of Scalable HPC Data Services

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education

Software Engineering and Reuse in Computational Science and Engineering

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Fifth SC Workshop on Best Practices for HPC Training and Education

Bruno Lathuilière

EDF Research and Development

Debugging and Optimization of HPC Programs in Mixed Precision with the Verrou Tool

Texas A&M University

Student Cluster Competition Team Panel Presentation

University of Erlangen-Nuremberg

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

KTH Royal Institute of Technology

Characterizing Deep-Learning I/O Workloads in TensorFlow

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

University of Warwick

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Margaret Lawson

University of Illinois

Sandia National Laboratories

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

Valentin Le Fèvre

Approximating a Multi-Grid Solver

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

University of Tübingen

On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures

Nanyang Technological University, Singapore

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Aerospace Corporation

Federated Cloud: An Evolutionary Path from Grid Computing

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Experience New Records for Speed and Scale: High Performance Genomics and Imaging

Los Alamos National Laboratory

Heterogeneous Memory and Arena-Based Heap Allocation

Oak Ridge National Laboratory

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing

Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems

Clacc: Translating OpenACC to OpenMP in Clang

Northwestern University

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Understanding Potential Performance Issues Using Resource-Based alongside Time Models

Stanford University

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems

Northeastern University

Preserving Privacy through Processing Encrypted Data

Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)

Matthew P. Legendre

Lawrence Livermore National Laboratory

Managing HPC Software Complexity with Spack

Gotcha: A Function-Wrapping Interface for HPC Tools

Intelligent Light

Invited Talk: Data Science Meets CFD

University of Grenoble

SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

University of Maryland

Mid-Atlantic Crossroads

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Jan-Patrick Lehr

Technical University Darmstadt

Scientific Computing

Compiler-Aided Type Tracking for Correctness Checking of MPI Applications

Olli-Pekka Lehto

Jump Trading LLC

Introduction - 5th International Workshop on HPC User Support Tools: HUST-18

Tactical Computing Laboratories

Texas Tech University

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

4th Workshop for Open Source Supercomputing (OpenSuCo)

University of Hawaii at Manoa

SAGE2 10th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration

William Leinberger

General Dynamics Mission Systems

Analytic Based Monitoring of High Performance Computing Applications

Matthew L. Leininger

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Jacques-Bernard Lekien

Atomic Energy and Alternative Energies Commission (CEA)

PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows

John Leonardini

Simplifying HPC Data Management at Scale

Man Chong Leong

Rice University

Script of Scripts Polyglot Notebook and Workflow System

Humboldt University of Berlin

LOS: Level Order Sampling for Task Graph Scheduling on Heterogeneous Resources

Khaled Ben Letaief

Hong Kong University of Science and Technology

SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition

Reservoir Labs Inc

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference

Sustainable Horizons Institute

Welcome and Introduction

Building a Career on Your Strengths

Connecting and Thinking Strategically through Your Strengths

Randall LeVeque

University of Washington

Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU)

Dustin Leverman

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Markus Levonyak

University of Vienna

Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods

Sandia National Laboratories

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

Workshop Morning Break

Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Sandia National Laboratories

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

Argonne National Laboratory

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Pacific Northwest National Laboratory

Energy Efficiency of Reconfigurable Caches on FPGAs

Binarized ImageNet Inference in 29us

University of Illinois

MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines

University of California, Merced

Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs

FlipTracker: Understanding Natural Error Resilience in HPC Applications

Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Georgia Institute of Technology

HiCOO: Hierarchical Storage of Sparse Tensors

Tsinghua University

National Supercomputing Center, Wuxi

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Lawrence Livermore National Laboratory

Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver

Lawrence Berkeley National Laboratory

Approximating for Faster, Better and Cheaper Scientific Computing

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation

Lawrence Berkeley National Laboratory

Anycast: Rootless Broadcasting with MPI

Northeastern University

PRISM: Predicting Resilience of GPU Applications Using Statistical Methods

Xiaoye Sherry Li

Lawrence Berkeley National Laboratory

High Performance Computing in Dynamic Traffic Simulation

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Brown University

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

University of Warwick

Optimizing Machine Learning on Apache Spark in HPC Environments

University of Utah

SpotSDC: an Information Visualization System to Analyze Silent Data Corruption

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation

Lawrence Livermore National Laboratory

Is Data Placement Optimization Still Relevant on Newer GPUs?

Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control

Oak Ridge National Laboratory

Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Georgia Institute of Technology

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

Tsinghua University

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Lawrence Livermore National Laboratory

Is Data Placement Optimization Still Relevant on Newer GPUs?

Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Peter Lindstrom

Lawrence Livermore National Laboratory

Compression for Scientific Data

The ARM HPC Experience: From Testbeds to Exascale

Performance Optimization Studies

Jet Propulsion Laboratory

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

University of Minnesota

Dynamically Negotiating Capacity Between On-Demand and Batch Clusters

University of Massachusetts, Lowell

iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees

TriCore: Parallel Triangle Counting on GPUs

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

New Jersey Institute of Technology

Workshop Afternoon Break

Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

High Performance I/O Frameworks 101

University of Texas

OOOPS: An Innovative Tool for IO Workload Management on Supercomputers

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

National Research Centre of Parallel Computer Engineering and Technology

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Cross-Layer Group Regularization for Deep Neural Network Pruning

Intel Corporation

High-Performance Dense Tucker Decomposition on GPU Clusters

Tongji University

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

University of Illinois

A Massively Parallel Evolutionary Markov Chain Monte Carlo Algorithm for Sampling Complicated Multimodal State Spaces

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Argonne National Laboratory

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

University of Utah

SpotSDC: an Information Visualization System to Analyze Silent Data Corruption

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Automatic Generation of Mixed-Precision Programs

Los Alamos National Laboratory

Using Thrill to Process Scientific Data on HPC

Red Oak Consulting

Procurement and Commissioning of HPC Systems

Glenn K. Lockwood

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

A Year in the Life of a Parallel File System

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Parallel-IO in Practice

Sandia National Laboratories

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

The IO-500 and the Virtual Institute of I/O

University of Tennessee

Feature-Relevant Data Reduction for In Situ Workflows

University of Wisconsin

Fault Tolerant Cholesky Factorization on GPUs

Advanced Micro Devices Inc

Challenges of High-Capacity DRAM Stacks and Potential Directions

Johann Lombardi

Intel Corporation

Enabling Data Services for HPC

Victor Lomuller

Codeplay Software Ltd

Challenges of C++ Heterogeneous Programming Using SYCL Implementation Experience: the Four Horsemen of the Apocalypse

Pennsylvania State University

HPC Impact at TAE Technologies and Pratt & Whitney

HPC Impact at Procter & Gamble, Boeing and GE

HPC Impact at BP and Lockheed Martin

HPC Impact at GM and John Deere

University of Texas

TACC's Cloud Deployer: Automating the Management of Distributed Software Systems

Lawrence Berkeley National Laboratory

Python-Based In Situ Analysis and Visualization

SENSEI Cross-Platform View of In Situ Analytics

University of A Coruña, Spain

Toward Ad Hoc Recovery For Soft Errors

Pavel Lougovski

Oak Ridge National Laboratory

Quantum Computing for Scientific Applications

Keysight Technologies Inc

Make Sure the Network Isn’t the Problem! 400GE Considerations and Best Practices for Testing the Cluster Fabric

Indiana University

Cloud Infrastructure Solutions To Run HPC Workloads

David K. Lowenthal

University of Arizona

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

King Abdullah University of Science and Technology

Approximating for Faster, Better and Cheaper Scientific Computing

Toward Smoothing Data Movement Between RAM and Storage

Fermi National Accelerator Laboratory

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Ohio State University

Designing High-Performance, Resilient, and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters

Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning

National Supercomputer Center Guangzhou

Panel 3: Challenge and Chance for Supercomputing Center in China

German Climate Computing Center

Toward Understanding I/O Behavior in HPC Workflows

Toward a HPC Certification Program

Nvidia Corporation

Exascale Deep Learning for Climate Analytics

German Climate Computing Center

Argonne National Laboratory

Toward Understanding I/O Behavior in HPC Workflows

Andrew Lumsdaine

Pacific Northwest National Laboratory

17th Graph500 List

Oak Ridge National Laboratory

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Argonne National Laboratory

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

University of Delaware

Toward Deductive Verification of Message-Passing Parallel Programs

University of Tennessee

Innovative Computing Laboratory

Batched, Reproducible, and Reduced Precision BLAS

HPCG Benchmark Update

Fermi National Accelerator Laboratory

Quantum Communication Networks and Technologies

University of Texas

Evaluating and Accelerating High-Fidelity Error Injection for HPC

University of Minnesota

Ceph Applications in HPC Environments

Vickie E. Lynch

Oak Ridge National Laboratory

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Ciena Corporation

INDIS Showcases Panel: NRE and XNET and Architecture

M

Massachusetts Green High Performance Computing Center

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Qatar Computing Research Institute

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Lawrence Berkeley National Laboratory

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Lewis & Clark College

Jupyter Notebooks and User-Friendly HPC Access

Maciej Maciejewski

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Lalith Maddegedara

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Pennsylvania State University

Doctoral Showcase II

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

Krell Institute

How to Analyze the Performance of Parallel Codes 101

California Institute of Technology

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Keynote: Glow: An Optimizing Compiler for High-Performance Machine Learning

Lawrence Berkeley National Laboratory

Exascale Deep Learning for Climate Analytics

Satheesh Maheswaran

Atomic Weapons Establishment (AWE), UK

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Abdulrahman Mahmoud

University of Illinois

Optimizing Software-Directed Instruction Replication for GPU Error Detection

Pittsburgh Supercomputing Center

Strategies for Inclusive and Scalable HPC Outreach and Education

Evaluating the Wide Area Classroom after 10,500 HPC Students

Akalanka Mailewa Dissanayaka

Texas Tech University

Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs

Matthias Maiterth

Ludwig Maximilian University of Munich

Intel Corporation

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Kobe University

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Pros and Cons of HPCx Benchmarks

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Indian Institute of Technology Kanpur

Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Nicholas Malaya

Advanced Micro Devices Inc

Keynote: Full Stack Open Source Supercomputing

DePaul University

Semantically Organized Containers for Reproducible Research

University of Oregon

VPA18 Keynote: Not Your Mama’s Angry Fruit Salad: Ruminations on 30 Years of Performance Visualization and Visual Performance Analysis

OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing

Allen D. Malony

University of Oregon

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

Carlos Maltzahn

University of California, Santa Cruz

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Semantically Organized Containers for Reproducible Research

Spotting Black Swans With Ease: The Case for a Practical Reproducibility Platform

International Center for Advanced Internet Research (iCAIR)

Northwestern University

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Renaissance Computing Institute (RenCI)

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

University of Houston

GPU-Accelerated Interpolation for 3D Image Registration

Filippo Mantovani

Barcelona Supercomputing Center

Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs

Filling the Gap between Education and Industry: Evidence-Based Methods for Introducing Undergraduate Students to HPC

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Alexander Margolin

Hebrew University of Jerusalem

Tree-Based Fault-Tolerant Collective Operations for MPI

Argonne National Laboratory

Large-Scale PDE-Constrained Optimization

University of Chicago

A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints

Stefano Markidis

KTH Royal Institute of Technology

Characterizing Deep-Learning I/O Workloads in TensorFlow

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making

George Markomanolis

Oak Ridge National Laboratory

The IO-500 and the Virtual Institute of I/O

Sandia National Laboratories

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

Tokyo Institute of Technology

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

Lawrence Berkeley National Laboratory

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

The HPC Best Practices Webinar Series

Chris Marroquin

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

Nicole Marsaglia

University of Oregon

A Flexible System For In Situ Triggers

Argonne National Laboratory

Achieving Performance on Large-Scale Intel Xeon-Based Systems

HPC Impact at TAE Technologies and Pratt & Whitney

HPC Impact at Procter & Gamble, Boeing and GE

HPC Impact at BP and Lockheed Martin

HPC Impact at GM and John Deere

Hartman Executive Advisors

SC: The Conference

Panel Discussion

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Maxime Martinasso

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Richard C. Martineau

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Dominique Martinet

Atomic Energy and Alternative Energies Commission (CEA)

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Sandia National Laboratories

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Data Analytics for System and Facility Energy Management

Energy Efficiency Considerations for HPC Procurements

Jose Maria Martinez

Technical University of Valencia

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Daniel A. Martinez

US Army Engineer Research and Development Center

Deep Learning Evolutionary Optimization for Regression of Rotorcraft Vibrational Spectra

Technical University of Ostrava, Czech Republic

HPC-as-a-Service for Life Sciences

Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler

Margaret Martonosi

Princeton University

What Is the Role of Architecture and Software Researchers in Making Quantum Computing Practical?

Xavier Martorell

Barcelona Supercomputing Center

Benchmarking Scientific Reconfigurable / FPGA Computing

Lawrence Livermore National Laboratory

Benchmarking Scientific Reconfigurable / FPGA Computing

Exascale Machine Learning

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Programming Systems Tools

Michael Mascagni

Florida State University

National Institute of Standards and Technology

Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)

Computational Reproducibility at Exascale 2018 (CRE2018)

Kristyn Maschhoff

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

Intel Corporation

Function/Kernel Vectorization via Loop Vectorizer

Fabian Mastenbroek

Delft University of Technology

A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments

Sergi Mateo Bellido

Barcelona Supercomputing Center

Mastering Tasking with OpenMP

Michael Matheson

Oak Ridge National Laboratory

Exascale Deep Learning for Climate Analytics

Leibniz Supercomputing Centre

Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures

Amrita Mathuriya

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Yoshimasa Matsumura

University of Tokyo

Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers

Satoshi Matsuoka

Tokyo Institute of Technology

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

Approximating for Faster, Better and Cheaper Scientific Computing

“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities

Introduction - The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

Exascale Machine Learning

NASA Ames Research Center

PBS Pro Open Source Project Community BoF

Federal University of Rio de Janeiro

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

Intel Corporation

Programming Your GPU with OpenMP: A Hands-On Introduction

OpenMP Common Core: a “Hands-On” Exploration

Intel Corporation

Performance Tuning of Scientific Codes with the Roofline Model

Purdue University

cgroups_py: Using Linux Control Groups and Systemd to Manage CPU Time and Memory

University of Manchester

First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture

Oak Ridge National Laboratory

GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

University of Southern California

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Alces Flight Limited

The Movement toward HPC Inclusivity: Achieving On-Demand Accessibility of High Performance Computing (HPC) through Ephemeral Projects Utilizing the Alces Gridware Project

John D. McCalpin

University of Texas

Texas Advanced Computing Center

HPL and DGEMM Performance Variability on the Xeon Platinum 8160 Processor

US Navy, Oceanographer of the Navy

Purpose-Built HPC: Last Hope for Earth System Prediction?

Quantum Computing for Scientific Applications

Meghan McClelland

Versity Software Inc

Exascale Archiving - Challenges and Opportunities

Patrick McCormick

Los Alamos National Laboratory

Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems

Task-Based Programming

Peter McCorquodale

Lawrence Berkeley National Laboratory

A Low-Communication Method to Solve Poisson's Equation on Locally-Structured Grids

Ralph McEldowney

US Department of Defense HPC Modernization Program, Air Force Research Laboratory

Learning to Lead in HPC - Strategies to Start Your Leadership Journey

University of California, Berkeley

Lawrence Berkeley National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

University of Illinois

National Research Infrastructure: Collaborative Session

Suzanne McIntosh

New York University

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Third Annual Meeting of the SIGHPC - Big Data Chapter

Simon McIntosh-Smith

University of Bristol

The ARM HPC Experience: From Testbeds to Exascale

Programming Your GPU with OpenMP: A Hands-On Introduction

Australian Data Science Education Institute

Data Science and HPC Education and Outreach

University of Texas

Getting Scientific Software Installed

Nvidia Corporation

Making Containers Easier with HPC Container Maker

Donald McMullen

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Colin McMurtrie

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

Panel 1: A Site-Local View of Creating a Pan-European Federated Research Infrastructure

Stephen McNally

Oak Ridge National Laboratory

GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan

Lawrence Meadows

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Technical University of Ostrava, Czech Republic

Workflow for Parallel Processing of Sequential Mesh Databases

Maryam Mehri Dehnavi

University of Toronto

ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

Susan Mehringer

Cornell University

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Fifth SC Workshop on Best Practices for HPC Training and Education

Lawrence Berkeley National Laboratory

Managing HPC Software Complexity with Spack

University of Pittsburgh

Partial Redundancy in HPC Systems with Non-Uniform Node Reliabilities

John Mellor-Crummey

Rice University

Dynamic Data Race Detection for OpenMP Programs

Ruby Mendenhall

University of Illinois

Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity

A Black Woman’s Sojourn in High Performance Computing: Recovering Lost History

Celso L. Mendes

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Tencent Holdings Ltd

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Texas Tech University

Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs

Harshitha Menon

Lawrence Livermore National Laboratory

Error Analysis in HPC Applications Using Algorithmic Differentiation

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

SpotSDC: an Information Visualization System to Analyze Silent Data Corruption

Automatic Generation of Mixed-Precision Programs

Michael Mercier

Considering the Development Workflow to Achieve Reproducibility with Variation

Cristin Merritt

Alces Flight Limited

The Movement toward HPC Inclusivity: Achieving On-Demand Accessibility of High Performance Computing (HPC) through Ephemeral Projects Utilizing the Alces Gridware Project

Personalized Medicine and HPC

Technical University of Ostrava, Czech Republic

Distributed Fast Boundary Element Methods

Enablence Technologies Inc

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

Argonne National Laboratory

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Nvidia Corporation

Interactivity in HPC

TOP500 Supercomputers

University of Erlangen-Nuremberg

Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures

The Next Wave of HPC in the Datacenter

Indiana University

Machine Learning and AI

Marek Michalewicz

Interdisciplinary Center for Mathematical and Computational Modeling

University of Warsaw

Panel 1: Role of Federated Polish HPC Centers in Polish AI Initiatives and EuroHPC Program

Data Machines Corporation

Federated Cloud: An Evolutionary Path from Grid Computing

Cloud Infrastructure Solutions To Run HPC Workloads

Lauren Milechin

Massachusetts Institute of Technology

The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training

University of Wisconsin

Secure Coding Practices and Automated Assessment Tools

RWTH Aachen University

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

Michelle Strout

University of Arizona

ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism

Students@SC: Careers in Industry, Research Labs, and Academia

Navigating the SC Conference Technical Program Submission Process

Australian National University

AIWC: OpenCL-Based Architecture Independent Workload Characterization

Amanda J. Minnich

Lawrence Livermore National Laboratory

Safety, Reproducibility, Performance: Accelerating Cancer Drug Discovery with Cloud, ML, and HPC Technologies

Students@SC: Careers in Industry, Research Labs, and Academia

Oak Ridge National Laboratory

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

Pacific Northwest National Laboratory

Enabling High-Level Graph Processing via Dynamic Tasking

University of Texas

Texas Advanced Computing Center

The New NSF-Funded Resource: Frontera - Towards a Leadership Class Computing Facility

Azalia Mirhoseini

Morning Keynote – Azalia Mirhoseini (Google)

Vladimir Mironov

Lomonosov Moscow State University

MPI/OpenMP Parallelization of the Fragment Molecular Orbitals Method in GAMESS

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

Intel Corporation

Parallel Computing Lab

Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting

Konstantina Mitropoulou

Intel Corporation

Function/Kernel Vectorization via Loop Vectorizer

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Takaaki Miyajima

Stream Computing of Lattice-Boltzmann Method on Intel Programmable Accelerator Card

Graz University of Technology

faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU

Susan Mniszewski

Los Alamos National Laboratory

Community Detection Across Emerging Quantum Architectures

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Forschungszentrum Juelich

SC: The Conference

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Students@SC: Careers in Industry, Research Labs, and Academia

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

SpotSDC: an Information Visualization System to Analyze Silent Data Corruption

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

VeloC: Very Low Overhead Checkpointing System

Multi-Level Memory and Storage for HPC and Data Analytics

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Panel: Interactivity in Supercomputing

Shintaro Momose

Tohoku University

NEC Corporation

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Lawrence Berkeley National Laboratory

Energy Sciences Network (ESnet)

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Los Alamos National Laboratory

Improving Application Resilience by Extending Error Correction with Contextual Information

Raffaele Montella

Parthenope University of Naples

DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything

Los Alamos National Laboratory

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

How to Analyze the Performance of Parallel Codes 101

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

VeloC: Very Low Overhead Checkpointing System

Lawrence Livermore National Laboratory

James Madison University

Automatic Generation of Mixed-Precision Programs

Lawrence Livermore National Laboratory

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Scalable Deep Ensemble Learning for Cancer Drug Discovery

Sebastien Morais

Atomic Energy and Alternative Energies Commission (CEA)

PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows

Nicolas Morales

Sandia National Laboratories

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

HPC Graph Toolkits and the GraphBLAS Forum

Kenneth Moreland

Sandia National Laboratories

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs

Kazutaka Morita

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Argonne National Laboratory

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Sandia National Laboratories

Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

Alexander Moskovsky

Panel 2: Russian HPC Trends: a View From a Local Vendor Trench

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

University of Pittsburgh

Supporting Thorough Artifact Evaluation with Occam

University of California, Berkeley

Programmable Interactive Visualization of a Core-Collapse Supernova Simulation

University of Washington

Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU)

Charles Moulinec

Science and Technology Facilities Council, UK

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

Irene Moulitsas

Cranfield University

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Argonne National Laboratory

Introduction - Women in HPC: Diversifying the HPC Community

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

Women in HPC: Diversifying the HPC Community

University of Warwick

Swiss Army Programming: Performance and Portability from Modern Tools

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

Heterogeneous CPU-GPU Execution of Stencil Applications

Mayur Mudigonda

Lawrence Berkeley National Laboratory

Exascale Deep Learning for Climate Analytics

Public Library of Science

Quantum Communication Networks and Technologies

North Carolina State University

Doomsday: Predicting Which Node Will Fail When on Supercomputers

Using Darshan and CODES to Evaluate Application I/O Performance

Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud

University of Tartu, Estonia

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Diptajyoti Mukherjee

Allegheny College

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

Daichi Mukunoki

Tokyo Woman's Christian University

High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme

Massachusetts Institute of Technology

Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education

Strategies for Inclusive and Scalable HPC Outreach and Education

The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training

Fifth SC Workshop on Best Practices for HPC Training and Education

Matthias Müller

RWTH Aachen University

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

RWTH Aachen University

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

Argonne National Laboratory

Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations

Universidade da Coruña

In-Transit Molecular Dynamics Analysis with Apache Flink

Railway Technical Research Institute, Japan

Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis

Micron Technology Inc

17th Graph500 List

Tohoku University

NEC Corporation

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Pacific Northwest National Laboratory

Polytechnic University of Catalonia

Characterization of the Impact of Soft Errors on Iterative Methods

Pacific Northwest National Laboratory

HPC Software Verification in Action: A Case Study with Tensor Transposition

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Python-Based In Situ Analysis and Visualization

N

Santosh Nagarakatte

Rutgers University

A Parallelism Profiler with What-If Analyses for OpenMP Programs

Ahmad Turan Naimey

Northern Arizona University

Pathogen and Microbiome Institute

Enabling Reproducible Microbiome Science through Decentralized Provenance Tracking in QIIME 2

Railway Technical Research Institute, Japan

Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Takashi Nakamura

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Aiichiro Nakano

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Kouta Nakashima

Fujitsu Laboratories Ltd

DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping

Yuji Nakatsukasa

National Institute of Informatics, Japan

Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices

University of Bordeaux

Runtime for Exascale and Beyond: Convergence or Divergence?

Sai Narasimhamurthy

Seagate Systems UK

Characterizing Deep-Learning I/O Workloads in TensorFlow

Sri Hari Krishna Narayanan

Argonne National Laboratory

A Study on Checkpoints Compression for Adjoint Computation

Carleton College

A Statistical Analysis of Compressed Climate Model Data

Nvidia Corporation

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Saber Naserifar

California Institute of Technology

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs

Lawrence Livermore National Laboratory

P3HPC Community Discussion and Next Steps

Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Henry J. Neeman

University of Oklahoma

Sustaining Research Software

Workloads and Benchmarks for System Acquisition

Gianina Alina Negoita

Iowa State University

Deep Learning: Extrapolation Tool for Computational Nuclear Physics

David H. Neill Asanza

Grinnell College

Los Alamos National Laboratory

Challenges of Performance Portability for Fortran Unstructured Mesh Codes

Effective Performance Portability

Performance Portability Challenges for Fortran Applications

Faucet Foundation

University of Waikato

Faucet: SDN made Easy

Nvidia Corporation

Meeting HPC Container Challenges as a Community

The Difference Between HPC on Premises and in the Cloud

California Institute of Technology

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

SDN for End-to-End Networked Science at the Exascale (SENSE)

SLAC National Accelerator Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

West Chester University

Using CloudLab as a Scalable Platform for Teaching Cluster Computing

University of Texas

Institute for Computational Engineering and Sciences

Arctic Ocean-Sea Ice Interactions

Los Alamos National Laboratory

Comparing Deep Learning with Quantum Inference on The D-Wave 2X

Lawrence Berkeley National Laboratory

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

University of Massachusetts

Toward Developing a Repository of Logical Errors Observed in Parallel Code for Teaching Code Correctness

Texas Tech University

HPCViz: Monitoring Health Status of High Performance Computing Systems

University of North Carolina

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Argonne National Laboratory

A Study on Checkpoints Compression for Adjoint Computation

VeloC: Very Low Overhead Checkpointing System

Nvidia Corporation

Programmable Interactive Visualization of a Core-Collapse Supernova Simulation

Daisuke Nishiura

Japan Agency for Marine-Earth Science and Technology

Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation

Riken Center for Computational Science

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Altair Engineering

PBS Pro Open Source Project Community BoF

Anderson Braulio Nobrega da Silva

Federal Institute of Paraíba

Federal University of Rio Grande do Norte

PaScal Viewer: A Tool for the Visualization of Parallel Scalability Trends

Korea Advanced Institute of Science and Technology

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Talent Strategy Institute

Women in HPC: the Importance of Male Allies

Jean-Philippe Nominé

European Technology Platform for High Performance Computing (ETP4HPC)

Atomic Energy and Alternative Energies Commission (CEA)

Consolidating the European Exascale Effort

Ken-ichi Nomura

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Riken Center for Computational Science

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Michael L. Norman

San Diego Supercomputer Center

University of California, San Diego

Computational Cosmology and Astrophysics on Adaptive Meshes Using Charm++

Ali Akbar Nosrati

Texas Tech University

Simulating Data Centers with Redfish-Enabled Equipment

Marziyeh Nourian

North Carolina State University

A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms

Javier Novo Rodríguez

Appentra Solutions

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.

Texas State University

High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem

US Department of Energy Office of Advanced Scientific Computing Research

Hot Topics Discussion II: Thriving at Work

Perspectives on Data Reduction from ASCR

Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.

Tranquility Amidst Turbulence: A Vision for Advancing Scientific Discovery in the Era of Extreme Heterogeneity

Understanding the Reader

Pawsey Supercomputing Centre

Strategies for Inclusive and Scalable HPC Outreach and Education

HPC Education and Training: An Australian Perspective

Intel Corporation

Compiler Optimization for Heterogeneous Locality and Homogeneous Parallelism in OpenCL and LLVM

Marguerite Nyhan

United Nations Global Pulse

HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges

O

Kathryn O'Brien

Workshop Morning Break

Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Patrick O'Leary

Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

SENSEI Cross-Platform View of In Situ Analytics

Argonne National Laboratory

Better Scientific Software

University of Texas

Institute for Computational Engineering and Sciences

Arctic Ocean-Sea Ice Interactions

University of Southern California

WRENCH: A Framework for Simulating Workflow Management Systems

Tokyo Woman's Christian University

High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme

Martin Ohlerich

Leibniz Supercomputing Centre

Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

University of Stuttgart

Visual Analytics Challenges in Analyzing Calling Context Trees

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Stephan Olbrich

University of Hamburg

Toward a HPC Certification Program

Gabriel Consulting Group

Introduction to Student Cluster Competitions

Boston University

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Daniel Oliveira

Fluminense Federal University, Brazil

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

University of Pittsburgh

Supporting Thorough Artifact Evaluation with Occam

Vyacheslav Olshevsky

KTH Royal Institute of Technology

HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making

Hensley Omorodion

University of Benin

Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)

Kyushu University

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Naoyuki Onodera

Japan Atomic Energy Agency

Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Lawrence Berkeley National Laboratory

Automated Labeling of Electron Microscopy Images Using Deep Learning

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

LUSTRE Community BOF: Lustre in HPC and Emerging Data Markets: Roadmap, Features and Challenges

College of William & Mary

Thomas Jefferson National Accelerator Facility

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Argonne National Laboratory

Hybrid Quantum-Classical Computing Architectures

Daniel Osei-Kuffuor

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Barcelona Supercomputing Center

Personalized Medicine and HPC

Consolidating the European Exascale Effort

Appentra Solutions

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.

Leibniz Supercomputing Centre

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

Frederick Oullet

University of Florida

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

University of California, Davis

Linear Algebra Is the Right Way to Think About Graphs

Katsuhisa Ozaki

Shibaura Institute of Technology

High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Compiler and Runtime Based Parallelization and Optimization for GPUs

OpenMP GPU Offload in Flang and LLVM

Argonne National Laboratory

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

P

Intel Corporation

Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures

Carlos Pachajoa

University of Vienna

Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods

Universidade da Coruña

In-Transit Molecular Dynamics Analysis with Apache Flink

Francesco Paesani

University of California, San Diego

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

Los Alamos National Laboratory

Navigating the SC Conference Technical Program Submission Process

Introduction to Quantum Computing

Miguel Palacios

Procter and Gamble Company

HPC Enables Simulation-Led Innovation in Places You Would Not Expect

Rice University

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Sudhakar Pamidighantam

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

H3 Platform Inc

A Cost-Effective Flexible System Optimized for DNN and ML

Georgia Institute of Technology

School of Computational Science and Engineering

Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting

Ohio State University

Cooperative Rendezvous Protocols for Improved Performance and Overlap

ESPM2 2018: Closing Remarks

The Next Wave of HPC in the Datacenter

Introduction - ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

High Performance Middlewares for Next Generation Architectures: Challenges and Solutions

Designing High-Performance, Resilient, and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters

Unified Communication X (UCX) Community

Scalable and Distributed DNN Training on Modern HPC Systems

InfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing HEC Systems, and Usage

Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning

InfiniBand, Omni-Path, and High-Speed Ethernet for Beginners

ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

University of Hawaii at Manoa

WRENCH: A Framework for Simulating Workflow Management Systems

Ramesh Pankajakshan

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Pacific Northwest National Laboratory

HPC Software Verification in Action: A Case Study with Tensor Transposition

Jean-Pierre Panziera

European Technology Platform for High Performance Computing (ETP4HPC)

Consolidating the European Exascale Effort

George Papadimitriou

University of Southern California

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Tom Papatheodore

Oak Ridge National Laboratory

Application Porting and Optimization on GPU-Accelerated POWER Architectures

Michael E. Papka

Argonne National Laboratory

Northern Illinois University

Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

libIS: A Lightweight Library for Flexible In Transit Visualization

Ketan Paranjape

Roche - Diagnostic Information Solutions

Morning Keynote – Computational Approaches in Clinical Applications

Manish Parashar

National Science Foundation

The Future of NSF Supported Advanced Cyberinfrastructure

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Enabling Data Services for HPC

TCHPC Career Panel

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

High Performance I/O Frameworks 101

Sandia National Laboratories

Quantum Computing for Scientific Applications

Byung Hoon Park

Oak Ridge National Laboratory

A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Argonne National Laboratory

Characterization of MPI Usage on a Production Supercomputer

University of Michigan

Introduction to Kubernetes

Valerio Pascucci

University of Utah

SpotSDC: an Information Visualization System to Analyze Silent Data Corruption

A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing

libIS: A Lightweight Library for Flexible In Transit Visualization

Fermi National Accelerator Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Data-Parallel Python for High Energy Physics Analyses

Lawrence Livermore National Laboratory

“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities

Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance

US Naval Research Laboratory

The ARM HPC Experience: From Testbeds to Exascale

Aristides Patrinos

Precision and Personalized Medicine: The Time Has Arrived

Christos Patriotis

National Cancer Institute

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Nvidia Corporation

Panel: Interactivity in Supercomputing

Michael Patterson

Intel Corporation

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Robert M. Patton

Oak Ridge National Laboratory

Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines

Introduction - Machine Learning in HPC Environments

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Md Mostofa Ali Patwary

Graph Algorithms and Systems

Marquette University

OpenACC-Based GPU Parallelization of Plane Sweep Algorithm for Geometric Intersection

Rice University

A Unified Runtime for PGAS and Event-Driven Programming

Viking Enterprise Solutions

Cassandra in Dockers Deployment Using an NVMe Fabric

Los Alamos National Laboratory

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

Nicholas Pavini

American River College

OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine

Texas A&M University

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Los Alamos National Laboratory

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

David E. Pearah

Sustaining Research Software

Lawrence Livermore National Laboratory

Visual Analytics Challenges in Analyzing Calling Context Trees

Students@SC Keynote: The Computing Hidden in Everyday Things

Students@SC Keynote: Livin’ on the Edge: Thoughts on Careers in High Performance Computing

Lawrence Livermore National Laboratory

PruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

Sandia National Laboratories

OpenHPC Community BoF

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Lawrence Berkeley National Laboratory

Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks

Benoit Pelletier

CPU Overheating Characterization in HPC Systems: a Case Study

Piotr Pelplinski

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Los Alamos National Laboratory

OpenHPC Community BoF

Next-Generation Cluster Management Software

HPC Storage and Memory Architectures

University of Texas, MD Anderson Cancer Center

Script of Scripts Polyglot Notebook and Workflow System

Oak Ridge National Laboratory

Siena: Exploring the Design Space of Heterogeneous Memory Systems

Simon J. Pennycook

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

P3HPC Session 2 Panel Discussion

Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Effective Performance Portability

Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability, and Productivity

Atomic Energy and Alternative Energies Commission (CEA)

PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows

Atomic Energy and Alternative Energies Commission (CEA)

Spack Community BoF

Cody J. Permann

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Karina Pesatova

Technical University of Ostrava, Czech Republic

Strategies for Inclusive and Scalable HPC Outreach and Education

Albert Einstein College of Medicine

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Argonne National Laboratory

Large-Scale Algorithms

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

John W. Peterson

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Francesco Petrogalli

LLVM and the Automatic Vectorization of Loops Invoking Math Routines: -fsimdmath

University of Utah

A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing

Fredrik Pettersson

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

University of California, Santa Barbara

Fernbach Award Presentation - The Roles of Computing in the Advancement of Science: A Case Study

Texas Tech University

HPCViz: Monitoring Health Status of High Performance Computing Systems

Everett Phillips

Nvidia Corporation

Exascale Deep Learning for Climate Analytics

Tomás Picornell

Technical University of Valencia

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

Mary Ann Piette

Lawrence Berkeley National Laboratory

Urban Energy Science and High Performance Computing

University of Illinois

Monitoring Parsl Workflows

A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management

Lawrence Livermore National Laboratory

Western Washington University

Automatic Generation of Mixed-Precision Programs

US Department of Energy Office of Advanced Scientific Computing Research

Afternoon Keynote - Robinson Pino (DOE ASCR)

Randall Pittman

North Carolina State University

Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines

Fernando Pizzano

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Rensselaer Polytechnic Institute

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

University of Grenoble

CPU Overheating Characterization in HPC Systems: a Case Study

Forschungszentrum Juelich

Panel 1: Service-Oriented HPC and Data Infrastructures for Science in Germany

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

Application Porting and Optimization on GPU-Accelerated POWER Architectures

Christian Plessl

Paderborn University

Bringing FPGAs to HPC Production Systems and Codes

Damian Podareanu

Fast and Accurate Training of an AI Radiologist

Large Minibatch Training on Supercomputers with Improved Accuracy and Reduced Time to Train

Norbert Podhorszki

Oak Ridge National Laboratory

High Performance I/O Frameworks 101

Marjo Poindexter

University of Texas

TACC's Cloud Deployer: Automating the Management of Distributed Software Systems

David Poliakoff

Lawrence Livermore National Laboratory

Gotcha: A Function-Wrapping Interface for HPC Tools

Samuel D. Pollard

University of Oregon

Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters

Alexei Poludnenko

Texas A&M University

Refactoring and Optimizing Multiphysics Combustion Models for Data Parallelism

Making Sense of Scientific Simulation Ensembles

SciNet HPC Consortium

University of Toronto

Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center

Nvidia Corporation

OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap

Los Alamos National Laboratory

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

OpenSHMEM in the Era of Exascale

Carleton College

A Statistical Analysis of Compressed Climate Model Data

University of Kassel

Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS

Purdue University

Adaptive Anonymization of Data with b-Edge Covers

Thomas E. Potok

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Brookhaven National Laboratory

Reproducibility for Streaming Analysis

Louis-Noel Pouchet

Colorado State University

Associative Instruction Reordering to Alleviate Register Pressure

Alexandra L. Poulos

Coastal Carolina University

Improving Application Resilience by Extending Error Correction with Contextual Information

Lawrence Berkeley National Laboratory

HDF5: I/O Middleware and Ecosystem for HPC and Experimental and Observational Sciences

Atomic Weapons Establishment (AWE), UK

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Energy Efficiency Modeling of Parallel Applications

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Interactive HPC Deep Learning with Jupyter Notebooks

Exascale Deep Learning for Climate Analytics

Deep Learning at Scale

Case Western Reserve University

Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images

Georgia State University

PDC Curriculum Update

Introduction - Workshop on Education for High Performance Computing (EduHPC)

Panel 2: United States National Science Foundation (NSF) Office of Advanced Cyberinfrastructure Programs and Workforce Development

Cardiff University

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Northeastern University

PRISM: Predicting Resilience of GPU Applications Using Statistical Methods

Employing Student Retention Strategies for an Introductory GPU Programming Course

Optimization of an Image Processing Algorithm: Histogram Equalization

Timothy Prickett Morgan

The Next Platform

The Next Wave of HPC in the Datacenter

Reid Priedhorsky

Los Alamos National Laboratory

Containers in HPC

Benjamin Pritchard

Molecular Sciences Software Institute

Software Engineers: Careers in Research

University of Texas

OpenHPC Community BoF

Roberto Proietti

University of California, Davis

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

Laurent Prosperi

ENS Paris-Saclay

Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud

Oak Ridge National Laboratory

High Performance I/O Frameworks 101

Los Alamos National Laboratory

In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization

Marquette University

Introducing PDC Concepts with Spatial Computing Curriculum

OpenACC-Based GPU Parallelization of Plane Sweep Algorithm for Geometric Intersection

Microsoft Corporation

Reconfigurable Computing for HPC: Will It Make It this Time?

Benchmarking Scientific Reconfigurable / FPGA Computing

Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Q

Texas State University

High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem

Hardware Acceleration of CNNs with Coherent FPGAs

Beihang University

Sun Yat-sen University

China’s Effort on Exascale Computing: Current Status and Perspectives

Northwest University, China

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

University of North Texas

Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance

Ohio State University

Understanding SSD Reliability in Large-Scale Cloud Systems

University of Washington

Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU)

Indiana University

High-Performance AI: A View from Systems and Frameworks

University of California, Berkeley

Visualizing Outbursts of Massive Stars

Indiana University

Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)

Strategies for Inclusive and Scalable HPC Outreach and Education

SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation

Enrique S. Quintana-Ortí

Jaume I University

High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

R

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

French Institute for Research in Computer Science and Automation (INRIA)

Large Scale Computation of Quantiles Using MELISSA

In-Transit Molecular Dynamics Analysis with Apache Flink

Vanderbilt University

Superscaling Performance through Energy-Efficient Supercomputing

Raghu Raja Chandrasekar

Amazon Web Services

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

University of Illinois

Student Cluster Competition Team Panel Presentation

Vinay Ramakrishnaiah

Los Alamos National Laboratory

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

Accelerating the Signal Alignment Process in Time-Evolving Geometries Using Python

Lavanya Ramakrishnan

Lawrence Berkeley National Laboratory

Dac-Man: Data Change Management for Scientific Datasets on HPC Systems

Automated Labeling of Electron Microscopy Images Using Deep Learning

Arvind Ramanathan

Oak Ridge National Laboratory

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

Artificial Intelligence Enabled Multiscale Molecular Simulations

Parameswaran Ramanathan

University of Wisconsin

Fault Tolerant Cholesky Factorization on GPUs

Cristian Ramon-Cortes

Barcelona Supercomputing Center

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Duke University

Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries

Experiencing HPC for Undergraduates: Careers in HPC

Experiencing HPC for Undergraduates: Graduate Student Perspective

Experiencing HPC for Undergraduates: Introduction to HPC Research

Experiencing HPC for Undergraduates Orientation

Experiencing HPC for Undergraduates Opening Reception

Northwestern University

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

University of Virginia

Lawrence Livermore National Laboratory

Exploring Application Performance on Fat-Tree Networks in the Presence of Congestion

University of Münster

Portable Parallel Performance via Multi-Dimensional Homomorphisms

Soren C. Rasmussen

Cranfield University

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Fabrice Rastello

French Institute for Research in Computer Science and Automation (INRIA)

Associative Instruction Reordering to Alleviate Register Pressure

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

North Carolina State University

Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA

Accelerating Microscope Data Analysis Using Parallel Computing

Archana Ravindar

Application Porting and Optimization on GPU-Accelerated POWER Architectures

Prashant Singh Rawat

Ohio State University

Associative Instruction Reordering to Alleviate Register Pressure

North Carolina State University

Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud

Elaine Raybourn

Sandia National Laboratories

Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training

The HPC Best Practices Webinar Series

University of Iowa

Convergence between HPC and Big Data: The Day After Tomorrow

“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities

How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?

Sandia National Laboratories

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Istvan Z. Reguly

Pázmány Péter Catholic University, Hungary

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

Heterogeneous CPU-GPU Execution of Stencil Applications

National Center for Atmospheric Research

Visualization of Droplet Dynamics in Cloud Turbulence

Technical University Munich

Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes

Steve Reinhardt

D-Wave Systems Inc

Opportunities for Open-Source Development for D-Wave Systems

Panel: Open-Source Software

University of Kassel

Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS

Technical University Munich

Distributed-Memory Hierarchical Compression of Dense SPD Matrices

University of California, Merced

Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs

Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory

University of Illinois, Chicago

SAGE2 10th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration

Massachusetts Institute of Technology

Interactivity in HPC

University of British Columbia

PruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution

Pattern Matching on Massive Metadata Graphs at Scale

Scale-Free Graph Processing on a NUMA Machine

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

Alejandro Ribes

EDF Research and Development

Large Scale Computation of Quantiles Using MELISSA

Olivier Richard

University of Grenoble

Considering the Development Workflow to Achieve Reproducibility with Variation

Andrew Richards

Codeplay Software Ltd

Swiss Army Programming: Performance and Portability from Modern Tools

Matthias Riebisch

University of Hamburg

Toward a HPC Certification Program

Eleanor G. Rieffel

NASA Ames Research Center

Introduction to Quantum Computing

Lubomír Říha

Technical University of Ostrava, Czech Republic

Workflow for Parallel Processing of Sequential Mesh Databases

University of Manchester

First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture

RIKEN BNL Research Center

Lawrence Berkeley National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Michael F. Ringenburg

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Interactivity in HPC

Deep Learning at Scale

Burkhard Ringlein

IBM Zurich Research Laboratory

University of Erlangen-Nuremberg

Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration

University of British Columbia

PruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution

Pattern Matching on Massive Metadata Graphs at Scale

Scale-Free Graph Processing on a NUMA Machine

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

University of Chicago

Dynamically Negotiating Capacity Between On-Demand and Batch Clusters

Georgia Institute of Technology

Hot Topics Discussion II: Thriving at Work

Strategies for Inclusive and Scalable HPC Outreach and Education

Lawrence Berkeley National Laboratory

University of Puerto Rico at Mayaguez

Capsule Networks for Protein Structure Classification

Argonne National Laboratory

libIS: A Lightweight Library for Flexible In Transit Visualization

SENSEI Cross-Platform View of In Situ Analytics

Benjamin Robbins

Panel Discussion

Keynote 1: A Few Scheduling Problems for Resilience at Scale

Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice

Argonne National Laboratory

Stateless Provisioning: Modern Practice in HPC

Los Alamos National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Improving Application Resilience by Extending Error Correction with Contextual Information

Effective Performance Portability

Performance Portability Challenges for Fortran Applications

Aleix Roca Nonell

Barcelona Supercomputing Center

Polytechnic University of Catalonia

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Dylan Rodriguez

Texas A&M University

Evaluating Active Learning Approaches for Teaching Intermediate Programming at an Early Undergraduate Level

Eduardo Rodriguez-Gutiez

University of Valladolid

Storms of High-Energy Particles: An assignment for OpenMP, MPI, and CUDA/OpenCL

University of Erlangen-Nuremberg

Applying the Execution-Cache-Memory Model: Current State of Practice

James H. Rogers

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Thomas B. Rolinger

University of Maryland

Laboratory for Physical Sciences at University of Maryland

Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture

Lawrence Berkeley National Laboratory

Doomsday: Predicting Which Node Will Fail When on Supercomputers

Alana Romanella

Upcoming Events in the HPC Systems Professionals Community

Nvidia Corporation

Exascale Deep Learning for Climate Analytics

Nichols A. Romero

Argonne National Laboratory

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Lawrence Berkeley National Laboratory

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Defense Advanced Research Projects Agency (DARPA)

Domain-Specific System on Chip (DSSoC) Program

Domain-Specific System on Chips (DSSoC)

Nvidia Corporation

Programmable Interactive Visualization of a Core-Collapse Supernova Simulation

University of Grenoble

CPU Overheating Characterization in HPC Systems: a Case Study

Danny Rorabaugh

University of Tennessee

Introduction of Practical Approaches to Data Analytics for HPC with Spark

Jordi Ros-Giralt

Reservoir Labs Inc

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference

Dan A. Rosa de Jesus

Lawrence Berkeley National Laboratory

University of Puerto Rico at Mayaguez

Capsule Networks for Protein Structure Classification

Oak Ridge National Laboratory

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Garrett S. Rose

University of Tennessee

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Arnold Rosenberg

University of Massachusetts

PDC Curriculum Update

Bryan Rosenburg

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Caitlin J. Ross

Rensselaer Polytechnic Institute

In Situ Performance Analysis of Event-driven Simulations to Support the Codesign of Extreme-Scale Systems

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

Parallel-IO in Practice

Fraunhofer ITWM

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

Oak Ridge National Laboratory

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

Students@SC: Careers in Industry, Research Labs, and Academia

Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator

Ohio State University

Associative Instruction Reordering to Alleviate Register Pressure

Lawrence Livermore National Laboratory

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Sourcery Institute

Learning to Lead in HPC - Strategies to Start Your Leadership Journey

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Beihang University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Rigetti Computing

Quantum Computing for Scientific Applications

Nvidia Corporation

PRISM: Predicting Resilience of GPU Applications Using Statistical Methods

Cindy Rubio-González

University of California, Davis

Introduction - 2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

University of Southern California

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

S

University of Minnesota

Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver

Ohio State University

Associative Instruction Reordering to Alleviate Register Pressure

Clemson University

Community Detection Across Emerging Quantum Architectures

University of Notre Dame

Event-Triggered Communication in Parallel Computing

Emmanuelle Saillard

French Institute for Research in Computer Science and Automation (INRIA)

PARCOACH Extension for a Full-Interprocedural Collectives Verification

Intel Corporation

Function/Kernel Vectorization via Loop Vectorizer

Naohisa Sakamoto

Kobe University

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Takeshi Sakamoto

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Vikram Saletore

Intel Corporation

Fast and Accurate Training of an AI Radiologist

Large Minibatch Training on Supercomputers with Improved Accuracy and Reduced Time to Train

Training Speech Recognition Models on HPC Infrastructure

Michael A. Salim

Argonne National Laboratory

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

D.E. Shaw Research

The Parallel Hashed Oct-Tree Algorithm Revisited

Kewal K. Saluja

University of Wisconsin

Fault Tolerant Cholesky Factorization on GPUs

Zebula Sampedro

University of Colorado

Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins

Francesca Samsel

University of Texas

The First Water in the Universe

Benjamin Samudio

American River College

OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine

Ahmed Sanaullah

Boston University

SimBSP: Enabling RTL Simulation for Intel FPGA OpenCL Kernels

Geoffrey Sanders

Lawrence Livermore National Laboratory

PruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

Alexander Sannikov

Intel Corporation

Framework for Scalable Intra-Node Collective Operations Using Shared Memory

Stream Computing of Lattice-Boltzmann Method on Intel Programmable Accelerator Card

Reconfigurable Computing for HPC: Will It Make It this Time?

Benchmarking Scientific Reconfigurable / FPGA Computing

Instituto Superior Técnico

Characterizing Deep-Learning I/O Workloads in TensorFlow

Jibonananda Sanyal

Oak Ridge National Laboratory

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

Dakota State University

Oak Ridge National Laboratory

Mentor-Protégé Informational Session

Georgia Institute of Technology

Detecting MPI Usage Anomalies via Partial Program Symbolic Execution

Runtime for Exascale and Beyond: Convergence or Divergence?

A Unified Runtime for PGAS and Event-Driven Programming

A Preliminary Study of Compiler Transformations for Graph Applications on the Emu System

A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Lawrence Berkeley National Laboratory

The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification

Sajith Sasidharan

Fermi National Accelerator Laboratory

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Lawrence Livermore National Laboratory

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

Tohoku University

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

RIKEN Center for Computational Science

RIKEN Advanced Institute for Computational Science (AICS)

Keynote: The Post-K for General-Purpose, Energy-Efficient, and Sustained Application Performance

Design of Data Management for Multi-SPMD Workflow Programming Model

The ARM HPC Experience: From Testbeds to Exascale

David Sauerwein

University of Erlangen-Nuremberg

Student Cluster Competition Team Panel Presentation

University of North Carolina, Charlotte

Closing Remarks

Best Paper Announcement

Introduction - Workshop on Education for High Performance Computing (EduHPC)

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

Matthew Schabath

Moffitt Cancer Center

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Matthieu Schaller

Leiden Observatory

An Efficient SIMD Implementation of Pseudo-Verlet Lists for Neighbor Interactions in Particle-Based Codes

MD Anderson Cancer Center

Afternoon Keynote – Genomic Profiling of Normal, Premalignant, and Heterogeneous Tissues in Cancer Patients

Lawrence Livermore National Laboratory

Managing HPC Software Complexity with Spack

Florian Scheidegger

High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

University of Lugano

Distributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures

University of California, Irvine

Using Integrated Processor Graphics to Accelerate Concurrent Data and Index Structures

Andreas Schlapka

Micron Technology Inc

Enabling High Performance Memory for the Broad HPC markets

Fabian Schlebusch

RWTH Aachen University

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

Johannes Gutenberg University Mainz

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Argonne National Laboratory

INDIS Showcases Panel: NRE and XNET and Architecture

Joseph Schoonover

Fluid Numerics LLC

Performance Portability Challenges for Fortran Applications

Markus Schordan

Lawrence Livermore National Laboratory

ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning

Facilitating the Adoption of Correctness Tools in HPC Applications

Automatic Generation of Mixed-Precision Programs

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Andreas Schreiber

German Aerospace Center

Introduction - 8th Workshop on Python for High-Performance and Scientific Computing

Sandra Schröder

University of Hamburg

Toward a HPC Certification Program

Thomas C. Schulthess

Swiss National Supercomputing Centre

RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Scot A. Schultz

Mellanox Technologies

Interconnect Your Future with Mellanox “Smart” Interconnect

Meeting HPC Container Challenges as a Community

University of Texas

Institute for Computational Engineering and Sciences, Dell Medical School

Invited Talk: Current Status of the OpenHPC Project

Introduction - ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

OpenHPC Community BoF

ESPM2 2018: Closing Remarks

ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

Leibniz Supercomputing Centre

Hot Topics Discussion I: Thriving at Work

Developing Workplace Resilience and Managing Stress

Technical University Munich

FlipTracker: Understanding Natural Error Resilience in HPC Applications

Software Development Tools for HPC and at Scale

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

The Message Passing Interface (MPI): Version 4.0 and Beyond

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack

How to Analyze the Performance of Parallel Codes 101

7th Workshop on Extreme-Scale Programming Tools (ESPT)

Richard Schulze

University of Münster

Portable Parallel Performance via Multi-Dimensional Homomorphisms

Catherine Schuman

Oak Ridge National Laboratory

Use Cases of Neuromorphic Co-Processors in Future HPC Environments

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Thomas R. W. Scogland

Lawrence Livermore National Laboratory

Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining

The Green 500: Trends in Energy Efficient Supercomputing

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Vincenzo Scotti

University of Naples Federico II

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Indiana University

Presenting / Communication

Early Career Survival Guide for Successful Communication: Preparing Effective Grant Proposals, Publications, and Presentations

William Scullin

Argonne National Laboratory

Panel: Interactivity in Supercomputing

Introduction - 8th Workshop on Python for High-Performance and Scientific Computing

Oak Ridge National Laboratory

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Md Syadus Sefat

Texas State University

Hardware Acceleration of CNNs with Coherent FPGAs

Fermi National Accelerator Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Data-Parallel Python for High Energy Physics Analyses

Hans-Peter Seidel

Max Planck Institute for Informatics

faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU

Satoshi Sekiguchi

National Institute of Advanced Industrial Science and Technology (AIST)

ABCI – Envisioning High Performance Human-Centered AI for Industry

Jinsil Hwaryoung Seo

Texas A&M University

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

Madhavan Seshadri

Nanyang Technological University, Singapore

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)

Brad Settlemyer

Los Alamos National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Scaling Embedded In Situ Indexing with DeltaFS

Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance

William M. Severa

Sandia National Laboratories

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Swiss Army Programming: Performance and Portability from Modern Tools

Effective Performance Portability

Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability, and Productivity

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Lawrence Berkeley National Laboratory

A Deferred Correction Coupling Strategy for Cosmological Simulations

General Atomics

Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems

Shawn Shacterman

University of California, Berkeley

OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine

Extreme Networks

Plug and Play IP Fabrics

Mellanox Technologies

Interconnect Your Future with Mellanox “Smart” Interconnect

Unified Communication X (UCX) Community

InfiniBand In-Network Computing Technology and Roadmap

Lawrence Berkeley National Laboratory

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

Pros and Cons of HPCx benchmarks

OpenSHMEM in the Era of Exascale

Unified Communication X (UCX) Community

Lawrence Berkeley National Laboratory

Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression

Ohio State University

Designing High-Performance, Resilient, and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters

Mallikarjun Shankar

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Intel Corporation

CosmoFlow: Using Deep Learning to Learn the Universe at Scale

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Ruslan Shaydulin

Clemson University

Community Detection Across Emerging Quantum Architectures

North Carolina State University

Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines

University of Oregon

Students@SC: Careers in Industry, Research Labs, and Academia

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

OpenSHMEM in the Era of Exascale

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Mark S. Shephard

Rensselaer Polytechnic Institute

Dynamic Load Balancing of Plasma and Flow Simulations

Lawrence Livermore National Laboratory

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

Rice University

Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver

The Difference Between HPC on Premises and in the Cloud

National Center for High-Performance Computing, Taiwan

Panel 3: The Path from HPC to AI in Taiwan's NCHC

Satoshi Shigematsu

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

Toshiyuki Shimizu

Fujitsu Laboratories Ltd

Post-K Supercomputer with Fujitsu's Original CPU, Powered by Arm ISA

Kumamoto University

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Takashi Shimokawabe

University of Tokyo

Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers

Los Alamos National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Shahrzad Shirzad

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

University of California, Irvine

OpenACC Routine Directive Propagation Using Interprocedural Analysis

Fumiyoshi Shoji

RIKEN Center for Computational Science

HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments

National Center for Supercomputing Applications

Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights

Sohil Lal Shrestha

University of Texas, Arlington

Precomputing Outputs of Hidden Layers to Speed Up Deep Neural Network Training

Argonne National Laboratory

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Technical University Darmstadt

Understanding the Scalability of Molecular Simulation Using Empirical Performance Modeling

Argonne National Laboratory

MPICH: A High Performance Open-Source MPI Implementation

Taniya Siddiqua

Advanced Micro Devices Inc

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

Christopher Siefert

Sandia National Laboratories

Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing

Stephen F. Siegel

University of Delaware

Toward Deductive Verification of Message-Passing Parallel Programs

Pázmány Péter Catholic University, Hungary

Heterogeneous CPU-GPU Execution of Stencil Applications

Texas Tech University

Welcome, Workshop Goals, and Opening Remarks

Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish

Simulating Data Centers with Redfish-Enabled Equipment

Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

IRISA, INSA Rennes

Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud

Federal University of Rio de Janeiro

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

Steven Silvester

Project Jupyter

Panel: Interactivity in Supercomputing

Lawrence Berkeley National Laboratory

Energy Sciences Network (ESnet)

SDN for End-to-End Networked Science at the Exascale (SENSE)

Oak Ridge National Laboratory

BESPOKV: Application Tailored Scale-Out Key-Value Stores

Nikolay A. Simakov

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Christian Simmendinger

T-Systems Solutions for Research

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

University of Texas, Dallas

OpenHPC Community BoF

Christoph Simon

University of Calgary

Quantum Communication Networks and Technologies

Lawrence Berkeley National Laboratory

TOP500 Supercomputers

Invited Talk Session 3

University of Maryland, Baltimore County

Students@SC: Careers in Industry, Research Labs, and Academia

Anthony Simonet

Rutgers University

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

Christopher Simpkin

Cardiff University

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Nvidia Corporation

Cloud Infrastructure Solutions To Run HPC Workloads

CCIX Consortium

CCIX: Seamless Data Movement for Accelerated Applications

Robert Sinkovits

San Diego Supercomputer Center

Data Science and HPC Education and Outreach

Chaitanya Prasad Sishtla

KTH Royal Institute of Technology

Characterizing Deep-Learning I/O Workloads in TensorFlow

Roberto R. Sisneros

University of Illinois

National Center for Supercomputing Applications

Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience

Science and Technology Facilities Council, UK

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

Anthony Skjellum

University of Tennessee, Chattanooga

Introduction - Workshop on Exascale MPI (ExaMPI)

Workshop on Exascale MPI (ExaMPI)

Impacting Cancer with HPC: Opportunities and Challenges

Andrew E. Slaughter

Idaho National Laboratory

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Elliott Slaughter

SLAC National Accelerator Laboratory

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

Los Alamos National Laboratory

The First Water in the Universe

Cameron W. Smith

Rensselaer Polytechnic Institute

Dynamic Load Balancing of Plasma and Flow Simulations

University of Maine

Advanced Event Sampling Support for PAPI

University of Colorado

Hybrid Quantum-Classical Computing Architectures

West Virginia Science and Research

Marshall University

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Jeremy C. Smith

Oak Ridge National Laboratory

University of Tennessee

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

Preston M. Smith

Purdue University

Machine Learning and AI

University of Arizona

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

Lawrence Berkeley National Laboratory

Containers in HPC

University of Illinois

Runtime for Exascale and Beyond: Convergence or Divergence?

Fast and Generic Concurrent Message-Passing

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Toward Understanding I/O Behavior in HPC Workflows

A Year in the Life of a Parallel File System

University of Warsaw

Student Cluster Competition Team Panel Presentation

Shuaiwen Leon Song

Pacific Northwest National Laboratory

Binarized ImageNet Inference in 29us

National Institute of Technology, Tiruchirappalli

Accelerating 2D FFT: Exploit GPU Tensor Cores through Mixed-Precision

Jerome Soumagne

Methodology for the Rapid Development of Scalable HPC Data Services

Enabling Data Services for HPC

University of Cincinnati

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Annemarie Southwell

Nvidia Corporation

OpenMP GPU Offload in Flang and LLVM

SciNet HPC Consortium

University of Toronto

Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center

Brno University of Technology

Faculty of Information Technology

Optimization of Ultrasound Simulations on Multi-GPU Servers

Maria Spiropulu

California Institute of Technology

Quantum Communication Networks and Technologies

Sandia National Laboratories

PyHPC Lightning Talks

Introduction - 8th Workshop on Python for High-Performance and Scientific Computing

Becky Springmeyer

Lawrence Livermore National Laboratory

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Jeffrey Squyres

Open MPI State of the Union 2018

Vilas Sridharan

Advanced Micro Devices Inc

Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo

Sudhir Srivastava

National Cancer Institute

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Purdue University

xCAT and Masterless Puppet: Aiming for Ideal Configuration Management

Frederick National Laboratory for Cancer Research

Introduction - Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Impacting Cancer with HPC: Opportunities and Challenges

Texas Advanced Computing Center

University of Texas

The New NSF-Funded Resource: Frontera - Towards a Leadership Class Computing Facility

Spectra Logic Corporation

Exascale Archiving - Challenges and Opportunities

Nicholas Stegmeier

South Dakota State University

Optimizing Next Generation Hydrodynamics Code for Exascale Systems

Markus Steinberger

Graz University of Technology

Max Planck Institute for Informatics

faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU

Zuse Institute Berlin

Impacting Cancer with HPC: Opportunities and Challenges

Los Alamos National Laboratory

Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems

Laurie A. Stephey

Lawrence Berkeley National Laboratory

National Energy Research Scientific Computing Center (NERSC)

Optimizing Python Data Processing for the DESI Experiment on the NERSC Cori Supercomputer

Boston University

A Novel Approach to Supporting Communicators for In-Switch Processing of MPI Collectives

Argonne National Laboratory

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

University of Illinois

Argonne National Laboratory

Spack Community BoF

Managing HPC Software Complexity with Spack

Claire Frist-Stirm

Purdue University

Best Practices from Organizations on Improving Workplace Diversity

Victoria Stodden

University of Illinois

CRE2018 – Plenary I

HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation

Assessing Reproducibility: An Astrophysical Example of Computational Uncertainty in the HPC Context

University of Texas

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Larisa Stoltzfus

University of Edinburgh

Is Data Placement Optimization Still Relevant on Newer GPUs?

Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling

Christopher P. Stone

US Department of Defense HPC Modernization Program

Engility Corporation

Synthetic Data Generation for Evaluating Parallel I/O Compression Performance and Scalability

Refactoring and Optimizing Multiphysics Combustion Models for Data Parallelism

Princeton University

Visualizing Outbursts of Massive Stars

University of Illinois

Interactivity in HPC

Advanced Micro Devices Inc

The Next Wave of HPC in the Datacenter

Texas Instruments

Programming Your GPU with OpenMP: A Hands-On Introduction

Quentin F. Stout

University of Michigan

Parallel Computing 101

Tjerk P. Straatsma

Oak Ridge National Laboratory

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Peter E. Strazdins

Australian National University

A Comprehensive Parallel Computing Curriculum: From Second Year to Professionals

Erich Strohmaier

Lawrence Berkeley National Laboratory

TOP500 Supercomputers

The Green 500: Trends in Energy Efficient Supercomputing

University of Texas

TACC's Cloud Deployer: Automating the Management of Distributed Software Systems

Hinnerk Stüben

University of Hamburg

Toward a HPC Certification Program

Alejandro Suarez

National Science Foundation

The Future of NSF Supported Advanced Cyberinfrastructure

Forschungszentrum Juelich

Achieving Performance on Large-Scale Intel Xeon-Based Systems

Rutgers University

Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

Ohio State University

Cooperative Rendezvous Protocols for Improved Performance and Overlap

ESPM2 2018: Closing Remarks

Introduction - ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

InfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing HEC Systems, and Usage

InfiniBand, Omni-Path, and High-Speed Ethernet for Beginners

ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

Argonne National Laboratory

Hybrid Quantum-Classical Computing Architectures

Slippery Rock University of Pennsylvania

Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Fifth SC Workshop on Best Practices for HPC Training and Education

Keynote Address

Productive and Performant AI Platforms of the Future

Aravind Sukumaran-Rajam

Ohio State University

Associative Instruction Reordering to Alleviate Register Pressure

Michael B. Sullivan

Nvidia Corporation

Evaluating and Accelerating High-Fidelity Error Injection for HPC

Optimizing Software-Directed Instruction Replication for GPU Error Detection

Auburn University

Understanding the Usage of MPI in Exascale Proxy Applications

National Institutes of Health

Introduction - Computational Phenomics @Scale: From Supercomputers to Bedside

The Impact of Deep Learning and Artificial Intelligence in Radiology

Georgia Institute of Technology

HiCOO: Hierarchical Storage of Sparse Tensors

Illinois Institute of Technology

Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing

MCHPC’18: Workshop on Memory Centric High Performance Computing

Indian Institute of Tropical Meteorology

Visualization of Droplet Dynamics in Cloud Turbulence

Supreeth Suresh

University of Wyoming

Using Thrill to Process Scientific Data on HPC

University of Maryland

PDC Curriculum Update

Frédéric Suter

IN2P3 Computing Center, National Center for Scientific Research (CNRS)

WRENCH: A Framework for Simulating Workflow Management Systems

SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation

Microsoft Corporation

Applications of Deep Learning in Industry and Research

Kuniyasu Suzaki

National Institute of Advanced Industrial Science and Technology (AIST)

FlowOS-RM: Disaggregated Resource Management System

Technical University of Ostrava, Czech Republic

HPC-as-a-Service for Life Sciences

KTH Royal Institute of Technology

Energy Efficiency Considerations for HPC Procurements

Texas A&M University

Incremental Static Race Detection in OpenMP Programs

T

Deep Learning at Scale on Nvidia V100 Accelerators

Fast and Accurate Training of an AI Radiologist

Kyushu Institute of Technology

Numerical Simulation of a Flue Instrument with Finite-Difference Lattice Boltzmann Method using GPGPU

Philip A. Taffet

Rice University

Lawrence Livermore National Laboratory

Exploring Application Performance on Fat-Tree Networks in the Presence of Congestion

University of Utah

PARLOT: Efficient Whole-Program Call Tracing for HPC Applications

Daisuke Takahashi

Railway Technical Research Institute, Japan

Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis

National Institute of Advanced Industrial Science and Technology (AIST)

FlowOS-RM: Disaggregated Resource Management System

Data Analytics for System and Facility Energy Management

Hiroyuki Takizawa

Tohoku University

A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems

University of Indianapolis

Experience Report: 4 Years of Teaching Cloud Computing and Big Data at the University Level

Beijing Technology and Business University

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

University of California, San Diego

Tongji University

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

Hideyuki Tanaka

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Japan Telegraph and Telephone Corporation

Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning

University of Hawaii at Manoa

WRENCH: A Framework for Simulating Workflow Management Systems

Kogakuin University

MGRIT Preconditioned Krylov Subspace Method

Lawrence Berkeley National Laboratory

Evaluation of HPC Application I/O on Object Storage Systems

Anycast: Rootless Broadcasting with MPI

Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems

Princeton University

Approximating for Faster, Better and Cheaper Scientific Computing

Tsinghua University

Qatar Computing Research Institute

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

University of Chicago

The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects

Lawrence Berkeley National Laboratory

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

University of Alabama

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

University of Delaware

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Panel Discussion

SC: The Conference

Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Flux: Overcoming Scheduling Challenges for Exascale Workflows

Introduction of Practical Approaches to Data Analytics for HPC with Spark

SC19 Conference Preview

University of Tokyo

Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading

US Air Force Research Laboratory

Refactoring and Optimizing Multiphysics Combustion Models for Data Parallelism

Cardiff University

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Office of Science and Technology Policy (OSTP)

Quantum Communication Networks and Technologies

Argonne National Laboratory

University of Chicago

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

ACM and IEEE-CS Award Presentations

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

Keita Teranishi

Sandia National Laboratories

Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Christian Terboven

RWTH Aachen University

Mastering Tasking with OpenMP

Advanced OpenMP: Host Performance and 5.0 Features

Théophile Terraz

French Institute for Research in Computer Science and Automation (INRIA)

Large Scale Computation of Quantiles Using MELISSA

University of Texas

TACC's Cloud Deployer: Automating the Management of Distributed Software Systems

Barcelona Supercomputing Center

Mastering Tasking with OpenMP

University of Notre Dame

A Lightweight Model for Right-Sizing Master-Worker Applications

Reduction of Workflow Resource Consumption Using a Density-based Clustering Model

Argonne National Laboratory

Advanced MPI Programming

Arnold Tharrington

Oak Ridge National Laboratory

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Laura Theademan

Purdue University

Best Practices from Organizations on Improving Workplace Diversity

Maxence Thevenet

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Jayaraman J. J. Thiagarajan

Lawrence Livermore National Laboratory

Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing

Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance

Syska Hennessy Group

High Performance Computing (HPC) Data Center Planning and TCO: A Case Study and Roadmap

Owen G. M. Thomas

Red Oak Consulting

The Business of HPC: TCO, Funding Models, Metrics, Value, and More

Procurement and Commissioning of HPC Systems

Rollin C. Thomas

Lawrence Berkeley National Laboratory

National Energy Research Scientific Computing Center (NERSC)

Optimizing Python Data Processing for the DESI Experiment on the NERSC Cori Supercomputer

Panel: Interactivity in Supercomputing

Introduction - 8th Workshop on Python for High-Performance and Scientific Computing

Interactive HPC Deep Learning with Jupyter Notebooks

Container Computing for HPC and Scientific Workflows

Christopher Thompson

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

SENSEI Cross-Platform View of In Situ Analytics

University of St Andrews

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

University of Massachusetts

Keynote Talk: Student Engagement: View from the Trenches

Technical University Darmstadt

A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs

Andreas F. Tillack

Oak Ridge National Laboratory

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Jenett Tillotson

Purdue University

Message from the SIGHPC SYSPROS Virtual Chapter President

Jesmin Jahan Tithi

Intel Corporation

Students@SC: Making the Best of Your HPC Education

Northeastern University

Data Analytics for System and Facility Energy Management

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

Max Planck Institute of Molecular Cell Biology and Genetics

HPC-as-a-Service for Life Sciences

Yasumoto Tomita

Fujitsu Laboratories Ltd

DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping

University of Tennessee

Harnessing GPU Tensor Cores for Fast FP16 Arithmetic to Speed Up Mixed-Precision Iterative Refinement Solvers

MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines

Dai-Hai Ton-That

DePaul University

Semantically Organized Containers for Reproducible Research

Thierry Tonellot

Toward Smoothing Data Movement Between RAM and Storage

Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation

Technical University of Valencia

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Belén Torrente Torrente

Appentra Solutions

Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle

Georgia Tourassi

Oak Ridge National Laboratory

AI-Enabled Disease Phenotyping: Opportunities and Computational Challenges

Introduction - Computational Phenomics @Scale: From Supercomputers to Bedside

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

University of Notre Dame

A Lightweight Model for Right-Sizing Master-Worker Applications

Reduction of Workflow Resource Consumption Using a Density-based Clustering Model

National Center for Supercomputing Applications

Big Data Challenge - How to Engage with Large Scale Facilities?

Kathryn Traxler

Louisiana State University

Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins

Bradley E. Treeby

University College London

Biomedical Ultrasound Group

Optimization of Ultrasound Simulations on Multi-GPU Servers

Nvidia Corporation

Runtime for Exascale and Beyond: Convergence or Divergence?

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

Exascale Deep Learning for Climate Analytics

Nicolas Tripoul

University of British Columbia

PruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

Christian Trott

Sandia National Laboratories

Swiss Army Programming: Performance and Portability from Modern Tools

Matthias Troyer

Microsoft Corporation

A Quantum Future of Computation

High Level Programming Languages for Quantum Computation

University of Warwick

Pointers Inside Lambda Closure Objects in OpenMP Target Offload Regions

Gretar Tryggvason

Johns Hopkins University

Event-Triggered Communication in Parallel Computing

Nvidia Corporation

Optimizing Software-Directed Instruction Replication for GPU Error Detection

Miyuki Tsubouchi

Riken Center for Computational Science

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Riken Center for Computational Science

Design of Data Management for Multi-SPMD Workflow Programming Model

Lawrence Berkeley National Laboratory

University of California, Berkeley

Identifying Network Data Transfer Bottlenecks in HPC Systems

Pacific Northwest National Laboratory

Adaptive Anonymization of Data with b-Edge Covers

Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Enabling High-Level Graph Processing via Dynamic Tasking

HPC Graph Toolkits and the GraphBLAS Forum

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

University of Edinburgh

Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training

Intel Corporation

Function/Kernel Vectorization via Loop Vectorizer

U

Stream Computing of Lattice-Boltzmann Method on Intel Programmable Accelerator Card

Juliette Ugirumurera

Lawrence Berkeley National Laboratory

High Performance Computing in Dynamic Traffic Simulation

Maui High Performance Computing Center

Performance and Communication Modeling for Exascale Proxy Architecture in Aspen

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

Barcelona Supercomputing Center

Characterization of the Impact of Soft Errors on Iterative Methods

Toward Ad Hoc Recovery For Soft Errors

Approximating a Multi-Grid Solver

Dimuthu Upeksha

Indiana University

SciGaP: Apache Airavata Hosted Science Gateways

Argonne National Laboratory

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

Pittsburgh Supercomputing Center

Strategies for Inclusive and Scalable HPC Outreach and Education

Evaluating the Wide Area Classroom after 10,500 HPC Students

EDF Research and Development

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

SCI Institute, University of Utah

Intel Corporation

libIS: A Lightweight Library for Flexible In Transit Visualization

Hayato Ushijima-Mwesigwa

Clemson University

Community Detection Across Emerging Quantum Architectures

Putchong Uthayopas

Kasetsart University, Thailand

Panel 3: HPC Status in Thailand

V

Farshid Vahedifard

Mississippi State University

Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC

University of Southern California, Information Sciences Institute

Enabling Data Analytics Workflows Using Node-Local Storage

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Ramachandran Vaidyanathan

Louisiana State University

PDC Curriculum Update

Barcelona Supercomputing Center

Polytechnic University of Catalonia

Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs

Ruud van der Pas

Advanced OpenMP: Host Performance and 5.0 Features

University of Amsterdam

Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners

Brian Van Essen

Lawrence Livermore National Laboratory

Exascale Machine Learning

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Scalable Deep Ensemble Learning for Cancer Drug Discovery

Keynote: Learning-Based Predictive Models: a New Approach to Integrating Large-Scale Simulations and Experiments

Brian Van Straalen

Lawrence Berkeley National Laboratory

A Low-Communication Method to Solve Poisson's Equation on Locally-Structured Grids

UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes

SciNet HPC Consortium

University of Toronto

Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center

Matt Vander Werf

University of Notre Dame

Compliant Cloud+Campus Hybrid HPC Infrastructure

Vanderbilt University

Getting Scientific Software Installed

Srinivas Varadharajan

Fast and Accurate Training of an AI Radiologist

Ana Lucia Varbanescu

University of Amsterdam

Mix-and-Match: A Model-Driven Runtime Optimization Strategy for BFS on GPUs

Rice University

Hardware Transactional Persistent Memory

Priya Vashishta

University of Southern California

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Courtenay Vaughan

Sandia National Laboratories

Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications

Brno University of Technology

Faculty of Information Technology

Optimization of Ultrasound Simulations on Multi-GPU Servers

Lawrence Berkeley National Laboratory

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Sudharshan S. Vazhkudai

Oak Ridge National Laboratory

GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Using Darshan and CODES to Evaluate Application I/O Performance

Mariano Vazquez

Barcelona Supercomputing Center

Personalized Medicine and HPC

Davide Venturelli

Quantum Computing for Scientific Applications

Becky Verastegui

Oak Ridge National Laboratory

SC: The Conference

Veronica G. Vergara Larrea

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Laurens Versluis

Vrije University Amsterdam

A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments

Merijn Elwin Verstraaten

University of Amsterdam

Mix-and-Match: A Model-Driven Runtime Optimization Strategy for BFS on GPUs

Stefaan Vervaet

Western Digital Corporation

Exascale Archiving - Challenges and Opportunities

Jeffrey S. Vetter

Oak Ridge National Laboratory

Siena: Exploring the Design Space of Heterogeneous Memory Systems

DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access

Introduction - The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing

Implementing Efficient Data Compression and Encryption in a Persistent Key-Value Store for HPC

Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems

Benchmarking Scientific Reconfigurable / FPGA Computing

Volunteer Opportunities for SC Conference Planning

Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations

Revisiting the 2008 ExaScale Computing Study and Venturing Predictions for 2028

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Clacc: Translating OpenACC to OpenMP in Clang

The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

RWTH Aachen University

Westphalian University of Applied Sciences, Bocholt

Visual Analytics Challenges in Analyzing Calling Context Trees

Venkatram Vishwanath

Argonne National Laboratory

Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

libIS: A Lightweight Library for Flexible In Transit Visualization

Versal: The New Xilinx Adaptive Compute Acceleration Platforms (ACAP)

Palacký University Olomouc, Czech Republic

HPC-as-a-Service for Life Sciences

Interactive HPC Deep Learning with Jupyter Notebooks

Gwendolyn Renae Voskuilen

Sandia National Laboratories

Open-Source Modeling and Simulation

Panel: Open-Source Software

University of Paderborn

Understanding the Scalability of Molecular Simulation Using Empirical Performance Modeling

Lawrence Livermore National Laboratory

Lawrence Berkeley National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

Georgia Institute of Technology

HiCOO: Hierarchical Storage of Sparse Tensors

Students@SC: Making the Best of Your HPC Education

Modeling Single-Source Shortest Path Algorithm Dynamics to Control Performance and Power Tradeoffs

W

Louisiana State University

Asynchronous Execution of Python Code on Task Based Runtime Systems

Nvidia Corporation

Application Porting and Optimization on GPU-Accelerated POWER Architectures

Intel Corporation

libIS: A Lightweight Library for Flexible In Transit Visualization

LLVM and the Automatic Vectorization of Loops Invoking Math Routines: -fsimdmath

André Walker-Loud

Lawrence Berkeley National Laboratory

Lawrence Livermore National Laboratory

Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Wolfgang A. Wall

Technical University Munich

Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?

Dylan E. Wallace

Coastal Carolina University

Improving Application Resilience by Extending Error Correction with Contextual Information

University of Texas

Tools and Best Practices for Distributed Deep Learning with Supercomputers

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Sanjay Wandhekar

Centre for Development of Advanced Computing, India

Panel 2: HPC in India

University of St Andrews

National Supercomputing Center, Wuxi

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Renaissance Computing Institute (RenCI)

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Oxford Thermofluids Institute

University of Oxford

Software Prefetching for Unstructured Mesh Applications

University of Chicago

Script of Scripts Polyglot Notebook and Workflow System

Hong Kong Baptist University

GPGPU Performance Estimation with Core and Memory Frequency Scaling

University of Chicago

Reproducibility as Side Effect

Lawrence Berkeley National Laboratory

A Year in the Life of a Parallel File System

Boston University

Binarized ImageNet Inference in 29us

Energy Efficiency of Reconfigurable Caches on FPGAs

Hong Kong University of Science and Technology

SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition

Tongji University

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

Texas Tech University

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

Illinois Institute of Technology

Study of Performance Variability on Dragonfly Systems

Chinese Academy of Sciences

Designing and Building Next-Generation Computer Systems for Deep Learning

Rutgers University

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

Descartes Labs Inc

The Parallel Hashed Oct-Tree Algorithm Revisited

Todd Warszawski

Stanford University

Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes

Tohoku University

NEC Corporation

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Georgia Institute of Technology

A Fast and Simple Approach to Merge and Merge Sorting Using Wide Vector Instructions

Colorado College

Building a Low Budget Cluster Through Hardware Reuse

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

University of Maine

Advanced Event Sampling Support for PAPI

Lawrence Berkeley National Laboratory

University of California, Davis

Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Automated Labeling of Electron Microscopy Images Using Deep Learning

NEC Laboratories Europe

NEC Corporation

Sol: Transparent Neural Network Acceleration Platform

Shodor Education Foundation

Fifth SC Workshop on Best Practices for HPC Training and Education

University of Massachusetts

PDC Curriculum Update

Lawrence Livermore National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

AI Matrix – Synthetic Benchmarks for DNN

Shenzhen Institutes of Advanced Technology

FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Deborah Weighill

Oak Ridge National Laboratory

University of Tennessee

Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction

Michèle Weiland

University of Edinburgh

Multi-Level Memory and Storage for HPC and Data Analytics

Lawrence Livermore National Laboratory

Students@SC: Making the Best of Your HPC Education

University of Minnesota

Dynamically Negotiating Capacity Between On-Demand and Batch Clusters

Oak Ridge National Laboratory

A Compiler and Profiler Based Tool for Querying HPC Application Characteristics

Parallel-IO in Practice

Gerhard Wellein

University of Erlangen-Nuremberg

Erlangen Regional Computing Center

Applying the Execution-Cache-Memory Model: Current State of Practice

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

Node-Level Performance Engineering

Andrew Wellington

National Computational Infrastructure, Australia

PBS Pro Open Source Project Community BoF

Oak Ridge National Laboratory

Session 1: WACCPD Keynote: Experiences in Using Directive-Based Programming for Accelerated Computing Architectures

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Shanghai Jiao Tong University

Software Engineering and Reuse in Computational Science and Engineering

Sebastian Werner

University of California, Davis

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

University of Victoria, British Columbia

LOS: Level Order Sampling for Task Graph Scheduling on Heterogeneous Resources

Brendan T. Whitaker

Ohio State University

University of Chicago

Measuring Swampiness: Quantifying Chaos in Large Heterogeneous Data Repositories

Johns Hopkins University

National Research Infrastructure: Collaborative Session

Fermi National Accelerator Laboratory

Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities

Joseph P. White

State University of New York at Buffalo

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

University of Illinois

Charm++ and AMPI: Adaptive and Asynchronous Parallel Programming

Intelligent Light

SENSEI Cross-Platform View of In Situ Analytics

RWTH Aachen University

5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks

Session 2: Porting Scientific Applications Using Directives

Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Brandon Wiggins

Southern Utah University

The First Water in the Universe

Marvell Technology Group LTD

Bringing Innovation to the HPC Market with Marvell’s ThunderX2 Processors

Argonne National Laboratory

Mentor-Protégé Informational Session

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

Hewlett Packard Enterprise

Leibniz Supercomputing Centre

Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Power API and Redfish: Standardizing Power Measurement and Control for HPC

Data Analytics for System and Facility Energy Management

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Sandia National Laboratories

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

David Wilkinson

University of Pittsburgh

Supporting Thorough Artifact Evaluation with Occam

Technical University Dresden

Top-Down Performance Analysis of Workflow Applications

University of Arizona

Asynchronous Execution of Python Code on Task Based Runtime Systems

Samuel Williams

Lawrence Berkeley National Laboratory

Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

Performance Tuning of Scientific Codes with the Roofline Model

New Mexico Consortium

Heterogeneous Memory and Arena-Based Heap Allocation

Durham University

Institute for Computational Cosmology

An Efficient SIMD Implementation of Pseudo-Verlet Lists for Neighbor Interactions in Particle-Based Codes

Case Western Reserve University

Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images

Ellis H. Wilson

Architecture of a Next-Generation Object Storage Device in the Panasas Filesystem

Lucas A. Wilson

Fast and Accurate Training of an AI Radiologist

Applications of Deep Learning in Industry and Research

Argonne National Laboratory

Computing, Environment and Life Science Division

SDN for End-to-End Networked Science at the Exascale (SENSE)

Graz University of Technology

faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU

University of Tennessee

Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator

Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter

Aleksandra Wisz

Intel Corporation

DAQDB - a Distributed Key-Value Store for Petascale Hot Storage

Humboldt University of Berlin

LOS: Level Order Sampling for Task Graph Scheduling on Heterogeneous Resources

Los Alamos National Laboratory

Next-Generation Cluster Management Software

Engility Corporation

The ARM HPC Experience: From Testbeds to Exascale

Technical University Darmstadt

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

ESPT Closing Remarks

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

Understanding the Scalability of Molecular Simulation Using Empirical Performance Modeling

7th Workshop on Extreme-Scale Programming Tools (ESPT)

Performance and Energy Analysis

Oak Ridge National Laboratory

Feature-Relevant Data Reduction for In Situ Workflows

Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

SENSEI Cross-Platform View of In Situ Analytics

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Nvidia Corporation

PGI Compilers and Tools

High Performance OpenMP for GPUs

OpenMP GPU Offload in Flang and LLVM

OpenACC Routine Directive Propagation Using Interprocedural Analysis

Rensselaer Polytechnic Institute

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

Codeplay Software Ltd

Distributed and Heterogeneous Programming in C++ for HPC 2018

Specifying Rack Level High Density Liquid Cooling Solutions

Lawrence Livermore National Laboratory

A Flexible System For In Situ Triggers

Argonne National Laboratory

Methodology for the Rapid Development of Scalable HPC Data Services

Toward Understanding I/O Behavior in HPC Workflows

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Keysight Technologies Inc

Cybersecurity Considerations and Best Practices for Supercomputers

Christopher J. Wright

Columbia University

Reproducibility for Streaming Analysis

University of Delaware

Estimating Molecular Dynamics Chemical Shift with GPUs

Nicholas J. Wright

Lawrence Berkeley National Laboratory

A Year in the Life of a Parallel File System

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

Steven A. Wright

University of York

Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Pointers Inside Lambda Closure Objects in OpenMP Target Offload Regions

The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Resilience 3: GPUs

New Jersey Institute of Technology

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Optimizing the Throughput of Storm-Based Stream Processing in Clouds

North Carolina State University

Efficient Deployment of Irregular Computations on Multi- and Many-Core Architectures

A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms

Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA

Understanding SSD Reliability in Large-Scale Cloud Systems

University of California, Merced

Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs

Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory

Lawrence Berkeley National Laboratory

Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction

High Performance I/O Frameworks 101

University of Houston

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

Fermi National Accelerator Laboratory

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

University of Chicago

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

Full State Quantum Circuit Simulation by Using Lossy Data Compression

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Argonne National Laboratory

University of Chicago

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Frank Wuerthwein

University of California, San Diego

National Research Infrastructure: Collaborative Session

University of Delaware

University of Tennessee

Introduction of Practical Approaches to Data Analytics for HPC with Spark

X

Samuel Xavier de Souza

Federal University of Rio Grande do Norte

PaScal Viewer: A Tool for the Visualization of Parallel Scalability Trends

University of Minnesota

Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver

Argonne National Laboratory

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Idaho National Laboratory

A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks

Yale University

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

Beihang University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

University of California, Davis

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

Georgia Institute of Technology

Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction

MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines

Ohio State University

Understanding SSD Reliability in Large-Scale Cloud Systems

Beijing Sogou Technology Development Company

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

AI Matrix – Synthetic Benchmarks for DNN

Deep Learning at Scale on Nvidia V100 Accelerators

Texas Advanced Computing Center

Tools and Best Practices for Distributed Deep Learning with Supercomputers

Understanding SSD Reliability in Large-Scale Cloud Systems

Tsinghua University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Understanding Potential Performance Issues Using Resource-Based alongside Time Models

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Beihang University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Y

Japan Atomic Energy Agency

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Takateru Yamagishi

Research Organization for Information Science and Technology, Japan

Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers

Takuma Yamaguchi

University of Tokyo

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Yusaku Yamamoto

University of Electro-Communications, Japan

Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices

Susumu Yamashita

Japan Atomic Energy Agency

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Ichitaro Yamazaki

University of Tennessee

MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines

Mentor-Protégé Informational Session

University of South Carolina

Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing

MCHPC’18: Workshop on Memory Centric High Performance Computing

Yuka Yanagisawa

Waseda University

Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices

H3 Platform Inc

A Cost-Effective Flexible System Optimized for DNN and ML

University of California, Davis

Lawrence Berkeley National Laboratory

Linear Algebra Is the Right Way to Think About Graphs

Lawrence Berkeley National Laboratory

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Performance Tuning of Scientific Codes with the Roofline Model

A Case Study for Performance Portability Using OpenMP 4.5

Tsinghua University

National Supercomputing Center, Wuxi

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Fast and Accurate Training of an AI Radiologist

University of Maryland

Mid-Atlantic Crossroads

SDN for End-to-End Networked Science at the Exascale (SENSE)

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Y. Richard Yang

Yale University

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

Intel Corporation

Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity

The Algorithm and Framework Designs and Optimizations for Scalable Automata Processing on HPC Platforms

Israel Institute of Technology

Processing-in-Storage Architecture for Machine Learning and Bioinformatics

Open Research Cloud Alliance

Federated Cloud: An Evolutionary Path from Grid Computing

Georgia Institute of Technology

Detecting MPI Usage Anomalies via Partial Program Symbolic Execution

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Stony Brook University

Supercomputing for the Multi-Driver Routing

International Center for Advanced Internet Research (iCAIR)

Northwestern University

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

H3 Platform Inc

A Cost-Effective Flexible System Optimized for DNN and ML

Katherine Yelick

Lawrence Berkeley National Laboratory

Extreme Scale De Novo Metagenome Assembly

Learning to Lead in HPC - Strategies to Start Your Leadership Journey

Invited Talk Session 4

Invited Talk Session 2

Oak Ridge National Laboratory

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Northwest University, China

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Harvard University

On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse

Kogakuin University

MGRIT Preconditioned Krylov Subspace Method

Rutgers University

A Parallelism Profiler with What-If Analyses for OpenMP Programs

Mitsuo Yokokawa

Kobe University

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Tokyo Institute of Technology

Convergence between HPC and Big Data: The Day After Tomorrow

Lawrence Livermore National Laboratory

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

University of California, Davis

FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects

Ulsan National Institute of Science and Technology

Dynamic Load Balancing of Plasma and Flow Simulations

Oak Ridge National Laboratory

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

Kazutomo Yoshii

Argonne National Laboratory

Reconfigurable Computing for HPC: Will It Make It this Time?

Benchmarking Scientific Reconfigurable / FPGA Computing

Wuxi Jiangnan Institute of Computing Technology

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

University of California, Berkeley

Fast and Accurate Deep Neural Networks Training on Distributed Systems

Georgia Institute of Technology

Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations

Modeling Single-Source Shortest Path Algorithm Dynamics to Control Performance and Power Tradeoffs

Steven R. Young

Oak Ridge National Laboratory

Introduction - Machine Learning in HPC Environments

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Andrew Youngdahl

DePaul University

Semantically Organized Containers for Reproducible Research

Sandia National Laboratories

The ARM HPC Experience: From Testbeds to Exascale

Containers in HPC

Meeting HPC Container Challenges as a Community

Purdue University

Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)

Workloads and Benchmarks for System Acquisition

Tsinghua University

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

University of Texas

Distributed-Memory Hierarchical Compression of Dense SPD Matrices

Tsinghua University

Student Cluster Competition Team Panel Presentation

Georgia Institute of Technology

A Study of OpenMP Device Offloading in LLVM: Correctness and Consistency

International Center for Advanced Internet Research (iCAIR)

Northwestern University

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

University of St Andrews

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Florida State University

Enabling Efficient Data Infrastructure and Analytics on HPC Systems

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

The Algorithm and Framework Designs and Optimizations for Scalable Automata Processing on HPC Platforms

Hong Kong University of Science and Technology

SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition

DePaul University

Semantically Organized Containers for Reproducible Research

The BP Data Science Sandbox

Z

Frederick National Laboratory for Cancer Research

Portable and Reusable Deep Learning Infrastructure with Containers to Accelerate Cancer Studies

Intel Corporation

Compiler Optimization for Heterogeneous Locality and Homogeneous Parallelism in OpenCL and LLVM

Marcin Zalewski

Pacific Northwest National Laboratory

Chapel Aggregation Library (CAL)

Technical University of Ostrava, Czech Republic

Distributed Fast Boundary Element Methods

University of Edinburgh

Mitigating Performance and Progress Variability in Iterative Asynchronous Algorithms

Max Planck Institute for Informatics

faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU

Matthew J. Zekauskas

University of California, San Diego

Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU

J. Jensen Zhang

Tongji University

Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences

Hong Kong University of Science and Technology

SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition

Fermi National Accelerator Laboratory

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

State Key Laboratory of Mathematical Engineering and Advanced Computing

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Stony Brook University

Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling

Chinese Academy of Sciences

Reduction of Workflow Resource Consumption Using a Density-based Clustering Model

University of California, Berkeley

AI Matrix – Synthetic Benchmarks for DNN

Shandong University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Texas Tech University

Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

AI Matrix – Synthetic Benchmarks for DNN

Lawrence Berkeley National Laboratory

Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows

WarpX: Toward Exascale Modeling of Plasma Particle Accelerators

Institute of Physics, Chinese Academy of Sciences

University of California, Irvine

Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code Using Directives

University of Science and Technology of China

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Tsinghua University

Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight

Northwest University, China

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Beihang University

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Moffitt Cancer Center

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

Texas Advanced Computing Center

Introduction - Deep Learning on Supercomputers - Welcome

Enabling Scalable and Efficient Deep Learning on Supercomputers

Tools and Best Practices for Distributed Deep Learning with Supercomputers

Southern University of Science and Technology, China

Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight

Georgia Institute of Technology

Detecting MPI Usage Anomalies via Partial Program Symbolic Execution

University of California, Riverside

Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs

University of Utah

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

Tsinghua University

National Supercomputing Center, Wuxi

Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers

Washington State University

Students@SC: Making the Best of Your HPC Education

University of Chicago

Reproducibility as Side Effect

Iowa State University

Understanding SSD Reliability in Large-Scale Cloud Systems

Carnegie Mellon University

Scaling Embedded In Situ Indexing with DeltaFS

Tsinghua University

Understanding Potential Performance Issues Using Resource-Based alongside Time Models

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Numerical Algorithms Group

BP Center For High Performance Computing

Optimization of a Lattice Boltzmann Program

Montclair State University

Introduction - Innovating the Network for Data Intensive Science (INDIS)

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity

Innovating the Network for Data Intensive Science (INDIS)

Purdue University

HPC System Architectures

Tsinghua University

Qatar Computing Research Institute

ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds

Florida State University

Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems

State University of New York at Buffalo

ColdFront: An Open Source HPC Resource Allocation System

Maxim A. Ziatdinov

Oak Ridge National Laboratory

167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation

Sean B. Ziegeler

US Department of Defense HPC Modernization Program

Engility Corporation

Synthetic Data Generation for Evaluating Parallel I/O Compression Performance and Scalability

Matthew T. Ziegler

Energy Efficient Computer in the Exascale Age

Taming Datacenter Thermodynamics with Lenovo Neptune Technology

Christopher Zimmer

Oak Ridge National Laboratory

GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan

The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems

Using Darshan and CODES to Evaluate Application I/O Performance

A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing

Malgorzata Zimon

GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne

Georg Zitzlsberer

IT4Innovations, Czech Republic

Technical University of Ostrava, Czech Republic

Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler

University of Pittsburgh

Partial Redundancy in HPC Systems with Non-Uniform Node Reliabilities

Michael Zuckerman

Mobileye, an Intel Company

Compiler Optimization for Heterogeneous Locality and Homogeneous Parallelism in OpenCL and LLVM

California State University, Dominguez Hills

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Energy Sciences Network (ESnet)

INDIS Invited Talk: Introduction to SCinet

Volunteer Opportunities for SC Conference Planning

Next Generation NVMe-Native Parallel File System Accelerating AI Workloads