Workshop Session

5th International Workshop on HPC User Support Tools: HUST-18

Introduction - 5th International Workshop on HPC User Support Tools: HUST-18

OOOPS: An Innovative Tool for IO Workload Management on Supercomputers

CView and NWPerf for Supercomputer Performance Collection and Display.

Workshop Morning Break

ReFrame: A Regression Testing and Continuous Integration Framework for HPC systems

A Compiler and Profiler Based Tool for Querying HPC Application Characteristics

ColdFront: An Open Source HPC Resource Allocation System

Concluding Remarks

Back to Workshop Archive Listing

ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Keynote

Workshop Morning Break

Assessing Reproducibility: An Astrophysical Example of Computational Uncertainty in the HPC Context

Considering the Development Workflow to Achieve Reproducibility with Variation

Spotting Black Swans With Ease: The Case for a Practical Reproducibility Platform

Supporting Thorough Artifact Evaluation with Occam

Semantically Organized Containers for Reproducible Research

Open Panel: Automating Artifact Sharing, Evaluation, and Reuse

Back to Workshop Archive Listing

Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Session1: WACCPD Keynote: Experiences in Using Directive-Based Programming for Accelerated Computing Architectures

Session 2: Porting Scientific Applications Using Directives

Heterogeneous Programming and Optimization of Gyrokinetic Toroidal Code Using Directives

Workshop Morning Break

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Session 3: Using OpenMP

OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries

A Case Study for Performance Portability Using OpenMP 4.5

Session 4: Using OpenACC

OpenACC Routine Directive Propagation Using Interprocedural Analysis

OpenACC-Based GPU Parallelization of Plane Sweep Algorithm for Geometric Intersection

5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks

Back to Workshop Archive Listing

Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

Preserving Privacy through Processing Encrypted Data

Bringing FPGAs to HPC Production Systems and Codes

Workshop Morning Break

SimBSP: Enabling RTL Simulation for Intel FPGA OpenCL Kernels

Scalable FPGA Deployments for HPC and DC Applications

First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture

Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration

Accelerating Intelligence

The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems

Stream Computing of Lattice-Boltzmann Method on Intel Programmable Accelerator Card

Back to Workshop Archive Listing

The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

Introduction - The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

Tranquility Amidst Turbulence: A Vision for Advancing Scientific Discovery in the Era of Extreme Heterogeneity

Workshop Morning Break

Hybrid Quantum-Classical Computing Architectures

Stochastic Computing on Quantum Gates

Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression

Community Detection Across Emerging Quantum Architectures

Comparing Deep Learning with Quantum Inference on The D-Wave 2X

Speaker Panel 1

Workshop Lunch (on your own)

Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing

A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing

Non-Neural Network Applications for Spiking Neuromorphic Hardware

Workshop Afternoon Break

Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation

Speaker Panel 2

Domain-Specific System on Chips (DSSoC)

Workshop Closing

Back to Workshop Archive Listing

Innovating the Network for Data Intensive Science (INDIS)

Introduction - Innovating the Network for Data Intensive Science (INDIS)

INDIS Invited Talk: Introduction to SCinet

INDIS Showcases Panel: NRE and XNET and Architecture

Workshop Morning Break

INDIS Morning Keynote

Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference

Tracking Network Flows with P4

Workshop Lunch (on your own)

SDN for End-to-End Networked Science at the Exascale (SENSE)

Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners

INDIS Afternoon Keynote

Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers

Workshop Afternoon Break

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer

Back to Workshop Archive Listing

Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)

Morning Keynote – Computational Approaches in Clinical Applications

Workshop Morning Break

Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches

Safety, Reproducibility, Performance: Accelerating Cancer Drug Discovery with Cloud, ML, and HPC Technologies

The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects

Scalable Deep Ensemble Learning for Cancer Drug Discovery

Panel Discussion: Reproducibility and Accessibility - Challenges and Opportunities

Workshop Lunch (on your own)

Afternoon Keynote – Genomic Profiling of Normal, Premalignant, and Heterogeneous Tissues in Cancer Patients

Workshop Afternoon Break

HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports

Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries

Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud

Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning

Extending Frontiers for Computing in Cancer – Special Session

Back to Workshop Archive Listing

MCHPC’18: Workshop on Memory Centric High Performance Computing

Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing

MCHPC'18 Morning Keynote: Converging Storage and Memory

Workshop Morning Break

Challenges of High-Capacity DRAM Stacks and Potential Directions

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory

Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory

A Preliminary Study of Compiler Transformations for Graph Applications on the Emu System

Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling

Workshop Lunch (on your own)

MCHPC'18 Afternoon Keynote: All Tomorrow’s Memory Systems

Workshop Afternoon Break

On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale

Exploring Allocation Policies in Disaggregated Non-Volatile Memories

Heterogeneous Memory and Arena-Based Heap Allocation

MCHPC'18 Panel: Research Challenges in Memory-Centric Computing

Back to Workshop Archive Listing

Workshop on Exascale MPI (ExaMPI)

Introduction - Workshop on Exascale MPI (ExaMPI)

ExaMPI Keynote

Workshop Morning Break

Tree-Based Fault-Tolerant Collective Operations for MPI

Understanding the Usage of MPI in Exascale Proxy Applications

Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies

Workshop Lunch (on your own)

AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries

A Novel Approach to Supporting Communicators for In-Switch Processing of MPI Collectives

Workshop Afternoon Break

ExaMPI Invited Talk

Anycast: Rootless Broadcasting with MPI

ExaMPI Panel

Back to Workshop Archive Listing

Workshop on Education for High Performance Computing (EduHPC)

Introduction - Workshop on Education for High Performance Computing (EduHPC)

Keynote Talk: Student Engagement: View from the Trenches

Workshop Morning Break

Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs

Jupyter Notebooks and User-Friendly HPC Access

SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation

Employing Student Retention Strategies for an Introductory GPU Programming Course

Paper Discussion

PDC Curriculum Update

Workshop Lunch (on your own)

Filling the Gap between Education and Industry: Evidence-Based Methods for Introducing Undergraduate Students to HPC

Evaluating the Wide Area Classroom after 10,500 HPC Students

A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java

Paper Discussion

Workshop Morning Break

Toward Developing a Repository of Logical Errors Observed in Parallel Code for Teaching Code Correctness

Paper Discussion

Peachy Introduction

Computing a Movie of Zooming into a Fractal

Optimization of an Image Processing Algorithm: Histogram Equalization

OpenMP: What’s Inside the Black Box?

The Wave Equation as a Motivating Example for High Performance Computing

Storms of High-Energy Particles: An assignment for OpenMP, MPI, and CUDA/OpenCL

Presentation of Updates from the Community

Experience Report: 4 Years of Teaching Cloud Computing and Big Data at the University Level

Building a Low Budget Cluster Through Hardware Reuse

Introducing Three Basic Concepts in Parallel Computation to 1st Year Computer Science Students in a Simple and Effective Way

Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity

Lessons from Integrating Parallelism into Undergraduate Curriculum at UMKC

A Comprehensive Parallel Computing Curriculum: From Second Year to Professionals

Eight Years Analysis of Adopting PDC in Data Structures at UCA

Introducing PDC Concepts with Spatial Computing Curriculum

Discussions on Peachy Assignment and Community Update

Best Paper Announcement

Closing Remarks

Back to Workshop Archive Listing

WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale

Introduction - WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale Science

Keynote

Workshop Morning Break

Reduction of Workflow Resource Consumption Using a Density-based Clustering Model

Flux: Overcoming Scheduling Challenges for Exascale Workflows

LOS: Level Order Sampling for Task Graph Scheduling on Heterogeneous Resources

Energy-Aware Workflow Scheduling and Optimization in Clouds Using Bat Algorithm

A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows

End-to-End Online Performance Data Capture and Analysis for Scientific Workflows

Workshop Lunch (on your own)

Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud

Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture

Optimizing the Throughput of Storm-Based Stream Processing in Clouds

Workshop Afternoon Break

DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything

WRENCH: A Framework for Simulating Workflow Management Systems

WORKS 2018 Panel

Back to Workshop Archive Listing

Women in HPC: Diversifying the HPC Community

Introduction - Women in HPC: Diversifying the HPC Community

A Black Woman’s Sojourn in High Performance Computing: Recovering Lost History

Workshop Morning Break

Hot Topics Discussion I: Thriving at Work

Hot Topics Discussion II: Thriving at Work

Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.

Workshop Lunch (on your own)

High Performance Computing in Dynamic Traffic Simulation

Best Practices from Organizations on Improving Workplace Diversity

Efficient Application of Low Mach Number Hydrodynamics Code to Stellar Flows

The Movement toward HPC Inclusivity: Achieving On-Demand Accessibility of High Performance Computing (HPC) through Ephemeral Projects Utilizing the Alces Gridware Project

Optimizing Python Data Processing for the DESI Experiment on the NERSC Cori Supercomputer

Study of Performance Variability on Dragonfly Systems

A Deferred Correction Coupling Strategy for Cosmological Simulations

Deep Learning: Extrapolation Tool for Computational Nuclear Physics

Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method

FLAME GPU: Complex System Simulation Framework

Error Analysis in HPC Applications Using Algorithmic Differentiation

Power Aware Heterogeneous Node Assembly

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

From Message Passing to PGAS

In Situ Performance Analysis of Event-driven Simulations to Support the Codesign of Extreme-Scale Systems

Characterization of the Impact of Soft Errors on Iterative Methods

Use Cases of Neuromorphic Co-Processors in Future HPC Environments

Optimization of a Lattice Boltzmann Program

Plasma Meets Portability: A Journey to Performance Portability in a Particle-in-Cell Code

Title: Distributed Memory Fast Fourier Transforms in the Exascale Era

Challenges of Performance Portability for Fortran Unstructured Mesh Codes

Large-Scale PDE-Constrained Optimization

Workshop Afternoon Break

Developing Workplace Resilience and Managing Stress

Back to Workshop Archive Listing

2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

Introduction - 2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

Welcome and Introduction

Panel 1: Role of Federated Polish HPC Centers in Polish AI Initiatives and EuroHPC Program

Panel 1: A Site-Local View of Creating a Pan-European Federated Research Infrastructure

Panel 1: Service-Oriented HPC and Data Infrastructures for Science in Germany

Panel 1: European Region Q&A / Discussion Moderated by Prof. Taisuke Boku

Panel 2: United States National Science Foundation (NSF) Office of Advanced Cyberinfrastructure Programs and Workforce Development

Panel 2: Arabia's Leap into the Cyber Era

Panel 2: HPC in India

Panel 2: Russian HPC Trends: a View From a Local Vendor Trench

Panel 2: US, Saudi Arabia, India and Russia Q&A / Discussion Moderated by Prof. Taisuke Boku

Workshop Lunch (on your own)

Keynote: The Post-K for General-Purpose, Energy-Efficient, and Sustained Application Performance

Panel 3: Challenge and Chance for Supercomputing Center in China

Panel 3: Japan's HPC Program for System Development and Deployment toward Exascale

Panel 3: HPC Status in Thailand

Panel 3: The Path from HPC to AI in Taiwan's NCHC

Panel 3: Asian Region Q&A / Discussion Moderated by Dr. David Kahaner

Workshop Afternoon Break

Introduction to Student Cluster Competitions

Panel 4: Asia Supercomputing Community: Profound Inspiration through Strong Competition

Panel 4: Student Spotlight Presentation

Student Cluster Competition Team Panel Presentation

Panel 4: Student Competition and Workforce Development Q&A / Discussion Moderated by Dan Olds

Networking / Independent Discussion

Back to Workshop Archive Listing

HPC Systems Professionals Workshop (HPCSYSPROS18)

Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)

HPCSYSPROS18: Keynote

Message from the SIGHPC SYSPROS Virtual Chapter President

Workshop Afternoon Break

Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD

Compliant Cloud+Campus Hybrid HPC Infrastructure

cgroups py : Using Linux Control Groups and Systemd to Manage CPU Time and Memory

Making Container Easier with HPC Container Maker

Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks

xCAT and Masterless Puppet: Aiming for Ideal Configuration Management

Stateless Provisioning: Modern Practice in HPC

Next-Generation Cluster Management Software

HPCSYSPROS18: Author Panel

Upcoming Events in the HPC Systems Professionals Community

Back to Workshop Archive Listing

Machine Learning in HPC Environments

Introduction - Machine Learning in HPC Environments

Deep Learning Evolutionary Optimization for Regression of Rotorcraft Vibrational Spectra

Workshop Morning Break

Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes

Auto-Tuning TensorFlow Threading Model for CPU Backend

Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction

Training Speech Recognition Models on HPC Infrastructure

Back to Workshop Archive Listing

Fifth International Workshop on Visual Performance Analysis (VPA 18)

Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)

VPA18 Keynote: Not Your Mama’s Angry Fruit Salad: Ruminations on 30 Years of Performance Visualization and Visual Performance Analysis

Workshop Afternoon Break

Visual Analytics Challenges in Analyzing Calling Context Trees

Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges

Visualizing Multidimensional Health Status of Data Centers

PaScal Viewer: A Tool for the Visualization of Parallel Scalability Trends

Back to Workshop Archive Listing

The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

Perspectives on Data Reduction from ASCR

Workshop Afternoon Break

Feature-Relevant Data Reduction for In Situ Workflows

A Statistical Analysis of Compressed Climate Model Data

Data Reduction Challenges in Coordinated Simulation and Experimental Fusion Science

Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation

Synthetic Data Generation for Evaluating Parallel I/O Compression Performance and Scalability

Amplitude-Aware Lossy Compression for Quantum Circuit Simulation

A Study on Checkpoints Compression for Adjoint Computation

Back to Workshop Archive Listing

Computational Reproducibility at Exascale 2018 (CRE2018)

Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)

CRE2018 – Plenary I

Debugging and Optimization of HPC Programs in Mixed Precision with the Verrou Tool

High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme

Workshop Afternoon Break

CRE218 – Plenary II

Reproducibility for Streaming Analysis

Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance

Panel Discussion

Back to Workshop Archive Listing

The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression

Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications

Workshop Morning Break

Deep Learning at Scale on Nvidia V100 Accelerators

Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications

Algorithm Selection of MPI Collectives Using Machine Learning Techniques

miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems

Unified Cross-Platform Profiling of Parallel C++ Applications

Workshop Lunch (on your own)

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity

Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa

Workshop Afternoon Break

Is Data Placement Optimization Still Relevant on Newer GPUs?

Approximating a Multi-Grid Solver

Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems

Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures

Back to Workshop Archive Listing

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

Introduction

Keynote 1: A Few Scheduling Problems for Resilience at Scale

Workshop Morning Break

Keynote 2: HPC and AI as Drivers for Industrial Engagement

Event-Triggered Communication in Parallel Computing

Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers

Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations

Non-Collective Scalable Global Network Based on Local Communications

Workshop Lunch (on your own)

Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps

Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials

Machine Learning-Aided Numerical Linear Algebra: Convolutional Neural Networks for the Efficient Preconditioner Generation

Workshop Afternoon Break

Keynote 3: Hierarchical Algorithms on Hierarchical Architectures

Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing

A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations

Dynamic Load Balancing of Plasma and Flow Simulations

On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures

Back to Workshop Archive Listing

PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

Welcome and Introduction

Keynote Address

Workshop Morning Break

Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

Evaluation of HPC Application I/O on Object Storage Systems

WIP Session 1

Workshop Lunch (on your own)

Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage

Understanding SSD Reliability in Large-Scale Cloud Systems

WIP Session 2

Workshop Afternoon Break

Characterizing Deep-Learning I/O Workloads in TensorFlow

Toward Understanding I/O Behavior in HPC Workflows

Methodology for the Rapid Development of Scalable HPC Data Services

WIP Session 3

Back to Workshop Archive Listing

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

Welcome, Workshop Goals, and Opening Remarks

Contention-Aware Container Placement Strategy for Docker Swarm

Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

TACC's Cloud Deployer: Automating the Management of Distributed Software Systems

Workshop Morning Break

HPCViz: Monitoring Health Status of High Performance Computing Systems

Tivan: A Scalable Data Collection and Analytics Cluster

DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control

Out-of-Band (BMC based) Data Center Monitoring DMTF Redﬁsh API Integration with Nagios

Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis

Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs

Workshop Lunch (on your own)

Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective

Simulating Data Centers with Redfish-Enabled Equipment

Workshop Afternoon Break

Invited Talk: Resource Control at Facebook

Invited Talk: The Campus Compute Cooperative Project as an Alternative to Commercial Clouds

Invited Talk: Current Status of the OpenHPC Project

Closing Remarks

Back to Workshop Archive Listing

IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

Photonic Interconnects for Extreme Scale Computing

Workshop Morning Break

Software Prefetching for Unstructured Mesh Applications

A Fast and Simple Approach to Merge and Merge Sorting Using Wide Vector Instructions

Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture

There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline

Scale-Free Graph Processing on a NUMA Machine

Workshop Lunch (on your own)

Versal: The New Xilinx Adaptive Compute Acceleration Platforms (ACAP)

Workshop Afternoon Break

Mix-and-Match: A Model-Driven Runtime Optimization Strategy for BFS on GPUs

A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs

High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation

IA^3 Debate

Back to Workshop Archive Listing

ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

Introduction - ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

Exascale Challenges in Across-Node Parallelism for Languages and Runtimes

Workshop Morning Break

Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications

Design of Data Management for Multi-SPMD Workflow Programming Model

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)

Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems

Workshop Lunch (on your own)

Asynchronous Execution of Python Code on Task Based Runtime Systems

A Unified Runtime for PGAS and Event-Driven Programming

Workshop Afternoon Break

Portable and Reusable Deep Learning Infrastructure with Containers to Accelerate Cancer Studies

Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke

Heterogeneous Systems and the Road to Exascale for HPC and AI

ESPM2 2018: Closing Remarks

Back to Workshop Archive Listing

Machine Learning in HPC Environments

Introduction - Machine Learning in HPC Environments

Workshop Overview

Morning Keynote – Azalia Mirhoseini (Google)

Workshop Morning Break

Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training

Large-Scale Clustering Using MPI-Based Canopy

Automated Labeling of Electron Microscopy Images Using Deep Learning

Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration

Workshop Lunch (on your own)

On Adam-Trained Models and a Parallel Method to Improve the Generalization Performance

Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems

Workshop Afternoon Break

Workshop Overview

Afternoon Keynote - Robinson Pino (DOE ASCR)

Optimizing Machine Learning on Apache Spark in HPC Environments

Large Minibatch Training on Supercomputers with Improved Accuracy and Reduced Time to Train

Back to Workshop Archive Listing

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

Keynote: Perspectives on In Situ

Workshop Morning Break

A Flexible System For In Situ Triggers

PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows

In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization

Python-Based In Situ Analysis and Visualization

Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows

Scheduling for In-machine Analytics: Data Size Is Important

Lightning Round Questions

Workshop Lunch (on your own)

In-Transit Molecular Dynamics Analysis with Apache Flink

libIS: A Lightweight Library for Flexible In Transit Visualization

Best Paper Awards

Workshop Afternoon Break

Invited Talk: Data Science Meets CFD

UnPanel on the State of the In Situ Community

ISAV 2018 Wrap Up

Back to Workshop Archive Listing

2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

Introduction - 2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

Making Formal Methods for HPC Disappear

Workshop Morning Break

Hybrid Theorem Proving as a Lightweight Method for Verifying Numerical Software

HPC Software Verification in Action: A Case Study with Tensor Transposition

Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems

Verifying Qthreads: Is Model Checking Viable for User Level Tasking Runtimes?

Incremental Static Race Detection in OpenMP Programs

Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free

Workshop Lunch (on your own)

Correctness of Floating Point Programs - Exception Handling and Reproducibility

Workshop Afternoon Break

Compiler-Aided Type Tracking for Correctness Checking of MPI Applications

Toward Deductive Verification of Message-Passing Parallel Programs

PARCOACH Extension for a Full-Interprocedural Collectives Verification

Facilitating the Adoption of Correctness Tools in HPC Applications

Back to Workshop Archive Listing

4th Workshop for Open Source Supercomputing (OpenSuCo)

Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)

Opening Remarks

Performance and Communication Modeling for Exascale Proxy Architecture in Aspen

FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud

Workshop Morning Break

RV128 Instruction Set Architecture

How System-Level Design Can Benefit the Progress of Open-Source Hardware

Keynote: Full Stack Open Source Supercomputing

Panel: Open-Source Hardware

Workshop Lunch (on your own)

HPC PowerStack: a community-wide open collaboration for enabling system-wide power efficiency

Open-Source Supercomputing

GraphBLAS Forum and Its Relevant Software Zoo

Workshop Afternoon Break

Driving Asynchronous Distributed Tasks with Events

Opportunities for Open-Source Development for D-Wave Systems

Open-Source Modeling and Simulation

Panel: Open-Source Software

Closing Remarks

Back to Workshop Archive Listing

LLVM-HPC2018: The Fifth Workshop on the LLVM Compiler Infrastructure in HPC

Introduction - LLVM-HPC2018: The Fifth Workshop on the LLVM Compiler Infrastructure in HPC

Keynote: Glow: An Optimizing Compiler for High-Performance Machine Learning

Workshop Morning Break

OpenMP GPU Offload in Flang and LLVM

Pointers Inside Lambda Closure Objects in OpenMP Target Offload Regions

Clacc: Translating OpenACC to OpenMP in Clang

LLVM and the Automatic Vectorization of Loops Invoking Math Routines: -fsimdmath

Workshop Lunch (on your own)

Function/Kernel Vectorization via Loop Vectorizer

User-Directed Loop-Transformations in Clang

Workshop Afternoon Break

OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling

PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications

AIWC: OpenCL-Based Architecture Independent Workload Characterization

Compiler Optimization for Heterogeneous Locality and Homogeneous Parallelism in OpenCL and LLVM

A Study of OpenMP Device Offloading in LLVM: Correctness and Consistency

Challenges of C++ Heterogeneous Programming Using SYCL Implementation Experience: the Four Horsemen of the Apocalypse

LLVM-HPC2018: Final Discussion

Back to Workshop Archive Listing

8th Workshop on Python for High-Performance and Scientific Computing

Introduction - 8th Workshop on Python for High-Performance and Scientific Computing

Keynote: Better Scientific Software (BSSw)

Workshop Morning Break

AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests

Managing Python in HPC Environments

Panel: Interactivity in Supercomputing

Workshop Lunch (on your own)

Accelerating the Signal Alignment Process in Time-Evolving Geometries Using Python

Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta

Workshop Afternoon Break

Data-Parallel Python for High Energy Physics Analyses

Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

PyHPC Lightning Talks

Back to Workshop Archive Listing

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Keynote

Workshop Morning Break

State of the Working Group

The Power Grid

Thermosyphon

Workshop Lunch (on your own)

Machine Installations at ORNL, LRZ, Sandia

Workshop Afternoon Break

Quantum Computing and Impact on Facility

Challenges in Holistic Monitoring and Data Integration (LLNL, LBNL, UniBo/Cineca)

Panel Discussion – Software for Energy Efficiency (PowerStack, PowerAPI, READEX, GEOPM)

Closing Remarks

Back to Workshop Archive Listing

Fifth SC Workshop on Best Practices for HPC Training and Education

Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education

Workshop Morning Break

HPC Education and Training: An Australian Perspective

Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center

Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level

The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training

Training Computational Scientists to Build and Package Code

CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity

Workshop Lunch (on your own)

Toward a HPC Certification Program

Potential Influence of Prior Experience in an Undergraduate-Graduate Level HPC Course

Deep Learning by Doing: Nvidia Deep Learning Institute

Using CloudLab as a Scalable Platform for Teaching Cluster Computing

Programmable Education Infrastructure: Cloud Resources as HPC Education Environments

The HPC Best Practices Webinar Series

Workshop Afternoon Break

Discussion Session

Breakout Groups

Back to Workshop Archive Listing

7th Workshop on Extreme-Scale Programming Tools (ESPT)

Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)

Understanding Software Sustainability: Learning from Parsl and Other Projects

Understanding the Scalability of Molecular Simulation Using Empirical Performance Modeling

Advanced Event Sampling Support for PAPI

Workshop Morning Break

PARLOT: Efficient Whole-Program Call Tracing for HPC Applications

Gotcha: A Function-Wrapping Interface for HPC Tools

HPC Software Infrastructures at German Aerospace Center

ESPT Closing Remarks

Back to Workshop Archive Listing

Deep Learning on Supercomputers

Introduction - Deep Learning on Supercomputers - Welcome

Keynote: Learning-Based Predictive Models: a New Approach to Integrating Large-Scale Simulations and Experiments

CANDLE Framework for Large Scale Deep Learning

Fast and Accurate Deep Neural Networks Training on Distributed Systems

Workshop Morning Break

Deep Learning at NERSC: Usability, Capability, and Everything in Between

Artificial Intelligence Enabled Multiscale Molecular Simulations

Scalable and Distributed DNN Training on Modern HPC Systems

High-Performance AI: A View from Systems and Frameworks

Large Scale Deep Learning in PFN: from 15-Min Imagenet to PFDet

Enabling Scalable and Efficient Deep Learning on Supercomputers

Back to Workshop Archive Listing

Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

Toward Ad Hoc Recovery For Soft Errors

Fault Tolerant Cholesky Factorization on GPUs

Improving Application Resilience by Extending Error Correction with Contextual Information

A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform

Workshop Morning Break

Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer

Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods

CPU Overheating Characterization in HPC Systems: a Case Study

SaNSA - the Supercomputer and Node State Architecture

Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes

Back to Workshop Archive Listing

PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

An Application Perspective on Programming Models for the Future

Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model

Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows

Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS

Computational Cosmology and Astrophysics on Adaptive Meshes Using Charm++

Workshop Morning Break

GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network

Chapel Aggregation Library (CAL)

Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations

Distributed L-Shaped Algorithms in Julia

Panel Discussion

Back to Workshop Archive Listing

International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability

An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability

Effective Performance Portability

Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability, and Productivity

P3HPC Session 1 Panel Discussion

Workshop Morning Break

Performance Portability of an Unstructured Hydrodynamics Mini-Application

Performance Portability Challenges for Fortran Applications

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

Heterogeneous CPU-GPU Execution of Stencil Applications

P3HPC Session 2 Panel Discussion

P3HPC Community Discussion and Next Steps

Back to Workshop Archive Listing

5th International Workshop on HPC User Support Tools: HUST-18

ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC

Fifth Workshop on Accelerator Programming Using Directives (WACCPD)

Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)

The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)

Innovating the Network for Data Intensive Science (INDIS)

Fourth Computational Approaches for Cancer Workshop (CAFCW18)

MCHPC’18: Workshop on Memory Centric High Performance Computing

Workshop on Exascale MPI (ExaMPI)

Workshop on Education for High Performance Computing (EduHPC)

WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale

Women in HPC: Diversifying the HPC Community

2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development

HPC Systems Professionals Workshop (HPCSYSPROS18)

Machine Learning in HPC Environments

Fifth International Workshop on Visual Performance Analysis (VPA 18)

The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)

Computational Reproducibility at Exascale 2018 (CRE2018)

The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)

9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems

PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems

The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)

IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms

ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware

Machine Learning in HPC Environments

ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization

2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)

4th Workshop for Open Source Supercomputing (OpenSuCo)

LLVM-HPC2018: The Fifth Workshop on the LLVM Compiler Infrastructure in HPC

8th Workshop on Python for High-Performance and Scientific Computing

Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)

Fifth SC Workshop on Best Practices for HPC Training and Education

7th Workshop on Extreme-Scale Programming Tools (ESPT)

Deep Learning on Supercomputers

Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)

PAW-ATM: Parallel Applications Workshop - Alternatives to MPI

International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Computational Phenomics @Scale: From Supercomputers to Bedside