Menu Close Button
SC Schedule
First-Time Attendees
Students
Exhibitors
Media
Program
Exhibits
Experience
Submit
Program
Awards
Birds of a Feather
Early Career
HPC Inspires Plenary
Invited Talks
Keynote
Panels
Papers
Posters
Proceedings
Showcases
Emerging Technologies Showcase
Doctoral Showcase
HPC Impact Showcase
Scientific Visualization & Data Analytics Showcase
Tutorials
Workshops
Exhibits
Exhibit at SC
Exhibitor Forum
Exhibitor Listing
SCinet & Exhibitors
Sponsorship Opportunities
Experience
Register
SC Schedule
30 Years of SC
Attendee Deadlines
Convention Center
Convention Center Map
Dallas
Family Resources
Housing
Inclusivity
Receptions
Media
SC18 News
Navigating SC
SC Blog
SC Newsletter
SCinet
Students@SC
Submit
Submission Deadlines
Submission Themes
SC Reproducibility Initiative
Early Career Applications
Exhibitor Forum Submissions
Panels Submissions
Papers Submissions
Posters Submissions
Showcases Submissions
Tutorials Submissions
Workshops Submissions
Search
Search
SC Schedule
First-Time Attendees
Students
Exhibitors
Media
search-icon
Search
Search
logo
Program
November 11–16, 2018
Exhibits
November 12–15, 2018
KAY BAILEY HUTCHISON CONVENTION CENTER DALLAS
The International Conference for High Performance
Computing, Networking, Storage, and Analysis
Program
Awards
Birds of a Feather
Early Career
HPC Inspires Plenary
Invited Talks
Keynote
Panels
Papers
Posters
Proceedings
Showcases
Doctoral Showcase
Emerging Technologies Showcase
HPC Impact Showcase
Scientific Visualization & Data Analytics Showcase
Tutorials
Workshops
Exhibits
Exhibit at SC
Exhibitor Housing
Exhibitor Manual
Exhibits Floorplan
Online Application
Startup Pavilion
Exhibitor Announcements
Exhibitor Forum
Exhibitor Listing
SCinet & Exhibitors
Sponsorship Opportunities
Experience
Register
Registration FAQ
SC Schedule
30 Years of SC
Attendee Deadlines
Convention Center
Map
Dallas
Family Resources
Housing
Hotel Shuttle Schedule
Inclusivity
Demographics
Navigating SC
Receptions
Media
Logo Usage
Media Partners
Media Registration
Photo/Video Policy
SC Blog
SC Newsletter
SC18 News
SCinet
Network Policy
Acceptable Network Use Policy
On-site Connection Security
Network Requests
Network Research Exhibition
SCinet Diversity Program: WINS
SCinet Contributors
SCinet Teams
SCinet Then & Now
Volunteer for SCinet
Students@SC
Experiencing HPC for Undergraduates
Mentor–Protégé Program
Student Cluster Competition
Student/Post-Doc Job Fair
Student Volunteers
Student Volunteers FAQ
Submit
Submission Deadlines
Submission Themes
SC Reproducibility Initiative
Author FAQ
Reviewer FAQ
Awards Nominations
Birds of a Feather Submissions
Birds of a Feather FAQ
Early Career Applications
Exhibitor Forum Submissions
Panels Submissions
Panels FAQ
Papers Submissions
Papers FAQ
Double-Blind Review Policy
Posters Submissions
Posters FAQ
Showcases Submissions
Tutorials Submissions
Tutorials FAQ
Workshops Submissions
Workshops FAQ
register-pencil
Register
Menu Toggle Button
Home
Presenter Index
Presenter Index
Full
Program
·
Presenters
·
Organizations
·
Search
Program
·
Flagged
·
Happening
Now
·
Maps
·
Notifications
More…
Search Program
Flagged
Happening Now
Maps
Notifications
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
A
Tanuj K. Aasawat
University of British Columbia
Scale-Free Graph Processing on a NUMA Machine
Omar Aaziz
Sandia National Laboratories
Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications
Ahmad Abdelfattah
University of Tennessee
MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines
Rached Abdelkhalak
King Abdullah University of Science and Technology
Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation
Ghaleb Abdulla
Lawrence Livermore National Laboratory
Data Analytics for System and Facility Energy Management
Francois Abel
IBM Zurich Research Laboratory
Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration
Eroma Abeysinghe
Indiana University
SciGaP: Apache Airavata Hosted Science Gateways
Greg Abram
University of Texas
Texas Advanced Computing Center
The First Water in the Universe
David Abramson
University of Queensland
Energy Efficiency Modeling of Parallel Applications
Jean-Thomas Acquaviva
DataDirect Networks
International HPC Certification Program
Toward a HPC Certification Program
Bilge Acun
IBM Research
Power Aware Heterogeneous Node Assembly
Adedoyin Adetokunbo
Los Alamos National Laboratory
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Md Musabbir Adnan
University of Tennessee
Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation
Sarita Adve
University of Illinois
Kennedy Award Presentation - Memory Consistency Models: They Are Broken and Why We Should Care
Patrick Aerts
Netherlands eScience Center
Data Archiving and Networked Services (DANS)
Sustaining Research Software
Deborah Agarwal
Lawrence Berkeley National Laboratory
Dac-Man: Data Change Management for Scientific Datasets on HPC Systems
Neil Agarwal
University of California, Berkeley
SaNSA - the Supercomputer and Node State Architecture
Ankit Agrawal
Northwestern University
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies
Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training
Rashmi Agrawal
Boston University
Panel: Open-Source Hardware
RV128 Instruction Set Architecture
Mulya Agung
Tohoku University
A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems
Muhammed Abdullah Al Ahad
KTH Royal Institute of Technology
Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows
Hadia Ahmed
Lawrence Berkeley National Laboratory
From Message Passing to PGAS
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Dong Ahn
Lawrence Livermore National Laboratory
Panel Discussion
Flux: Overcoming Scheduling Challenges for Exascale Workflows
James Ahrens
Los Alamos National Laboratory
In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization
Alex Aiken
Stanford University
Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes
Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems
James B. Aimone
Sandia National Laboratories
Non-Neural Network Applications for Spiking Neuromorphic Hardware
Mark Ainsworth
Brown University
Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)
Jonathan Ajo-Franklin
Lawrence Berkeley National Laboratory
Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction
Kadir Akbudak
King Abdullah University of Science and Technology
Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation
Dana Akhmetova
KTH Royal Institute of Technology
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
Linda Akli
Southeastern Universities Research Association (SURA)
Hot Topics Discussion I: Thriving at Work
Reda Al-Bahrani
Northwestern University
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
Ahmed Al-Jarro
Fujitsu Laboratories of Europe Ltd.
DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping
Taha Al-Jody
University of Huddersfield
Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks
Sadaf R. Alam
Swiss National Supercomputing Centre
RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management
Convergence between HPC and Big Data: The Day After Tomorrow
HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation
“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities
Interactivity in HPC
Christie L. Alappat
University of Erlangen-Nuremberg
Erlangen Regional Computing Center
Applying the Execution-Cache-Memory Model: Current State of Practice
Recursive Algebraic Coloring Engine
Mohammed Alawad
Oak Ridge National Laboratory
HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports
Johannes Albert-von der Gönna
Leibniz Supercomputing Centre
Spack Community BoF
Nia Alexandrova
Hartree Centre
Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education
Fifth SC Workshop on Best Practices for HPC Training and Education
Vassil Alexandrov
Barcelona Supercomputing Center
Introduction
Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures
9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
Yuri Alexeev
Argonne National Laboratory
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
MPI/OpenMP parallelization of the Fragment Molecular Orbitals Method in GAMESS
Full State Quantum Circuit Simulation by Using Lossy Data Compression
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Community Detection Across Emerging Quantum Architectures
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
Hybrid Quantum-Classical Computing Architectures
Ghazanfar Ali
Texas Tech University
Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios
Yussuf Ali
Japan Atomic Energy Agency
Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers
Momme Allalen
Leibniz Supercomputing Centre
Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?
Randy Allen
Mentor, a Siemens Business
Session 4: Using OpenACC
Brian Allison
OpenCAPI Consortium
OpenCAPI: High Performance, Host-Agnostic, Coherent Accelerator Architecture and Ecosystem
Cyril Allouche
Atos
Bull
Seeking Quantum Supremacy with Numerical Simulation
Ann S. Almgren
Lawrence Berkeley National Laboratory
Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Mitchell Aloserij
University of Amsterdam
Tracking Network Flows with P4
Ilkay Altintas
San Diego Supercomputer Center
Keynote
Data Science and HPC Education and Outreach
Alper Altuntas
National Center for Atmospheric Research
Hybrid Theorem Proving as a Lightweight Method for Verifying Numerical Software
Tariq Alturkestani
King Abdullah University of Science and Technology
Toward Smoothing Data Movement Between RAM and Storage
Srinivas Aluru
Georgia Institute of Technology
School of Computational Science and Engineering
Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting
Parallel and Scalable Combinatorial String and Graph Algorithms on Distributed Memory Systems
Lluc Alvarez Marti
Barcelona Supercomputing Center
Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs
Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs
OpenMP: What’s Inside the Black Box?
Jose N. Amaral
University of Alberta
OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries
Ramon Amela
Barcelona Supercomputing Center
AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests
Abdelhalim Amer
Argonne National Laboratory
Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading
MPICH: A High Performance Open-Source MPI Implementation
Parsa Amini
Louisiana State University
Asynchronous Execution of Python Code on Task Based Runtime Systems
Christopher Amos
Baylor College of Medicine
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Muhammad Alfian Amrizal
Tohoku University
A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems
Jefferson Amstutz
Intel Corporation
libIS: A Lightweight Library for Flexible In Transit Visualization
George Amvrosiadis
Carnegie Mellon University
Scaling Embedded In Situ Indexing with DeltaFS
Rachana Ananthakrishnan
University of Chicago
Globus
National Research Infrastructure: Collaborative Session
Bill Anderson
National Center for Atmospheric Research
HPCSYSPROS18: Keynote
Jason Anderson
University of Chicago
Reproducibility as Side Effect
Kristin Anderson
Colder Products Company
The Case for Thermoplastic Quick Disconnects in Liquid Cooling
Michael Anderson
Intel Corporation
Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels
Paul Anderson
Puppet
Puppet in HPC: Building on 10 Years of Practice
Georgios Andreadis
Delft University of Technology
Vrije University Amsterdam
A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments
Matthew Andrew
Carl Zeiss X-ray Microscopy Inc
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Samuel Freitas Antao
IBM
GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne
OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling
Kristen Anton
Dartmouth Medical School
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Gabriel Antoniu
French Institute for Research in Computer Science and Automation (INRIA)
Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage
Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud
Katie Antypas
Lawrence Berkeley National Laboratory
Convergence between HPC and Big Data: The Day After Tomorrow
Ali Anwar
IBM
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Hartwig Anzt
Karlsruhe Institute of Technology
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation
Toshikazu Aoyama
NEC Corporation
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
David Appelhans
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Cecilia Aragon
University of Washington
The Human Side of Data Science
Manuel Arenaz
Appentra Solutions
Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training
Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.
Yuki Arikawa
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Allison Armstrong
Igneous Systems Inc
Data Protection Solutions for ML/AI
Bill Arndt
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
James Arnemann
University of California, Berkeley
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Dorian C. Arnold
Emory University
Invited Talk Session 6
Ritu Arora
Texas Advanced Computing Center
University of Texas
Toward Developing a Repository of Logical Errors Observed in Parallel Code for Teaching Code Correctness
Richard B. Arthur
General Electric Company
Computationally-Accelerated Engineering at GE: Physics + Deep Learning
Hirochika Asai
Preferred Networks
Large Scale Deep Learning in PFN: from 15-Min Imagenet to PFDet
Samar Aseeri
King Abdullah University of Science and Technology
Title: Distributed Memory Fast Fourier Transforms in the Exascale Era
Rizwan A. Ashraf
Oak Ridge National Laboratory
Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer
Mike Ashworth
University of Manchester
First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture
Semih Aslan
Texas State University
Hardware Acceleration of CNNs with Coherent FPGAs
Scott Atchley
Oak Ridge National Laboratory
GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Using Darshan and CODES to Evaluate Application I/O Performance
Emre Ates
Boston University
Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance
Andrew Attwood
University of Manchester
First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture
Simone Atzeni
Nvidia Corporation
OpenMP GPU Offload in Flang and LLVM
Danny Auble
SchedMD LLC
SLURM User Group Meeting
Guillaume Aupy
French Institute for Research in Computer Science and Automation (INRIA)
University of Bordeaux
Scheduling for In-machine Analytics: Data Size Is Important
Brian Austin
Lawrence Berkeley National Laboratory
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
Jeff Autor
Hewlett Packard Enterprise
Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective
Power API and Redfish: Standardizing Power Measurement and Control for HPC
Sasikanth Avancha
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels
Amro Awad
University of Central Florida
Exploring Allocation Policies in Disaggregated Non-Volatile Memories
Eduard Ayguade
Barcelona Supercomputing Center
Polytechnic University of Catalonia
Compiler and Runtime Based Parallelization and Optimization for GPUs
Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs
OpenMP: What’s Inside the Black Box?
Return to Top
B
John D. Bachan
Lawrence Berkeley National Laboratory
Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Scott B. Baden
Lawrence Berkeley National Laboratory
From Message Passing to PGAS
Doomsday: Predicting Which Node Will Fail When on Supercomputers
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
David Bader
Georgia Institute of Technology
Convergence between HPC and Big Data: The Day After Tomorrow
17th Graph500 List
Michael Bader
Technical University Munich
Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes
Rosa Badia
Barcelona Supercomputing Center
Big Data and Exascale Computing (BDEC2) Application Roundtable
AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests
Non-Volatile Memory
Frank Baetke
European Open File System Association (EOFS)
LUSTRE Community BOF: Lustre in HPC and Emerging Data Markets: Roadmap, Features and Challenges
Amir Bahmani
Stanford University
Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud
Yu Bai
Beihang University
Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach
Anna Maria Bailey
Lawrence Livermore National Laboratory
Energy Efficient HPC Working Group
Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification
Energy Efficiency Considerations for HPC Procurements
High Performance Computing (HPC) Data Center Planning and TCO: A Case Study and Roadmap
Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
Josh Bailey
Google LLC
INDIS Morning Keynote
Stephen J. Bailey
Lawrence Berkeley National Laboratory
Optimizing Python Data Processing for the DESI Experiment on the NERSC Cori Supercomputer
Reese Baird
Intel Corporation
OpenHPC Community BoF
Allison Baker
National Center for Atmospheric Research
Students@SC: Careers in Industry, Research Labs, and Academia
Panel Discussion
A Statistical Analysis of Compressed Climate Model Data
Biology Applications
Zachary K. Baker
Los Alamos National Laboratory
Accelerating the Signal Alignment Process in Time-Evolving Geometries Using Python
Jason Bakos
University of South Carolina
Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)
Pavan Balaji
Argonne National Laboratory
Characterization of MPI Usage on a Production Supercomputer
Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading
Runtime for Exascale and Beyond: Convergence or Divergence?
Navigating the SC Conference Technical Program Submission Process
MPICH: A High Performance Open-Source MPI Implementation
Advanced MPI Programming
Prasanna Balaprakash
Argonne National Laboratory
Argonne National Laboratory
Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications
Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows
Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training
Justas Balcas
California Institute of Technology
Division of Physics, Mathematics and Astronomy
SDN for End-to-End Networked Science at the Exascale (SENSE)
Ilya Baldin
Renaissance Computing Institute (RenCI)
Introduction - Innovating the Network for Data Intensive Science (INDIS)
Innovating the Network for Data Intensive Science (INDIS)
Marc Gamell Balmana
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
Alex Balmer
Illinois Institute of Technology
Student Cluster Competition Team Panel Presentation
Gábor Dániel Balogh
Pázmány Péter Catholic University, Hungary
OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling
Fabio Francisco Banchelli
Barcelona Supercomputing Center
Filling the Gap between Education and Industry: Evidence-Based Methods for Introducing Undergraduate Students to HPC
OpenMP: What’s Inside the Black Box?
Kunal Banerjee
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Purushotham V. Bangalore
University of Alabama, Birmingham
Introduction - Workshop on Exascale MPI (ExaMPI)
Workshop on Exascale MPI (ExaMPI)
Liang Bao
XiDian University
Optimizing the Throughput of Storm-Based Stream Processing in Clouds
Ingrid Barcena Roig
KU Leuven, Belgium
The Business of HPC: TCO, Funding Models, Metrics, Value, and More
Procurement and Commissioning of HPC Systems
Deborah Bard
National Energy Research Scientific Computing Center (NERSC)
Lawrence Berkeley National Laboratory
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Deep Learning at Scale
Jaydeep Bardhan
GlaxoSmithKline
Career Development Panel
Md Abdullah Shahneous Shahneous Bari
Stony Brook University
Is Data Placement Optimization Still Relevant on Newer GPUs?
Ashley Barker
Oak Ridge National Laboratory
Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training
The HPC Best Practices Webinar Series
Large Scale System Deployments
Kevin Barker
Pacific Northwest National Laboratory
Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations
Thomas Barr
Nationwide Children's Hospital
Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)
Fourth Computational Approaches for Cancer Workshop (CAFCW18)
William Barth
University of Texas
Texas Advanced Computing Center
The New NSF-Funded Resource: Frontera - Towards a Leadership Class Computing Facility
Denis Barthou
French Institute for Research in Computer Science and Automation (INRIA)
PARCOACH Extension for a Full-Interprocedural Collectives Verification
Andrea Bartolini
University of Bologna
DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
Data Analytics for System and Facility Energy Management
Elisabeth Baseman
Los Alamos National Laboratory
Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo
Achim Basermann
German Aerospace Center
HPC Software Infrastructures at German Aerospace Center
HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making
Muthu Baskaran
Reservoir Labs Inc
Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke
Mary Bass
University of Chicago
Globus
The Power of Storytelling: Exposing User Experiences and Lessons Learned to Inspire and Instruct Technology Adoption
Ned Bass
Lawrence Livermore National Laboratory
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Protonu Basu
Lawrence Berkeley National Laboratory
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Natalie Bates
Energy Efficient HPC Working Group
Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
The Green 500: Trends in Energy Efficient Supercomputing
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
Torey Battelle
Colorado School of Mines
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Gregory H. Bauer
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
Michael Bauer
Nvidia Corporation
Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes
John Baugh
North Carolina State University
Hybrid Theorem Proving as a Lightweight Method for Verifying Numerical Software
Leonardo Bautista-Gomez
Barcelona Supercomputing Center
Toward Ad Hoc Recovery For Soft Errors
Approximating a Multi-Grid Solver
On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
M. Bayatpour
Ohio State University
Cooperative Rendezvous Protocols for Improved Performance and Overlap
Alexandre Bayen
University of California, Berkeley
High Performance Computing in Dynamic Traffic Simulation
Tony Baylis
Lawrence Livermore National Laboratory
Students@SC: Making the Best of Your HPC Education
Neelima Bayyapu
Argonne National Laboratory
MPICH: A High Performance Open-Source MPI Implementation
Julia Bazińska
University of Warsaw
Panel 4: Student Spotlight Presentation
Jonathan C. Beard
ARM Ltd
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
Paul A. Beata
North Carolina State University
Floating-Point Autotuner for CPU-Based Mixed-Precision Applications
Michela Becchi
North Carolina State University
Efficient Deployment of Irregular Computations on Multi- and Many-Core Architectures
A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms
Compiling SIMT Programs on Multi- and Many-Core Processors with Wide Vector Units: A Case Study with CUDA
Floating-Point Autotuner for CPU-Based Mixed-Precision Applications
Gregory B. Becker
Lawrence Livermore National Laboratory
Managing HPC Software Complexity with Spack
Pete Beckman
Argonne National Laboratory
Artificial Intelligence at the Edge: How the Internet of Things and HPC Connect in the Computing Continuum
Marcos Bedo
Fluminense Federal University, Fluminense Federal University, Brazil
A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows
Izaak B. Beekman
ParaTools Inc
ParaTools Inc
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter
Oceane Bel
University of California, Santa Cruz
Geomancy: Automated Data Placement Optimization
Kais Belgaied
Viking Enterprise Solutions
Cassandra in Dockers Deployment Using an NVMe Fabric
Matt Belhorn
Oak Ridge National Laboratory
Spack Community BoF
John Bell
Lawrence Berkeley National Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Emily Belli
General Atomics
Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems
Vicenç Beltran
Barcelona Supercomputing Center
On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
Mehmet E. Belviranli
Oak Ridge National Laboratory
DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access
Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems
Tal Ben-Nun
ETH Zurich
Deep500: An HPC Deep Learning Benchmark and Competition
Deep Learning
Luca Benini
ETH Zurich
DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control
Tom Benson
Lawrence Livermore National Laboratory
Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems
Graham Bent
IBM Research, UK
Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture
John Bent
DataDirect Networks
The IO-500 and the Virtual Institute of I/O
Brad Benton
Advanced Micro Devices Inc
Unified Communication X (UCX) Community
Florian Berberich
Partnership for Advanced Computing in Europe (PRACE)
Big Data Challenge - How to Engage with Large Scale Facilities?
Alexandre Bergel
University of Chile
Visual Analytics Challenges in Analyzing Calling Context Trees
Marco Berghoff
Karlsruhe Institute of Technology
The NAStJA Framework: Non-Collective Scalable Global Communications
Non-Collective Scalable Global Network Based on Local Communications
Evan Berkowitz
Forschungszentrum Juelich
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Tess Bernard
University of Texas
Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method
David E. Bernholdt
Oak Ridge National Laboratory
Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator
Software Engineering and Reuse in Computational Science and Engineering
The HPC Best Practices Webinar Series
Better Scientific Software
Anne Berres
Oak Ridge National Laboratory
Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes
Carlo Bertolli
IBM
OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling
Adam Bertsch
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
E. Wes Bethel
Lawrence Berkeley National Laboratory
Python-Based In Situ Analysis and Visualization
SENSEI Cross-Platform View of In Situ Analytics
Blair Bethwaite
New Zealand eScience Infrastructure
Cloud Infrastructure Solutions To Run HPC Workloads
Abhinav Bhatele
Lawrence Livermore National Laboratory
Visual Analytics Challenges in Analyzing Calling Context Trees
Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing
Panel Discussion
Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)
Students@SC: Careers in Industry, Research Labs, and Academia
Wahid Bhimji
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Sanjukta Bhowmick
University of Nebraska, Omaha
IA^3 Debate
Doctoral Showcase III
Doctoral Showcase I
Mauro Bianco
Swiss National Supercomputing Centre
RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management
Martin Biel
KTH Royal Institute of Technology
Distributed L-Shaped Algorithms in Julia
Arash Bigdeli
University of Texas
Institute for Computational Engineering and Sciences
Arctic Ocean-Sea Ice Interactions
Lars Bildsten
University of California, Santa Barbara
Visualizing Outbursts of Massive Stars
Simon J. L. Billinge
Columbia University
Reproducibility for Streaming Analysis
Jay Jay Billings
Oak Ridge National Laboratory
Software Engineers: Careers in Research
Robert Bird
Los Alamos National Laboratory
Effective Performance Portability
George Biros
University of Texas
Distributed-Memory Hierarchical Compression of Dense SPD Matrices
Approximating for Faster, Better and Cheaper Scientific Computing
GPU-Accelerated Interpolation for 3D Image Registration
Prentice Bisbal
Princeton Plasma Physics Laboratory
Training Computational Scientists to Build and Package Code
Nicholas Bisek
US Air Force Research Laboratory
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter
Ayan Biswas
Los Alamos National Laboratory
A Flexible System For In Situ Triggers
In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization
Laura Biven
US Department of Energy Office of Advanced Scientific Computing Research
Keynote: Perspectives on In Situ
Perspectives on Data Reduction from ASCR
John Blaas
University of Colorado
Stateless Provisioning: Modern Practice in HPC
Datacenter and Cooling Technologies
Robert Blackmore
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Omer Blaes
University of California, Santa Barbara
Visualizing Outbursts of Massive Stars
Sean Blanchard
Los Alamos National Laboratory
SaNSA - the Supercomputer and Node State Architecture
Improving Application Resilience by Extending Error Correction with Contextual Information
Arthur S. Bland
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Alexander Blass
University of Twente
Volume Renderings of Sheared Thermal Convection
Sophie Blondel
University of Tennessee
Improving the I/O Performance and Memory Usage of the Xolotl Cluster Dynamics Simulator
Michaela Blott
Xilinx Inc
Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)
Danny Bluestein
Stony Brook University
Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling
Ansel Blumers
Brown University
Idaho National Laboratory
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Matthias A. Blumrich
Nvidia Corporation
Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage
Brett Bode
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
David Boehme
Lawrence Livermore National Laboratory
Visual Analytics Challenges in Analyzing Calling Context Trees
Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)
Students@SC: HPC Research
Eric Bohm
Charmworks Inc
University of Illinois
Charm++ and AMPI: Adaptive and Asynchronous Parallel Programming
Robert Bohn
National Institute of Standards and Technology
Federated Cloud: An Evolutionary Path from Grid Computing
Vangel Bojaxhi
Inspur
Asia Supercomputer Community (ASC)
Panel 4: Asia Supercomputing Community: Profound Inspiration through Strong Competition
Taisuke Boku
University of Tsukuba
Welcome and Introduction
Panel 3: Japan's HPC Program for System Development and Deployment toward Exascale
Benchmarking Scientific Reconfigurable / FPGA Computing
2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development
Matthias Bollhöfer
Braunschweig University of Technology
Distributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures
Dan Bonachea
Lawrence Berkeley National Laboratory
GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Jason Booth
Northeastern University
Student Cluster Competition Team Panel Presentation
R. Christopher Bording
IBM
Introduction - 5th International Workshop on HPC User Support Tools: HUST-18
5th International Workshop on HPC User Support Tools: HUST-18
James Bordner
San Diego Supercomputer Center
University of California, San Diego
Computational Cosmology and Astrophysics on Adaptive Meshes Using Charm++
Andrea Borghesi
University of Bologna
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
George Bosilca
University of Tennessee
Open MPI State of the Union 2018
Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice
Nader Boushehrinejadmoradi
Rutgers University
A Parallelism Profiler with What-If Analyses for OpenMP Programs
Aurelien Bouteiller
University of Tennessee
Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice
Kurtis Bowman
Gen-Z Consortium
The Data-Centric Future and Gen-Z's Next Generation Interconnect
Pete Bradley
United Technologies Corporation - Pratt & Whitney Division
High Performance Computing in the Cloud at United Technologies
Briana Bradshaw
University of Texas
Texas Advanced Computing Center
Arctic Ocean-Sea Ice Interactions
Jim Brandt
Sandia National Laboratories
Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights
Steven Brandt
Louisiana State University
Asynchronous Execution of Python Code on Task Based Runtime Systems
Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins
David Brayford
Leibniz Supercomputing Centre
OpenHPC Community BoF
Marisa Brazil
Purdue University
Upcoming Events in the HPC Systems Professionals Community
Maximilian H. Bremer
University of Texas at Austin
Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations
Peer-Timo Bremer
Lawrence Livermore National Laboratory
A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing
Paul Brenner
University of Notre Dame
Compliant Cloud+Campus Hybrid HPC Infrastructure
Thomas Brettin
Argonne National Laboratory
Performance, Power, and Scalability Analysis of the Horovod Implementation of the CANDLE NT3 Benchmark on the Cray XC40 Theta
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
CANDLE Framework for Large Scale Deep Learning
Alexander Breuer
University of California, San Diego
Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations
Wesley Brewer
US Department of Defense HPC Modernization Program
Deep Learning Evolutionary Optimization for Regression of Rotorcraft Vibrational Spectra
Neil Bright
Georgia Institute of Technology
Upcoming Events in the HPC Systems Professionals Community
Workloads and Benchmarks for System Acquisition
Ron Brightwell
Sandia National Laboratories
ExaMPI Invited Talk
Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing
MCHPC’18: Workshop on Memory Centric High Performance Computing
Resilience
Gabriel Broner
Rescale
Extending On-Premise HPC to the Cloud
Laura Brown
US Army Engineer Research and Development Center
Workloads and Benchmarks for System Acquisition
Maxine Brown
University of Illinois, Chicago
SAGE2 10th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration
Nick Brown
University of Edinburgh
Panel: Open-Source Software
Driving Asynchronous Distributed Tasks with Events
HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making
Strategies for Inclusive and Scalable HPC Outreach and Education
P. Nigel Brown
Laney College
Student Cluster Competition Team Panel Presentation
Marcus Brumfield
Mississippi State University
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Michael Bruner
Texas A&M University
CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity
Hugo Brunie
Atomic Energy and Alternative Energies Commission (CEA)
PARCOACH Extension for a Full-Interprocedural Collectives Verification
Dana Brunson
Oklahoma State University
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Bill Bryce
Univa Corporation
Enabling HPC and Deep Learning Workloads at Extreme Scale in the Cloud
Erik Brynjolfsson
Massachusetts Institute of Technology
Keynote: Explore How to Deploy the Unruly Power of Machine, Platform, and Crowd
Tomáš Brzobohatý
Technical University of Ostrava, Czech Republic
Workflow for Parallel Processing of Sequential Mesh Databases
Norm Buchanan
Colorado State University
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Chandu Budati
Middle Tennessee State University
Energy-Aware Workflow Scheduling and Optimization in Clouds Using Bat Algorithm
Robert Budden
Pittsburgh Supercomputing Center
Cloud Infrastructure Solutions To Run HPC Workloads
Reuben Budiardja
Oak Ridge National Laboratory
High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability
Zoran Budimlic
Rice University
A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java
Aydin Buluc
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
Linear Algebra Is the Right Way to Think About Graphs
HPC Graph Toolkits and the GraphBLAS Forum
Aydin Buluç
Lawrence Berkeley National Laboratory
GraphBLAS Forum and Its Relevant Software Zoo
IA^3 Debate
David Bunde
Knox College
Peachy Introduction
Joe Bungo
Nvidia Corporation
Deep Learning by Doing: Nvidia Deep Learning Institute
Jeffery Bunting
NVXL Technology Inc
NVXL Acceleration Platform for Polymorphic Acceleration
Luca Buratti
IBM
University of Bologna
On Adam-Trained Models and a Parallel Method to Improve the Generalization Performance
Neil Burgess
ARM Ltd
Panel Discussion
Bill Burns
Rogue Wave Software Inc
Advanced Technologies and Techniques for Debugging HPC Applications
Martin Burtscher
Texas State University
Computing a Movie of Zooming into a Fractal
PARLOT: Efficient Whole-Program Call Tracing for HPC Applications
Anastasiia Butko
Lawrence Berkeley National Laboratory
Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)
Gregory F. Butler
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Ali R. Butt
Virginia Tech
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Suren Byna
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
A Year in the Life of a Parallel File System
Anycast: Rootless Broadcasting with MPI
Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems
Return to Top
C
Hector Carrillo Cabada
University of New Mexico
Effective Performance Portability
Ruben M. Cabezón
University of Basel
Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations
Paul Caheny
Barcelona Supercomputing Center
Polytechnic University of Catalonia
Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs
Steven Calvez
Colorado State University
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
David Camp
Lawrence Berkeley National Laboratory
Python-Based In Situ Analysis and Visualization
Roy Campbell
US Department of Defense HPC Modernization Program
HPC in the DoD
Jeff Candy
General Atomics
Kernel-Based and Total Performance Analysis of CGYRO on 4 Leadership Systems
Richard Shane Canon
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Containers in HPC
Container Computing for HPC and Scientific Workflows
Matteo Cantiello
Flatiron Institute
Visualizing Outbursts of Massive Stars
Fei Cao
University of Central Missouri
Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity
Huiyan Cao
New Jersey Institute of Technology
Optimizing the Throughput of Storm-Based Stream Processing in Clouds
Franck Cappello
Argonne National Laboratory
Reconfigurable Computing for HPC: Will It Make It this Time?
Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation
Full State Quantum Circuit Simulation by Using Lossy Data Compression
VeloC: Very Low Overhead Checkpointing System
Benchmarking Scientific Reconfigurable / FPGA Computing
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
Compression for Scientific Data
Julien Capul
Atomic Energy and Alternative Energies Commission (CEA)
PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows
Luca Carloni
Columbia University
Panel: Open-Source Hardware
How System-Level Design Can Benefit the Progress of Open-Source Hardware
Neil Carlson
Los Alamos National Laboratory
Performance Portability Challenges for Fortran Applications
Philip Carns
Argonne National Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Toward Understanding I/O Behavior in HPC Workflows
A Year in the Life of a Parallel File System
Enabling Data Services for HPC
Analyzing Parallel I/O
Christopher D. Carothers
Rensselaer Polytechnic Institute
Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems
Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps
Alexandra Carpen-Amarie
Fraunhofer Institute for Industrial Mathematics
Algorithm Selection of MPI Collectives Using Machine Learning Techniques
Patrick Carribault
Atomic Energy and Alternative Energies Commission (CEA)
PARCOACH Extension for a Full-Interprocedural Collectives Verification
Alex Carrillo
US Army Engineer Research and Development Center
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Caroline Weilhamer
Indiana University
INDIS Showcases Panel: NRE and XNET and Architecture
Jeffrey Carver
University of Alabama
Software Engineering and Reuse in Computational Science and Engineering
Henri Casanova
University of Hawaii at Manoa
WRENCH: A Framework for Simulating Workflow Management Systems
SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation
Marc Casas Guix
Barcelona Supercomputing Center
Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs
Approximating a Multi-Grid Solver
Ben Casses
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Ralph Castain
Intel Corporation
PMIx: Enabling Workflow Orchestration
Vito Giovanni Castellana
Pacific Northwest National Laboratory
Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms
Enabling High-Level Graph Processing via Dynamic Tasking
IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms
Bryan Catanzaro
Nvidia Corporation
Applying Deep Learning
Carlo Cavazzoni
CINECA
Data Analytics for System and Facility Energy Management
Aurélien Cavelan
University of Basel
Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations
Ling Cen
Moffitt Cancer Center
Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches
Mohamad Chaarawi
Intel Corporation
Evaluation of HPC Application I/O on Object Storage Systems
Nicholas Chaimov
ParaTools Inc
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter
Venkatesan Chakaravarthy
IBM
High-Performance Dense Tucker Decomposition on GPU Clusters
S. Chakraborty
Ohio State University
Cooperative Rendezvous Protocols for Improved Performance and Overlap
High Performance Middlewares for Next Generation Architectures: Challenges and Solutions
InfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing HEC Systems, and Usage
InfiniBand, Omni-Path, and High-Speed Ethernet for Beginners
Dhruva Chakravorty
Texas A&M University
Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level
CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity
Bradford L. Chamberlain
Cray Inc
Panel Discussion
Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI
Chris Chambreau
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Cy P. Chan
Lawrence Berkeley National Laboratory
Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations
Sunita Chandrasekaran
University of Delaware
University of Delaware
5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks
Swiss Army Programming: Performance and Portability from Modern Tools
Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)
Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)
OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap
Fifth Workshop on Accelerator Programming Using Directives (WACCPD)
Chia Cheng Chang
Lawrence Berkeley National Laboratory
RIKEN
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Chun-Kai Chang
University of Texas
Evaluating and Accelerating High-Fidelity Error Injection for HPC
Michael Chang
NVXL Technology Inc
NVXL Acceleration Platform for Polymorphic Acceleration
Barbara Chapman
Stony Brook University
Swiss Army Programming: Performance and Portability from Modern Tools
Is Data Placement Optimization Still Relevant on Newer GPUs?
Dylan Chapp
University of Delaware
University of Tennessee
Introduction of Practical Approaches to Data Analytics for HPC with Spark
Prasanth Chatarasi
Georgia Institute of Technology
A Preliminary Study of Compiler Transformations for Graph Applications on the Emu System
Samit Chaudhuri
NVXL Technology Inc
NVXL Acceleration Platform for Polymorphic Acceleration
Thomas Cheatham
University of Utah
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Ravi Cheema
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Bernard Chen
University of Central Arkansas
Eight Years Analysis of Adopting PDC in Data Structures at UCA
Bingwei Chen
Tsinghua University
National Supercomputing Center, Wuxi
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Dexun Chen
Tsinghua University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Hsing-bung Chen
Los Alamos National Laboratory
Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance
Jackie Chen
Sandia National Laboratories
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Jieyang Chen
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Jim Chen
International Center for Advanced Internet Research (iCAIR)
Northwestern University
Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Kun Chen
Georgia Institute of Technology
A Unified Runtime for PGAS and Event-Driven Programming
Wenguang Chen
Tsinghua University
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds
Xi Chen
University of Kentucky
Deep Learning by Doing: Nvidia Deep Learning Institute
Xiaofei Chen
Southern University of Science and Technology, China
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Xinyu Chen
University of New Mexico
Using Thrill to Process Scientific Data on HPC
Yong Chen
Texas Tech University
Welcome, Workshop Goals, and Opening Remarks
Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish
Distributed Adaptive Radix Tree for Efficient Metadata Search on HPC Systems
Simulating Data Centers with Redfish-Enabled Equipment
HPCViz: Monitoring Health Status of High Performance Computing Systems
Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios
xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory
The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
Zizhong Chen
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation
Xiaohe Cheng
Hong Kong University of Science and Technology
Accelerating 2D FFT: Exploit GPU Tensor Cores through Mixed-Precision
Yu-Hsuan Cheng
National Tsing Hua University, Taiwan
Student Cluster Competition Team Panel Presentation
Yue Cheng
George Mason University
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Jia Lin Cheoh
Purdue University
Student Cluster Competition Team Panel Presentation
Nathanael Cheriere
IRISA
ENS Rennes
Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage
Igor Chernykh
Institute of Computational Mathematics and Mathematical Geophysics SB RAS
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Kazem Cheshmi
University of Toronto
ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism
Ron Chi-Lung Chiang
University of St. Thomas
University of St. Thomas
Contention-Aware Container Placement Strategy for Docker Swarm
Wei Der Chien
KTH Royal Institute of Technology
Characterizing Deep-Learning I/O Workloads in TensorFlow
Artem Chikin
University of Alberta
OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries
Bruce Childers
University of Pittsburgh
Supporting Thorough Artifact Evaluation with Occam
J. Taylor Childers
Argonne National Laboratory
Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows
Hank Childs
University of Oregon
A Flexible System For In Situ Triggers
Wendy K. Tam Cho
University of Illinois
A Massively Parallel Evolutionary Markov Chain Monte Carlo Algorithm for Sampling Complicated Multimodal State SpacesState
George Chochia
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Jee Choi
IBM
High-Performance Dense Tucker Decomposition on GPU Clusters
Jong Youl Choi
Oak Ridge National Laboratory
Feature-Relevant Data Reduction for In Situ Workflows
Shreyas Cholia
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Frederic T. Chong
University of Chicago
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
Full State Quantum Circuit Simulation by Using Lossy Data Compression
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
Hybrid Quantum-Classical Computing Architectures
Krzysztof Choromanski
Google LLC
Adaptive Anonymization of Data with b-Edge Covers
Alok Choudhary
Northwestern University
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies
Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training
Edmond Chow
Georgia Institute of Technology
Accelerating Quantum Chemistry with Vectorized and Batched Integrals
Fahim Chowdhury
Florida State University
Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems
Blair Christian
Oak Ridge National Laboratory
HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports
Marcus Christie
Indiana University
SciGaP: Apache Airavata Hosted Science Gateways
Albert Chu
Lawrence Livermore National Laboratory
Enabling Data Analytics Workflows Using Node-Local Storage
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Xiaowen Chu
Hong Kong Baptist University
Hong Kong Baptist University
GPGPU Performance Estimation with Core and Memory Frequency Scaling
Neil P. Chue Hong
Software Sustainability Institute
University of Edinburgh
Sustaining Research Software
Sudheer Chunduri
Argonne National Laboratory
Characterization of MPI Usage on a Production Supercomputer
I-Hsin Chung
IBM
A Cost-Effective Flexible System Optimized for DNN and ML
Vladimir Chupakhin
Janssen Pharmaceutika NV
HPC-as-a-Service for Life Sciences
Luca Cinquini
Jet Propulsion Laboratory
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Florina M. Ciorba
University of Basel
Detection of Silent Data Corruptions in Smooth Particle Hydrodynamics Simulations
Antonio Cisternino
University of Pisa
Applications of Deep Learning in Industry and Research
M.A. Clark
Nvidia Corporation
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Philippe Claus
French Institute for Research in Computer Science and Automation (INRIA)
University of Strasbourg
AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests
Beverly Clayton
Pittsburgh Supercomputing Center
SC: The Conference
David Clements
Igneous Systems Inc
Data Protection Solutions for ML/AI
David Clifton
ANSYS Inc
Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)
Sharlee Climer
University of Missouri, St Louis
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
Douglas D. Cline
Lockheed Martin Aeronautics Company
Challenges and Solutions in the Industrial Application of HPC at Lockheed Martin
Tim Cockerill
Texas Advanced Computing Center
Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
Valeriu Codreanu
SURFsara
Fast and Accurate Training of an AI Radiologist
Large Minibatch Training on Supercomputers with Improved Accuracy and Reduced Time to Train
Henrique Colao Zanúz
Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LIG
In-Transit Molecular Dynamics Analysis with Apache Flink
Maureen Colbert
Dartmouth Medical School
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Phil Colella
Lawrence Berkeley National Laboratory
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
A Low-Communicaton Method to Solve Poisson's Equation on Locally-Structured Grids
Mark Coletti
Oak Ridge National Laboratory
Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes
Ian Colle
Amazon Web Services
The Difference Between HPC on Premises and in the Cloud
What Would You Do with a Million Cores of Compute Capacity?
Nicholson Collier
Argonne National Laboratory
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
Toni Collis
Appentra Solutions
Developing Workplace Resilience and Managing Stress
Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training
Introduction - Women in HPC: Diversifying the HPC Community
Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.
Women in HPC: Diversifying the HPC Community
Alan Commike
Reservoir Labs Inc
Fast Detection of Elephant Flows with Dirichlet-Categorical Inference
Guojing Cong
IBM
On Adam-Trained Models and a Parallel Method to Improve the Generalization Performance
Keith Conger
Colorado College
Building a Low Budget Cluster Through Hardware Reuse
Giuseppe Congiu
Argonne National Laboratory
MPICH: A High Performance Open-Source MPI Implementation
Brandon Cook
Lawrence Berkeley National Laboratory
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Jeanine Cook
Sandia National Laboratories
Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications
Jonathan Cook
New Mexico State University
Exploring and Quantifying How Communication Behaviors in Proxies Relate to Real Applications
Julita Corbalan
Barcelona Supercomputing Center
Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa
Ayse Coskun
Boston University
Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights
Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance
Alexandru Costan
IRISA, INSA Rennes
Planner: Cost-efficient Execution Plans Placement for Uniform Stream Analytics on Edge and Cloud
J. Eric Coulter
Indiana University
XSEDE
Programmable Education Infrastructure: Cloud Resources as HPC Education Environments
Peter Coveney
University College London
Personalized Medicine and HPC
Jim Cownie
Intel Corporation
OpenMP® 5.0 Is Here: Find Out All the Things You Need to Know About It!
LLVM in HPC: What's New?
Samantha Coyle
Texas State University
High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem
Charles D. Cranor
Carnegie Mellon University
Scaling Embedded In Situ Indexing with DeltaFS
Francesco Cremonesi
Swiss Federal Institute of Technology in Lausanne
Applying the Execution-Cache-Memory Model: Current State of Practice
Daniel Crichton
Jet Propulsion Laboratory
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Silvia Crivelli
Lawrence Berkeley National Laboratory
OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine
Capsule Networks for Protein Structure Classification
Clara E. Cromey
University of Arizona
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing
Julian Cuevas Paniagua
Lawrence Berkeley National Laboratory
University of Puerto Rico at Mayaguez
Capsule Networks for Protein Structure Classification
Xuewen Cui
Virginia Tech
Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining
Yifeng Cui
San Diego Supercomputer Center
Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations
Christine Cuicchi
US Department of Defense
Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.
Volunteer Opportunities for SC Conference Planning
HPC in the DoD
Scott Cukras
Moffitt Cancer Center
Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches
Aaron Culich
University of California, Berkeley
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Massimiliano Culpo
Swiss Federal Institute of Technology in Lausanne
Managing HPC Software Complexity with Spack
Matthew Curry
Sandia National Laboratories
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Tony Curtis
Stony Brook University
OpenSHMEM in the Era of Exascale
Maciej Cytowski
Pawsey Supercomputing Centre
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
HPC Education and Training: An Australian Perspective
Return to Top
D
Marco D'Amico
Barcelona Supercomputing Center
Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa
Maytal Dahan
University of Texas
Hot Topics Discussion I: Thriving at Work
Developing Workplace Resilience and Managing Stress
Mai Dahshan
Virginia Tech
Making Sense of Scientific Simulation Ensembles
Dong Dai
University of North Carolina, Charlotte
Introduction - The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
The 2nd Industry/University Joint International Workshop on Data Center Automation, Analytics, and Control (DAAC)
Abdul Dakkak
University of Illinois
MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines
Chris Daley
Lawrence Berkeley National Laboratory
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
John Daly
Laboratory for Physical Sciences at University of Maryland
Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)
Nathaniel Danandeh
University of California, San Diego
Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU
Hoang-Vu Dang
University of Illinois
Fast and Generic Concurrent Message-Passing
Tommy Dang
Texas Tech University
Texas Tech University
Visualizing Multidimensional Health Status of Data Centers
HPCViz: Monitoring Health Status of High Performance Computing Systems
Frederica Darema
United States Air Force
Federated Cloud: An Evolutionary Path from Grid Computing
Eli Dart
Energy Sciences Network (ESnet)
HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Anwesha Das
North Carolina State University
Doomsday: Predicting Which Node Will Fail When on Supercomputers
Holistic Root Cause Analysis of Node Failures in Production HPC
James Davis
University of Warwick
Optimizing Machine Learning on Apache Spark in HPC Environments
John Davis
Bigstream Networks
Accelerating Intelligence
Joshua H. Davis
University of Delaware
Studying the Impact of Power Capping on MapReduce-Based, Data-Intensive Mini-Applications on Intel KNL and KNM Architectures
Philip Davis
Rutgers University
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows
Gene Davison
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Johannes de Fine Licht
ETH Zurich
Productive Parallel Programming for FPGA with High-Level Synthesis
Maarten V. de Hoop
Rice University
Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver
Wibe de Jong
Lawrence Berkeley National Laboratory
Quantum Computing for Scientific Applications
Cees de Laat
University of Amsterdam
Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners
Mix-and-Match: A Model-Driven Runtime Optimization Strategy for BFS on GPUs
Noël De Palma
University of Grenoble
CPU Overheating Characterization in HPC Systems: a Case Study
Bronis R. de Supinski
Lawrence Livermore National Laboratory
Energy Efficiency Modeling of Parallel Applications
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining
Mastering Tasking with OpenMP
Advanced OpenMP: Host Performance and 5.0 Features
Resource Management and Interference
Debzani Deb
Winston-Salem State University
Introduction - Workshop on Education for High Performance Computing (EduHPC)
An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level
Nathan Debardeleben
Los Alamos National Laboratory
Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo
SaNSA - the Supercomputer and Node State Architecture
Improving Application Resilience by Extending Error Correction with Contextual Information
Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)
Tivan: A Scalable Data Collection and Analytics Cluster
Bert Debusschere
Sandia National Laboratories
Presenting / Communication
CRE218 – Plenary II
Ewa Deelman
University of Southern California, Information Sciences Institute
Enabling Data Analytics Workflows Using Node-Local Storage
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Davide Del Vento
National Center for Atmospheric Research
AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries
Robert L. DeLeon
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Ameneh Deljoo
University of Amsterdam
Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners
Phil Demar
Fermi National Accelerator Laboratory
Computing Division
SDN for End-to-End Networked Science at the Exascale (SENSE)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Gökalp Demirci
University of Chicago
A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints
James Demmel
University of California, Berkeley
Correctness of Floating Point Programs - Exception Handling and Reproducibility
Minwen Deng
Tencent Holdings Ltd
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Yuefan Deng
Stony Brook University
Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling
Eva Dengler
University of Erlangen-Nuremberg
Student Cluster Competition Team Panel Presentation
Larry Dennison
Nvidia Corporation
Light-Weight Protocols for Wire-Speed Ordering
Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage
Joel Denny
Oak Ridge National Laboratory
Clacc: Translating OpenACC to OpenMP in Clang
Jeff Denton
Clemson University
Using CloudLab as a Scalable Platform for Teaching Cluster Computing
Luiz DeRose
Cray Inc
Energy Efficiency Modeling of Parallel Applications
Ajay Deshpande
General Motors Company
HPC Drives GM
Jack Deslippe
Lawrence Berkeley National Laboratory
“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Exascale Deep Learning for Climate Analytics
A Case Study for Performance Portability Using OpenMP 4.5
Sean Dettrick
TAE Technologies
Data Fusion for Nuclear Fusion – Using HPC To Put a Star in a Bottle
Data Reduction Challenges in Coordinated Simulation and Experimental Fusion Science
Sindhu Devale
Texas State University
PARLOT: Efficient Whole-Program Call Tracing for HPC Applications
Hariharan Devarajan
Illinois Institute of Technology
HDF Group
Hermes: a Multi-Tiered Distributed I/O Buffering System for HDF5
Charlie Dey
University of Texas
Data Science and HPC Education and Outreach
Akshaye Dhawan
Bloomberg LP
TCHPC Career Panel
Salvatore Di Girolamo
ETH Zurich
SimFS: A Simulation Data Virtualizing File System Interface
Diana Di Luccio
Parthenope University of Naples
DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything
Luca di Mare
Oxford Thermofluids Institute
University of Oxford
Software Prefetching for Unstructured Mesh Applications
Sheng Di
Argonne National Laboratory
Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation
Full State Quantum Circuit Simulation by Using Lossy Data Compression
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
Gerrett Diamond
Rensselaer Polytechnic Institute
Dynamic Load Balancing of Plasma and Flow Simulations
James Dickson
University of Warwick
Performance Portability of an Unstructured Hydrodynamics Mini-Application
Patrick Diehl
Louisiana State University
Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)
Asynchronous Execution of Python Code on Task Based Runtime Systems
Joan Digney
Carnegie Mellon University
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
Dabin Ding
University of Central Missouri
Engaging Students in Parallel and Distributed Computing Learning by Games Design Using Unity
Nan Ding
Lawrence Berkeley National Laboratory
Understanding Potential Performance Issues Using Resource-Based alongside Time Models
Pengfei Ding
Fermi National Accelerator Laboratory
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Minh Ngoc Dinh
University of Queensland
Energy Efficiency Modeling of Parallel Applications
Alexander Ditter
University of Erlangen-Nuremberg
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Integrating Network-Attached FPGAs into the Cloud Using Partial Reconfiguration
Tu Mai Anh Do
University of Southern California, Information Sciences Institute
Enabling Data Analytics Workflows Using Node-Local Storage
Doug Doerfler
Lawrence Berkeley National Laboratory
P3HPC Session 1 Panel Discussion
Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Jens Domke
Tokyo Institute of Technology
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing
Evan Donato
University of Massachusetts, Boston
Student Cluster Competition Team Panel Presentation
Bin Dong
Lawrence Berkeley National Laboratory
Automated Parallel Data Processing Engine with Application to Large-Scale Feature Extraction
Jack Dongarra
University of Tennessee
Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers
Approximating for Faster, Better and Cheaper Scientific Computing
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines
Batched, Reproducible, and Reduced Precision BLAS
HPCG Benchmark Update
TOP500 Supercomputers
Big Data and Exascale Computing (BDEC2) Application Roundtable
Invited Talk Session 5
9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
David Donofrio
Lawrence Berkeley National Laboratory
Closing Remarks
Opening Remarks
Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)
xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory
4th Workshop for Open Source Supercomputing (OpenSuCo)
Rion Dooley
University of Texas
Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins
Matthieu Dorier
Argonne National Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Pufferbench: Evaluating and Optimizing Malleability of Distributed Storage
Fred Douglis
Perspecta Labs
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Chris Downing
Red Oak Consulting
OpenHPC Community BoF
Derek Doyle
Colorado State University
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Erik W. Draeger
Lawrence Livermore National Laboratory
Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries
Physics and Tensor Applications
Petros Drineas
Purdue University
Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps
Maurizio Drocco
Pacific Northwest National Laboratory
Enabling High-Level Graph Processing via Dynamic Tasking
Nikoli Dryden
University of Illinois
Lawrence Livermore National Laboratory
Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems
Kristof Du Bois
Intel Corporation
Many-Core Graph Workload Analysis
Shaohua Duan
Rutgers University
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows
Xiaohui Duan
Shandong University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Anshu Dubey
Argonne National Laboratory
An Application Perspective on Programming Models for the Future
Keynote: Better Scientific Software (BSSw)
Panel Discussion
Sustaining Research Software
Better Scientific Software
Thomas Dufaud
University of Versailles
Design of Data Management for Multi-SPMD Workflow Programming Model
Thom Dunning
University of Washington
Pacific Northwest National Laboratory
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Earl P.N. Duque
Intelligent Light
ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
Dmitry Durnov
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
Soumya Dutta
Los Alamos National Laboratory
A Flexible System For In Situ Triggers
In Situ Data-Driven Adaptive Sampling for Large-Scale Simulation Data Summarization
Return to Top
E
Jonathan Eastep
Intel Corporation
Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack
Hans Eberle
Nvidia Corporation
Light-Weight Protocols for Wire-Speed Ordering
John D. Eblen
University of Tennessee
High-Performance Molecular Dynamics Simulation for Biological and Materials Sciences: Challenges of Performance Portability
David Eder
Maui High Performance Computing Center
OpenMP Common Core: a “Hands-On” Exploration
Stratos Efstathiadis
New York University
Third Annual Meeting of the SIGHPC - Big Data Chapter
Aryan Eftekhari
University of Lugano
Distributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures
Rob Egan
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
Ryusuke Egawa
Tohoku University
A Locality and Memory Congestion-Aware Thread Mapping Method for Modern NUMA Systems
Jan Eitzinger
University of Erlangen-Nuremberg
Erlangen Regional Computing Center
Applying the Execution-Cache-Memory Model: Current State of Practice
Jorge Ejarque
Barcelona Supercomputing Center
AutoParallel: A Python Module for Automatic Parallelization and Distributed Execution of Affine Loop Nests
Samer El Haj Mahmoud
Lenovo
Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective
Kaoutar El Maghraoui
IBM
Developing Workplace Resilience and Managing Stress
Tarek El-Ghazawi
George Washington University
Productive Data Locality Optimizations in Distributed Memory
Mohamad S. El-Zein
Deere & Company
Emergence of Tools - a Competitive Advantage at John Deere
Vadim Elisseev
IBM
A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management
Paul R. Eller
University of Illinois
Scalable Non-Blocking Krylov Solvers for Extreme-Scale Computing
Sally Ellingson
University of Kentucky
Building Lasting and Effective Mentoring Relationships
James Elliott
Sandia National Laboratories
Low Thread-Count Gustavson: A Multithreaded Algorithm for Sparse Matrix-Matrix Multiplication Using Perfect Hashing
Carolyn Ellis
Purdue University
Best Practices from Organizations on Improving Workplace Diversity
Daniel Ellsworth
Colorado College
Building a Low Budget Cluster Through Hardware Reuse
Murali Emani
Lawrence Livermore National Laboratory
Is Data Placement Optimization Still Relevant on Newer GPUs?
Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling
David Emerson
Science and Technology Facilities Council, UK
GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne
Mark Endrei
University of Queensland
Energy Efficiency Modeling of Parallel Applications
Christian Engelmann
Oak Ridge National Laboratory
A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform
Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer
Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
Jeremy Enos
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
Mattan Erez
University of Texas
Evaluating and Accelerating High-Fidelity Error Injection for HPC
Steven Eschrich
Moffitt Cancer Center
Developing a Reproducible WDL-Based Workflow for RNASeq Data Using Modular, Software Engineering-Based Approaches
Trilce Estrada
University of New Mexico
PDC Curriculum Update
Vincent Etienne
Saudi Aramco
Toward Smoothing Data Movement Between RAM and Storage
Redesigning The Absorbing Boundary Algorithm for Asynchronous High Performance Acoustic Wave Propagation
Noah Evans
Sandia National Laboratories
Verifying Qthreads: Is Model Checking Viable for User Level Tasking Runtimes?
Oliver Evans
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Stijn Eyerman
Intel Corporation
Many-Core Graph Workload Analysis
Matthew A. Ezell
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Return to Top
F
Mike Fagan
Rice University
Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing
Kjiersten Fagnan
Lawrence Berkeley National Laboratory
US Department of Energy Joint Genome Institute
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
J.J. Falkanger
Lenovo
Managing the Convergence of HPC and AI
Duoming Fan
Lawrence Berkeley National Laboratory
Efficient Application of Low Mach Number Hydrodynamics Code to Stellar Flows
Alessandro Fanfarillo
National Center for Atmospheric Research
AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries
Dingyi Fang
Northwest University, China
Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers
Amin Farmahini-Farahani
Advanced Micro Devices Inc
Challenges of High-Capacity DRAM Stacks and Potential Directions
Muhammad Nufail Farooqi
Koc University
Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows
Steven Farrell
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Deep Learning at NERSC: Usability, Capability, and Everything in Between
Deep Learning at Scale
Massimiliano Fatica
Nvidia Corporation
Exascale Deep Learning for Climate Analytics
Farzad Fatollahi-Fard
Lawrence Berkeley National Laboratory
Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)
xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory
4th Workshop for Open Source Supercomputing (OpenSuCo)
Adrien Faure
Atos
Considering the Development Workflow to Achieve Reproducibility with Variation
Jean M. Favre
Swiss National Supercomputing Centre
Volume Renderings of Sheared Thermal Convection
Dmitri Fedorov
National Institute of Advanced Industrial Science and Technology (AIST)
MPI/OpenMP parallelization of the Fragment Molecular Orbitals Method in GAMESS
Noah Feldman
Carleton College
A Statistical Analysis of Compressed Climate Model Data
Evan Felix
Pacific Northwest National Laboratory
CView and NWPerf for Supercomputer Performance Collection and Display.
Shengzhong Feng
Shenzhen Institutes of Advanced Technology
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Wu Feng
Virginia Tech
Performance Evaluation of the NVIDIA Tesla V100: Block Level Pipelining vs. Kernel Level Pipelining
The Green 500: Trends in Energy Efficient Supercomputing
John Feo
Pacific Northwest National Laboratory
IA^3 Debate
Introduction - IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms
Enabling High-Level Graph Processing via Dynamic Tasking
HPC Graph Toolkits and the GraphBLAS Forum
IA^3 2018: 8th Workshop on Irregular Applications: Architectures and Algorithms
S M Ferdous
Purdue University
Adaptive Anonymization of Data with b-Edge Covers
Michael Ferguson
Cray Inc.
Chapel Aggregation Library (CAL)
Mark Fernandez
Hewlett Packard Enterprise
HPC in Space: An Update on Spaceborne Computer after 1+ Year on the ISS
Milinda Fernando
University of Utah
Dendro-GR: Massively Parallel Simulations of Binary Black Hole Intermediate-Mass-Ratio Inspirals
Mauricio H. Ferrato
University of Delaware
Estimating Molecular Dynamics Chemical Shift with GPUs
Rafael Ferreira da Silva
University of Southern California
Introduction - WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale Science
WRENCH: A Framework for Simulating Workflow Management Systems
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Kurt B. Ferreira
Sandia National Laboratories
Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo
Mentor-Protégé Informational Session
Building Lasting and Effective Mentoring Relationships
Nicola Ferrier
Argonne National Laboratory
University of Chicago
Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
libIS: A Lightweight Library for Flexible In Transit Visualization
ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
François Févotte
EDF Research and Development
Debugging and Optimization of HPC Programs in Mixed Precision with the Verrou Tool
Florian Fey
University of Münster
Unified Cross-Platform Profiling of Parallel C++ Applications
Chris Fietkiewicz
Case Western Reserve University
Potential Influence of Prior Experience in an Undergraduate-Graduate Level HPC Course
Alex Filby
Dell EMC
Fast and Accurate Training of an AI Radiologist
Weronika Filinger
University of Edinburgh
Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training
International HPC Certification Program
Strategies for Inclusive and Scalable HPC Outreach and Education
Toward a HPC Certification Program
The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training
Women in HPC: Diversifying the HPC Community
Salvatore Filippone
Cranfield University
Panel Discussion
Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model
Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI
Hal Finkel
Argonne National Laboratory
Workshop Lunch (on your own)
Workshop Afternoon Break
Workshop Morning Break
LLVM-HPC2018: Final Discussion
Amplitude-Aware Lossy Compression for Quantum Circuit Simulation
Introduction - LLVM-HPC2018: The Fifth Workshop on the LLVM Compiler Infrastructure in HPC
Full State Quantum Circuit Simulation by Using Lossy Data Compression
LLVM in HPC: What's New?
Distributed and Heterogeneous Programming in C++ for HPC 2018
Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing
Memory-Efficient Quantum Circuit Simulation by Using Lossy Data Compression
Hybrid Quantum-Classical Computing Architectures
User-Directed Loop-Transformations in Clang
Jeremy Fischer
Indiana University
XSEDE
Programmable Education Infrastructure: Cloud Resources as HPC Education Environments
Zac Flamig
University of Chicago
The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects
Justin Fletcher
US Air Force Research Laboratory
HPC in the DoD
Jose Flich
Technical University of Valencia
The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems
Patrick Flick
Georgia Institute of Technology
Parallel and Scalable Combinatorial String and Graph Algorithms on Distributed Memory Systems
Fernanda Foertter
Nvidia Corporation
Approximating for Faster, Better and Cheaper Scientific Computing
Claudia Fohry
University of Kassel
Comparison of the HPC and Big Data Java Libraries Spark, PCJ and APGAS
Félix-Antoine Fortin
Laval University
Panel: Interactivity in Supercomputing
Neil Fortner
HDF Group
Evaluation of HPC Application I/O on Object Storage Systems
Greg Foss
University of Texas
Texas Advanced Computing Center
Arctic Ocean-Sea Ice Interactions
Ian Foster
Argonne National Laboratory
Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)
Introduction - Deep Learning on Supercomputers - Welcome
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
Pouya Fotouhi
University of California, Davis
FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects
Yvan Fournier
EDF Research and Development
GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne
Large Scale Computation of Quantiles Using MELISSA
Geoffrey Fox
Indiana University
Big Data and Exascale Computing (BDEC2) Application Roundtable
Will Fox
Massachusetts Institute of Technology
Feature-Relevant Data Reduction for In Situ Workflows
Manaurae Francisquez
Princeton Plasma Physics Laboratory
Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method
Evan Fraser
Arrell Food Institute
Global Food Security
HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges
Melyssa Fratkin
University of Texas
Achieving Performance on Large-Scale Intel Xeon-Based Systems
Brian Friesen
Lawrence Berkeley National Laboratory
Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Joshua B. Fryman
Intel Corporation
Many-Core Graph Workload Analysis
Haohuan Fu
Tsinghua University
National Supercomputing Center, Wuxi
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers
Huansong Fu
Florida State University
Enabling Efficient Data Infrastructure and Analytics on HPC Systems
Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems
Song Fu
University of North Texas
Characterizing Declustered Software RAID for Enhancing Storage Reliability and Performance
Muztaba Fuad
Winston-Salem State University
An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level
Joel Fuentes
University of California, Irvine
Using Integrated Processor Graphics to Accelerate Concurrent Data and Index Structures
Using Integrated Processor Graphics to Accelerate Concurrent Data and Index Structures
Akihiro Fujii
Kogakuin University
MGRIT Preconditioned Krylov Subspace Method
Kohei Fujita
University of Tokyo
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Masahiro Fujita
LTE Inc
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Takeshi Fukaya
Hokkaido University
Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices
Douglas Fuller
Red Hat Inc
Ceph Applications in HPC Environments
Steve Furber
University of Manchester
Brain-Inspired Massively-Parallel Computing
Thomas R. Furlani
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Grigori Fursin
Dividiti Ltd
cTuning Foundation
Open Panel: Automating Artifact Sharing, Evaluation, and Reuse
Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC
Mikito Furuichi
Japan Agency for Marine-Earth Science and Technology
Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation
Return to Top
G
Joerg Gablonsky
Boeing
The Enterprise HPC Service at Boeing
Alex Gagliano
University of Illinois
The First Water in the Universe
Sanjaya Gajurel
Case Western Reserve University
Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images
Jim Galarowicz
Krell Institute
How to Analyze the Performance of Parallel Codes 101
Daniel Gall
Engility Corporation
Managing Python in HPC Environments
Brian Gallagher
Lawrence Livermore National Laboratory
Enabling Data Analytics Workflows Using Node-Local Storage
Jean-Mathieu Gallard
Technical University Munich
Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes
Steven M. Gallo
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Arjun Gambhir
Lawrence Livermore National Laboratory
Lawrence Berkeley National Laboratory
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Todd Gamblin
Lawrence Livermore National Laboratory
Open Panel: Automating Artifact Sharing, Evaluation, and Reuse
Introduction - ResCuE-HPC: 1st Workshop on Reproducible, Customizable, and Portable Workflows for HPC
Spack Community BoF
Managing HPC Software Complexity with Spack
Lin Gan
Tsinghua University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Gregory R. Ganger
Carnegie Mellon University
Scaling Embedded In Situ Indexing with DeltaFS
Dennis Gannon
Microsoft Corporation
Data and Storage
Wilfried N. Gansterer
University of Vienna
Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods
Raghu K. Ganti
IBM Research
Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture
Ping Gao
Shandong University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Shuang Gao
Nvidia Corporation
Cross-Layer Group Regularization for Deep Neural Network Pruning
Sicun Gao
University of California, San Diego
Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU
Eric Garcia
Intel Corporation
Function/Kernel Vectorization via Loop Vectorizer
Michael Garland
Nvidia Corporation
Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes
A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs
Jim Garlick
Lawrence Livermore National Laboratory
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Maria Garzaran
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
Derek R. Gaston
Idaho National Laboratory
A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations
Rahulkumar Gayatri
Lawrence Berkeley National Laboratory
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
A Case Study for Performance Portability Using OpenMP 4.5
J. Michael Gaziano
Harvard Medical School
Complex Phenomics in the MVP
Lixin Ge
SLAC National Accelerator Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Assefaw Gebremedhin
Washington State University
miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems
Al Geist
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Introduction - 9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
9th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems
Tong Geng
Boston University
Energy Efficiency of Reconfigurable Caches on FPGAs
Binarized ImageNet Inference in 29us
Daniel Gens
Lawrence Berkeley National Laboratory
National Energy Research Scientific Computing Center (NERSC)
Spectral Analysis: Building an LGBTQIA+ Community in Scientific Computing
Ann Gentile
Sandia National Laboratories
Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights
Evangelos Georganas
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Extreme Scale De Novo Metagenome Assembly
Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels
Joseph George
Cray Inc
The Next Wave of HPC in the Datacenter
Serban Georgescu
Fujitsu Laboratories of Europe Ltd.
DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping
Anja Gerbes
Goethe University Frankfurt
Toward a HPC Certification Program
Andreas Gerndt
German Aerospace Center
HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making
Balazs Gerofi
RIKEN
On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
Jim Gerry
IBM
Exascale Archiving - Challenges and Opportunities
Sandra Gesing
University of Notre Dame
Sustaining Research Software
Introduction - WORKS 2018: 13th Workshop on Workflows in Support of Large-Scale Science
Noushin Ghaffari
Texas A&M University
Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level
Sheikh Ghafoor
Tennessee Technological University
PDC Curriculum Update
Yanzan Gharaibeh
Case Western Reserve University
Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images
Ali Ghazanfar
Texas Tech University
HPCViz: Monitoring Health Status of High Performance Computing Systems
Amir Gholami
University of California, Berkeley
GPU-Accelerated Interpolation for 3D Image Registration
Priyanka Ghosh
Washington State University
Scalable Methods for Genome Assembly
Sayan Ghosh
Washington State University
miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems
Soumyadip Ghosh
University of Notre Dame
Event-Triggered Communication in Parallel Computing
Devarshi Ghoshal
Lawrence Berkeley National Laboratory
Dac-Man: Data Change Management for Scientific Datasets on HPC Systems
Anna Giannakou
Lawrence Berkeley National Laboratory
Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks
Garth A. Gibson
Carnegie Mellon University
Scaling Embedded In Situ Indexing with DeltaFS
Miguel Gila
Swiss National Supercomputing Centre
RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management
Ellis Giles
Rice University
Hardware Transactional Persistent Memory
Hardware Transactional Persistent Memory
Roscoe Giles
Boston University
SC: The Conference
Bruce Gilpin
Versity Software Inc
Exascale Archiving - Challenges and Opportunities
Mercedes Gimeno-Segovia
PsiQuantum
Quantum Communication Networks and Technologies
Ran Ginosar
Israel Institute of Technology
In-Memory Accelerator Architectures for Machine Learning and Bioinformatics
Processing-in-Storage Architecture for Machine Learning and Bioinformatics
Olivier Giroux
Nvidia Corporation
Swiss Army Programming: Performance and Portability from Modern Tools
Alex Gittens
Rensselaer Polytechnic Institute
Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps
Lisa Gittner
Texas Tech University
Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs
Ben Glick
Lewis & Clark College
Jupyter Notebooks and User-Friendly HPC Access
Madeleine Glick
Columbia University
Photonic Interconnects for Extreme Scale Computing
Next-Generation Networking
Nathan Gober
Texas A&M University
CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity
Tyler Gobran
University of Alberta
OpenMP Target Offloading: Splitting GPU Kernels, Pipelining Communication and Computation, and Selecting Better Grid Geometries
William A. Goddard III
California Institute of Technology
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Michael Goesele
Graphics, Capture and Massively Parallel Computing
Technical University Darmstadt
A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs
Andreas W. Goetz
San Diego Supercomputer Center
Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU
Brice Goglin
French Institute for Research in Computer Science and Automation (INRIA)
University of Bordeaux
Scheduling for In-machine Analytics: Data Size Is Important
Maya Gokhale
Lawrence Livermore National Laboratory
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
Opportunities for Extreme Heterogeneity in High Performance Architectures
Reconfigurable Computing for HPC: Will It Make It this Time?
Introduction - MCHPC’18: Workshop on Memory Centric High Performance Computing
MCHPC’18: Workshop on Memory Centric High Performance Computing
Daniel Goldberg
University of Edinburgh
A Study on Checkpoints Compression for Adjoint Computation
Robin Goldstone
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
File Systems: Data Movement and Provenance
Eugene Goltsman
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
Gabriel Gomes
University of California, Berkeley
High Performance Computing in Dynamic Traffic Simulation
Leon Gommans
Air France-KLM
Social Computational Trust Model (SCTM): A Framework to Facilitate Selection of Partners
Pedro Gonnet
Google LLC
An Efficient SIMD Implementation of Pseudo-Verlet Lists for Neighbor Interactions in Particle-Based Codes
Elsa Gonsiorowski
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Navigating SC
Introduction - 5th International Workshop on HPC User Support Tools: HUST-18
Introduction - Women in HPC: Diversifying the HPC Community
VeloC: Very Low Overhead Checkpointing System
Navigating SC
Navigating SC
Jaime González Cuevas
Appentra Solutions
Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.
Arturo Gonzalez-Escribano
University of Valladolid
Storms of High-Energy Particles: An assignment for OpenMP, MPI, and CUDA/OpenCL
John Goodhue
Massachusetts Green High Performance Computing Center
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Tom Gooding
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Clay Goodman
Mississippi State University
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Ganesh Gopalakrishnan
University of Utah
Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges
Making Formal Methods for HPC Disappear
Facilitating the Adoption of Correctness Tools in HPC Applications
PARLOT: Efficient Whole-Program Call Tracing for HPC Applications
Jan Goral
University of Utah
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Sergei Gorlatch
University of Münster
Portable Parallel Performance via Multi-Dimensional Homomorphisms
Unified Cross-Platform Profiling of Parallel C++ Applications
Ryan Gosse
US Air Force Research Laboratory
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter
Markus Götz
Karlsruhe Institute of Technology
Machine Learning-Aided Numerical Linear Algebra: Convolutional Neural Networks for the Efficient Preconditioner Generation
John Gounley
Duke University
Toward a Computational Simulation of Circulating Tumor Cell Transport in Vascular Geometries
Mark Govett
National Oceanic and Atmospheric Administration
Purpose-Built HPC: Last Hope for Earth System Prediction?
Jose Gracia
High Performance Computing Center Stuttgart
Pros and Cons of HPCx benchmarks
Richard Graham
Mellanox Technologies
Heterogeneous Systems and the Road to Exascale for HPC and AI
Patrick Gralka
University of Stuttgart, Visualization Research Center
Visual Analytics Challenges in Analyzing Calling Context Trees
David Grant
Oak Ridge National Laboratory
The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification
Ryan E. Grant
Sandia National Laboratories
ExaMPI Panel
Introduction - Workshop on Exascale MPI (ExaMPI)
Power API and Redfish: Standardizing Power Measurement and Control for HPC
Workshop on Exascale MPI (ExaMPI)
Samuel Grayson
University of Texas, Dallas
NautDB: Toward a Hybrid Runtime for Processing Compiled Queries
Christopher Green
Fermi National Accelerator Laboratory
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Data-Parallel Python for High Energy Physics Analyses
Jennifer Green
Los Alamos National Laboratory
How to Analyze the Performance of Parallel Codes 101
Oded Green
Georgia Institute of Technology
A Fast and Simple Approach to Merge and Merge Sorting Using Wide Vector Instructions
Hugh Greenberg
Los Alamos National Laboratory
SaNSA - the Supercomputer and Node State Architecture
Tivan: A Scalable Data Collection and Analytics Cluster
Joe Greenseid
Cray Inc
Monitoring Large-Scale HPC Systems: Extracting and Presenting Meaningful System and Application Insights
Gary Grider
Los Alamos National Laboratory
Scaling Embedded In Situ Indexing with DeltaFS
Exascale Archiving - Challenges and Opportunities
Kevin Griffin
Lawrence Livermore National Laboratory
Visual Analytics Challenges in Analyzing Calling Context Trees
Andrew Grimshaw
University of Virginia
Invited Talk: The Campus Compute Cooperative Project as an Alternative to Commercial Clouds
Federated Cloud: An Evolutionary Path from Grid Computing
Leopold Grinberg
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Mark Grondona
Lawrence Livermore National Laboratory
Flux: Overcoming Scheduling Challenges for Exascale Workflows
William Gropp
University of Illinois
SC: The Conference
Scalable Non-Blocking Krylov Solvers for Extreme-Scale Computing
Software Engineering and Reuse in Computational Science and Engineering
Advanced MPI Programming
Pascal Grosset
Los Alamos National Laboratory
Using Thrill to Process Scientific Data on HPC
Max Grossman
BP
The BP Data Science Sandbox
A Unified Runtime for PGAS and Event-Driven Programming
A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java
Robert L. Grossman
University of Chicago
The Gen3 Approach to Portability and Repeatability for Cancer Genomics Projects
Paola Grosso
University of Amsterdam
Introduction - Innovating the Network for Data Intensive Science (INDIS)
Tracking Network Flows with P4
Innovating the Network for Data Intensive Science (INDIS)
Dave Grote
Lawrence Livermore National Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Matthew Grover
Walmart Inc
Applications of Deep Learning in Industry and Research
Daniel Gruner
University of Toronto
InfiniBand In-Network Computing Technology and Roadmap
Trends in Demand, Growth, and Breadth in Scientific Computing Training Delivered by a High-Performance Computing Center
Thomas Grützmacher
Karlsruhe Institute of Technology
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation
Ruidong GU
North Carolina State University
Floating-Point Autotuner for CPU-Based Mixed-Precision Applications
Yi Gu
Middle Tennessee State University
Energy-Aware Workflow Scheduling and Optimization in Clouds Using Bat Algorithm
Yizi Gu
Rice University
Dynamic Data Race Detection for OpenMP Programs
Hui Guan
North Carolina State University
Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines
Qiang Guan
Kent State University
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Aditya Gudibanda
Reservoir Labs Inc
Fast Detection of Elephant Flows with Dirichlet-Categorical Inference
Thaylon Guedes
Fluminense Federal University, Fluminense Federal University, Brazil
A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows
Shashank Gugnani
Ohio State University
Accelerating Big Data Processing in the Cloud with Scalable Communication and I/O Schemes
Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning
Vineet Gundecha
Dell EMC
Fast and Accurate Training of an AI Radiologist
Dan Gunter
Lawrence Berkeley National Laboratory
Flowzilla: A Methodology for Detecting Data Transfer Anomalies in Research Networks
Danhao Guo
Carnegie Mellon University
Scaling Embedded In Situ Indexing with DeltaFS
Fan Guo
Los Alamos National Laboratory
Scaling Embedded In Situ Indexing with DeltaFS
Luanzheng Guo
University of California, Merced
FlipTracker: Understanding Natural Error Resilience in HPC Applications
Yanfei Guo
Argonne National Laboratory
MPICH: A High Performance Open-Source MPI Implementation
Chin Guok
Lawrence Berkeley National Laboratory
Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences
SDN for End-to-End Networked Science at the Exascale (SENSE)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Anshul Gupta
IBM
PDC Curriculum Update
Prachi Gupta
Stony Brook University
Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling
Vijay Gupta
University of Notre Dame
Event-Triggered Communication in Parallel Computing
Sudhanva Gurumurthi
Advanced Micro Devices Inc
Challenges of High-Capacity DRAM Stacks and Potential Directions
John Gustafson
National University of Singapore
Open-Source Supercomputing
Julian Gutierrez
Northeastern University
Employing Student Retention Strategies for an Introductory GPU Programming Course
Optimization of an Image Processing Algorithm: Histogram Equalization
Samuel K. Gutiérrez
Los Alamos National Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Ethan D Gutmann
National Center for Atmospheric Research
Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model
Gregory S. Gutmann
Tokyo Institute of Technology
Deep Learning by Doing: Nvidia Deep Learning Institute
Attila Gyulassy
University of Utah
A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing
Return to Top
H
Roland Haas
University of Illinois
National Center for Supercomputing Applications
Programmable Interactive Visualization of a Core-Collapse Supernova Simulation
Ioan Hadade
Oxford Thermofluids Institute
University of Oxford
Software Prefetching for Unstructured Mesh Applications
Bilel Hadri
King Abdullah University of Science and Technology
Convergence between HPC and Big Data: The Day After Tomorrow
Frank Hady
Intel Corporation
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
MCHPC'18 Morning Keynote: Converging Storage and Memory
Georg Hager
University of Erlangen-Nuremberg
Erlangen Regional Computing Center
Applying the Execution-Cache-Memory Model: Current State of Practice
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures
Node-Level Performance Engineering
Christoph Hagleitner
IBM
Scalable FPGA Deployments for HPC and DC Applications
Application Porting and Optimization on GPU-Accelerated POWER Architectures
Azzam Haidar
University of Tennessee
Innovative Computing Laboratory
Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers
MATEDOR: MAtrix, TEnsor, and Deep-Learning Optimized Routines
Ammar Hakim
Princeton Plasma Physics Laboratory
Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method
Rafif Akila Hakim
Telkom University, Indonesia
Student Cluster Competition Team Panel Presentation
Mahantesh Halappanavar
Pacific Northwest National Laboratory
Adaptive Anonymization of Data with b-Edge Covers
HPC Graph Toolkits and the GraphBLAS Forum
miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems
Hassan Halawa
University of British Columbia
There Are Trillions of Little Forks in the Road: Choose Wisely! -- Estimating the Cost and Likelihood of Success of Constrained Walks to Optimize a Graph Pruning Pipeline
Mary Hall
University of Utah
A Renaissance for Domain-Specific Languages, Compilers and Code Generators for HPC and Big Data
Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks
Test of Time Award Presentation
Caleb Hamilton
Numerical Algorithms Group
The Business of HPC: TCO, Funding Models, Metrics, Value, and More
Procurement and Commissioning of HPC Systems
Kathleen E. Hamilton
Oak Ridge National Laboratory
Non-Neural Network Applications for Spiking Neuromorphic Hardware
Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation
Julian Hammer
University of Erlangen-Nuremberg
RRZE
OoO Instruction Benchmarking Framework on the Back of Dragons
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures
Dorit Hammerling
National Center for Atmospheric Research
A Statistical Analysis of Compressed Climate Model Data
Gregory Hammett
Princeton Plasma Physics Laboratory
Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method
Jeff Hammond
Intel Corporation
Learning to Lead in HPC - Strategies to Start Your Leadership Journey
Mentor-Protégé Informational Session
Evaluating the Impact of Proposed OpenMP 5.0 Features on Performance, Portability, and Productivity
Simon D. Hammond
Sandia National Laboratories
Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)
Exploring Allocation Policies in Disaggregated Non-Volatile Memories
Changnian Han
Stony Brook University
Machine Learning for Adaptive Discretization in Massive Multiscale Biomedical Modeling
Frank Han
Dell EMC
Deep Learning at Scale on Nvidia V100 Accelerators
Grace Han
Monash University
Student Cluster Competition Team Panel Presentation
Jingoo Han
Virginia Tech
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Toshihiro Hanawa
University of Tokyo
Energy Efficiency Considerations for HPC Procurements
David Hancock
Indiana University
HPC in Cloud or Cloud in HPC: Myths, Misconceptions and Misinformation
Bill Hanson
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Meiro Hao
Nanyang Technological University, Singapore
Student Cluster Competition Team Panel Presentation
Daniel Harborne
Cardiff University
Dynamic Distributed Orchestration of Node-RED IOT Workflows Using a Vector Symbolic Architecture
Paul Hargrove
Lawrence Berkeley National Laboratory
Doomsday: Predicting Which Node Will Fail When on Supercomputers
GASNet-EX Performance Improvements Due to Specialization for the Cray Aries Network
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Siva Kumar Sastry Hari
Nvidia Corporation
Optimizing Software-Directed Instruction Replication for GPU Error Detection
Kevin Harms
Argonne National Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Characterization of MPI Usage on a Production Supercomputer
Stephen Lien Harrell
Purdue University
Open Panel: Automating Artifact Sharing, Evaluation, and Reuse
Introduction - HPC Systems Professionals Workshop (HPCSYSPROS18)
Effective Performance Portability
Plasma Meets Portability: A Journey to Performance Portability in a Particle-in-Cell Code
Christopher Harrison
University of Wisconsin
Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)
Cyrus Harrison
Lawrence Livermore National Laboratory
Enabling Data Analytics Workflows Using Node-Local Storage
A Flexible System For In Situ Triggers
Robert S. Hart
Global Good
Institute for Disease Modeling at Intellectual Ventures
HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges
Rebecca Hartman-Baker
Lawrence Berkeley National Laboratory
Students@SC: Making the Best of Your HPC Education
Women in HPC: the Importance of Male Allies
The HPC Best Practices Webinar Series
HPC Workflow
Bill Hartner
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Christine Harvey
MITRE Corporation
Volunteer Opportunities for SC Conference Planning
Niranjan Hasabnis
Intel Corporation
Auto-Tuning TensorFlow Threading Model for CPU Backend
J. Hashmi
Ohio State University
Cooperative Rendezvous Protocols for Improved Performance and Overlap
Designing Shared Address Space MPI Libraries in Many-Core Era
Jon Hass
Dell Inc
Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective
RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish
Out-of-Band (BMC based) Data Center Monitoring DMTF Redfish API Integration with Nagios
Hiroyasu Hasumi
University of Tokyo
Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers
Kazuma Hatta
Imagica Digitalscape
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Carina Haupt
German Aerospace Center
Software Engineering and Reuse in Computational Science and Engineering
Akihiro Hayashi
Rice University
A Unified Runtime for PGAS and Event-Driven Programming
Kengo Hayashi
Kobe University
RIKEN
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Damian Hazen
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Conghui He
Tsinghua University
National Supercomputing Center, Wuxi
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Shuibing He
Wuhan University
Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach
Siyu He
Carnegie Mellon University
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Yun (Helen) He
Lawrence Berkeley National Laboratory
OpenMP Common Core: a “Hands-On” Exploration
Ronnie Hedgepeth
US Department of Defense HPC Modernization Program
HPC in the DoD
Patrick Heimbach
University of Texas
Institute for Computational Engineering and Sciences
Arctic Ocean-Sea Ice Interactions
Alexander Heinecke
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Tensorfolding: Improving Convolutional Neural Network Performance with Fused Microkernels
Tensor-Optimized Hardware Accelerates Fused Discontinuous Galerkin Simulations
Thomas Heinis
Imperial College, London
Large-Scale Clustering Using MPI-Based Canopy
Wim Heirman
Intel Corporation
Many-Core Graph Workload Analysis
Thomas Heller
Louisiana State University
Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)
Matthew Henderson
Lawrence Berkeley National Laboratory
Interactive HPC Deep Learning with Jupyter Notebooks
Bruce Hendrickson
Lawrence Livermore National Laboratory
Students@SC Keynote: Livin’ on the Edge: Thoughts on Careers in High Performance Computing
Greg Henry
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Robert Henschel
Indiana University
OpenACC API User Experience, Vendor Reaction, Relevance, and Roadmap
David Henty
University of Edinburgh
The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training
Tejun Heo
Facebook
Invited Talk: Resource Control at Facebook
Thomas Herault
University of Tennessee
Fault-Tolerance for High Performance and Distributed Computing: Theory and Practice
Randy Herban
Microsoft Corporation
HPC in the Cloud
Stephen Herbein
Lawrence Livermore National Laboratory
Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Introduction of Practical Approaches to Data Analytics for HPC with Spark
Martin Herbordt
Boston University
A Novel Approach to Supporting Communicators for In-Switch Processing of MPI Collectives
Energy Efficiency of Reconfigurable Caches on FPGAs
Binarized ImageNet Inference in 29us
Benchmarking Scientific Reconfigurable / FPGA Computing
SimBSP: Enabling RTL Simulation for Intel FPGA OpenCL Kernels
J. A. Herdman
Atomic Weapons Establishment (AWE), UK
Performance Portability of an Unstructured Hydrodynamics Mini-Application
Pawel Herman
KTH Royal Institute of Technology
Characterizing Deep-Learning I/O Workloads in TensorFlow
Marc-André Hermanns
Forschungszentrum Juelich
Visual Analytics Challenges in Analyzing Calling Context Trees
Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)
Oscar Hernandez
Oak Ridge National Laboratory
OpenSHMEM in the Era of Exascale
Christian Herold
Technical University Dresden
Top-Down Performance Analysis of Workflow Applications
Michael A. Heroux
Sandia National Laboratories
Keynote
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
Open Panel: Automating Artifact Sharing, Evaluation, and Reuse
Software Engineering and Reuse in Computational Science and Engineering
HPCG Benchmark Update
Navigating the SC Conference Technical Program Submission Process
Better Scientific Software
Andreas Herten
Juelich Supercomputing Centre
Application Porting and Optimization on GPU-Accelerated POWER Architectures
Mary Hester
SURFnet
Introduction - Innovating the Network for Data Intensive Science (INDIS)
Elisa Heymann
University of Wisconsin
Secure Coding Practices and Automated Assessment Tools
Students@SC: Making the Best of Your HPC Education
Jason Hick
Los Alamos National Laboratory
Energy Efficiency Considerations for HPC Procurements
Joshua Higgins
University of Huddersfield
Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks
Nicholas Higham
University of Manchester
School of Mathematics
Harnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers
Christopher Hill
Massachusetts Institute of Technology
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Joseph Hill
University of Amsterdam
Tracking Network Flows with P4
Elizabett Hillery
Purdue University
Best Practices from Organizations on Improving Workplace Diversity
Alex Himmel
Fermi National Accelerator Laboratory
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Kai Himstedt
University of Hamburg
Toward a HPC Certification Program
Naveen Himthani
University of Texas
Institute for Computational Engineering and Sciences
GPU-Accelerated Interpolation for 3D Image Registration
Jacob Hinkle
Oak Ridge National Laboratory
HPC-Based Hyperparameter Search of MT-CNN for Information Extraction from Cancer Pathology Reports
Kei Hiraki
University of Tokyo
Pros and Cons of HPCx benchmarks
Jeffrey Hittinger
Lawrence Livermore National Laboratory
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
Shirley Ho
Lawrence Berkeley National Laboratory
Carnegie Mellon University
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Kristin Hoch
Los Alamos National Laboratory
The First Water in the Universe
Torsten Hoefler
ETH Zurich
ExaMPI Keynote
Communication with the Reader
Reconfigurable Computing for HPC: Will It Make It this Time?
Introduction - Fourth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC'18)
High Level Programming Languages for Quantum Computation
Deep500: An HPC Deep Learning Benchmark and Competition
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds
Productive Parallel Programming for FPGA with High-Level Synthesis
Advanced MPI Programming
Henry Hoffmann
University of Chicago
A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints
Hybrid Quantum-Classical Computing Architectures
Johannes Hofmann
University of Erlangen-Nuremberg
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures
Steven Hofmeyr
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Mark Hogan
SLAC National Accelerator Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Markus Höhnerbach
RWTH Aachen University
PotC: Many-Body Potential Implementations à La Carte
Elham Hojati
Texas Tech University
RGB (Redfish Green500 Benchmarker): A Green500 Benchmarking Tool Using Redfish
Jeffrey K. Hollingsworth
University of Maryland
Career Development Panel
David Hollman
Sandia National Laboratories
Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications
Daniel Holmes
University of Edinburgh
Heterogeneous Systems and the Road to Exascale for HPC and AI
Introduction - Workshop on Exascale MPI (ExaMPI)
Workshop on Exascale MPI (ExaMPI)
Violeta Holmes
University of Huddersfield
Rapid Deployment of Bare-Metal and In-Container HPC Clusters Using OpenHPC playbooks
Carissa Holohan
Argonne National Laboratory
Hot Topics Discussion II: Thriving at Work
Spectral Analysis: Building an LGBTQIA+ Community in Scientific Computing
Aaron Holt
University of Colorado
Containers, Collaboration, and Community: Hands-On Building a Data Science Environment for Users and Admins
Burt Holzman
Fermi National Accelerator Laboratory
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Sungwook Hong
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Valentin Honore
University of Bordeaux
French Institute for Research in Computer Science and Automation (INRIA)
Scheduling for In-machine Analytics: Data Size Is Important
Hans-Christian Hoppe
Intel Corporation
Multi-Level Memory and Storage for HPC and Data Analytics
Muneo Hori
University of Tokyo
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Takane Hori
Japan Agency for Marine-Earth Science and Technology
Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation
Julian Hornich
University of Erlangen-Nuremberg
Erlangen Regional Computing Center
Applying the Execution-Cache-Memory Model: Current State of Practice
Kenneth Hoste
Ghent University
Getting Scientific Software Installed
Aiqin Hou
Northwest University, China
Bandwidth Scheduling for Big Data Transfer with Deadline Constraint between Data Centers
Optimizing the Throughput of Storm-Based Stream Processing in Clouds
Kai-Yuan Hou
Northwestern University
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
A Study on Checkpoints Compression for Adjoint Computation
Michael Houston
Nvidia Corporation
Introduction - Machine Learning in HPC Environments
Exascale Deep Learning for Climate Analytics
Paul Hovland
Argonne National Laboratory
A Study on Checkpoints Compression for Adjoint Computation
Abigail Hsu
Stony Brook University
Los Alamos National Laboratory
Challenges of Performance Portability for Fortran Unstructured Mesh Codes
Effective Performance Portability
Performance Portability Challenges for Fortran Applications
Yang Hu
George Washington University
TriCore: Parallel Triangle Counting on GPUs
H. Howie Huang
George Washington University
iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees
TriCore: Parallel Triangle Counting on GPUs
IA^3 Debate
Algorithms on Sparse Data
Hai Huang
IBM
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Hua Huang
Georgia Institute of Technology
Accelerating Quantum Chemistry with Vectorized and Batched Integrals
Jeff Huang
Texas A&M University
Incremental Static Race Detection in OpenMP Programs
Lei Huang
University of Texas
OOOPS: An Innovative Tool for IO Workload Management on Supercomputers
Renfei Huang
Hong Kong University of Science and Technology
SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition
Nathanael Hübbe
University of Hamburg
Toward a HPC Certification Program
Herbert Huber
Leibniz Supercomputing Centre
The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification
Energy Efficiency Considerations for HPC Procurements
Pierre Huchant
French Institute for Research in Computer Science and Automation (INRIA)
PARCOACH Extension for a Full-Interprocedural Collectives Verification
Alexander Hück
Technical University Darmstadt
Scientific Computing
Compiler-Aided Type Tracking for Correctness Checking of MPI Applications
Kevin Huck
University of Oregon
Using Deep Learning for Automated Communication Pattern Characterization: Little Steps and Big Challenges
Asynchronous Execution of Python Code on Task Based Runtime Systems
Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)
Enabling Data Services for HPC
Stephen Hudson
Argonne National Laboratory
Keynote: Better Scientific Software (BSSw)
Clayton Hughes
Sandia National Laboratories
Exploring Allocation Policies in Disaggregated Non-Volatile Memories
Kevin Hughes
Cray Inc
Industry Panel: Data-Center Automation, Analytics, and Control from an Industry Perspective
Yawei Hui
Oak Ridge National Laboratory
A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform
Martin Hull
Arista Networks
The Difference Between HPC on Premises and in the Cloud
Christian Hundt
Johannes Gutenberg University Mainz
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Sascha Hunold
Technical University Wien
Algorithm Selection of MPI Collectives Using Machine Learning Techniques
Wendy Huntoon
Keystone Initiative for Network Based Education and Research
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Ibrahim Hur
Intel Corporation
Many-Core Graph Workload Analysis
Mark Hur
Micron Technology Inc
Accelerate Machine Learning with High Performance Memory
Joshua Hursey
IBM
PMIx: Enabling Workflow Orchestration
Zaeem Hussain
University of Pittsburgh
Partial Redundancy in HPC Systems with Non-Uniform Node Reliabilities
Lance Hutchinson
Sandia National Laboratories
INDIS Showcases Panel: NRE and XNET and Architecture
Networking
Wen-mei Hwu
University of Illinois
MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines
Return to Top
I
Roman Iakymchuk
KTH Royal Institute of Technology
Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows
Costin Iancu
Lawrence Berkeley National Laboratory
Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI
Quantum Computing for Scientific Applications
Tsuyoshi Ichimura
University of Tokyo
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Yasuhiro Idomura
Japan Atomic Energy Agency
Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers
Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations
Mike Ignatowski
Advanced Micro Devices Inc
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
Challenges of High-Capacity DRAM Stacks and Potential Directions
Masashi Ikuta
NEC Corporation
Next Generation Vector Supercomputer
Aleksandar Ilic
INESC-ID, Portugal
Performance Tuning of Scientific Codes with the Roofline Model
Shams Imam
Two Sigma Investments LP
A One Year Retrospective on a MOOC in Parallel, Concurrent, and Distributed Programming in Java
Toshiyuki Imamura
RIKEN
Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations
Takuya Ina
Japan Atomic Energy Agency
Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations
Frank Indiviglio
National Oceanic and Atmospheric Administration
Managing Python in HPC Environments
Martins D. Innus
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Joseph A. Insley
Argonne National Laboratory
Northern Illinois University
Visualizing Outbursts of Massive Stars
libIS: A Lightweight Library for Flexible In Transit Visualization
Latchesar Ionkov
Los Alamos National Laboratory
Heterogeneous Memory and Arena-Based Heap Allocation
Bertrand Iooss
EDF Research and Development
Large Scale Computation of Quantiles Using MELISSA
Alexandru Iosup
Vrije University Amsterdam
Delft University of Technology
A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments
Keith Irwin
Winston-Salem State University
An Alternative Approach to Teaching Bigdata and Cloud Computing Topics at CS Undergraduate Level
Katherine E. Isaacs
University of Arizona
Introduction - Fifth International Workshop on Visual Performance Analysis (VPA 18)
Youhei Ishihara
Kyoto University
Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems
Yutaka Ishikawa
RIKEN
On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
Tanzima Islam
Western Washington University
Automatic Generation of Mixed-Precision Programs
Yoko Isobe
Tohoku University
NEC Corporation
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
Tsuyoshi Ito
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Shintaro Iwasaki
University of Tokyo
Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading
Chander J. Iyer
Rensselaer Polytechnic Institute
Yahoo! Research
Iterative Randomized Algorithms for Low Rank Approximation of Terascale Matrices with Small Spectral Gaps
Return to Top
J
Christiane Jablonowski
University of Michigan
Parallel Computing 101
Bruce Jacob
University of Maryland
MCHPC'18 Panel: Research Challenges in Memory-Centric Computing
MCHPC'18 Afternoon Keynote: All Tomorrow’s Memory Systems
Sam Ade Jacobs
Lawrence Livermore National Laboratory
Scalable Deep Ensemble Learning for Cancer Drug Discovery
Doug Jacobsen
Intel Corporation
Effective Performance Portability
Daniel Jacobson
Oak Ridge National Laboratory
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
Mathias Jacquelin
Lawrence Berkeley National Laboratory
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Nikhil Jain
Lawrence Livermore National Laboratory
Evaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing
Surabhi Jain
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
William Jalby
Versailles Saint-Quentin-en-Yvelines University
Welcome and Introduction - 7th Workshop on Extreme-Scale Programming Tools (ESPT)
Siddhartha Jana
Intel Corporation
HPC PowerStack: a community-wide open collaboration for enabling system-wide power efficiency
Introduction - Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
Ninth Annual Workshop for the Energy Efficient HPC Working Group (EE HPC WG)
Branislav Jansik
IT4Innovations, Czech Republic
Technical University of Ostrava, Czech Republic
Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler
Jiri Jaros
Brno University of Technology
Faculty of Information Technology
Optimization of Ultrasound Simulations on Multi-GPU Servers
Stephen A. Jarvis
University of Warwick
Introduction - The 9th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems (PMBS18)
Optimizing Machine Learning on Apache Spark in HPC Environments
Performance Portability of an Unstructured Hydrodynamics Mini-Application
Ali Javadi-Abhari
IBM
Quantum Computing for Scientific Applications
Nina Jeliazkova
IDEAconsult Ltd, Bulgaria
HPC-as-a-Service for Life Sciences
Bohumir Jelinek
Mississippi State University
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Louis Jenkins
Pacific Northwest National Laboratory
Chapel Aggregation Library (CAL)
Michael Jennings
Los Alamos National Laboratory
Containers in HPC
Cloud Infrastructure Solutions To Run HPC Workloads
Grzegorz Jereczek
Intel Corporation
DAQDB - a Distributed Key-Value Store for Petascale Hot Storage
Elizabeth Jessup
University of Colorado
Invited Talk Session 1
Moe Jette
SchedMD LLC
A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management
SLURM User Group Meeting
Shantenu Jha
Brookhaven National Laboratory, Rutgers University
Clouds and Distributed Computing
Yuede Ji
George Washington University
iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees
Ming Jiang
Lawrence Livermore National Laboratory
Enabling Data Analytics Workflows Using Node-Local Storage
Nan Jiang
Nvidia Corporation
Exploiting Idle Resources in a High-Radix Switch for Supplemental Storage
Yan-Fei Jiang
University of California, Santa Barbara
Visualizing Outbursts of Massive Stars
Zach Jibben
Los Alamos National Laboratory
Performance Portability Challenges for Fortran Applications
Ivo Jimenez
University of California, Santa Cruz
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
Using Thrill to Process Scientific Data on HPC
Spotting Black Swans With Ease: The Case for a Practical Reproducibility Platform
Semantically Organized Containers for Reproducible Research
Chao Jin
University of Queensland
Energy Efficiency Modeling of Parallel Applications
Charles C. Jin
Reservoir Labs Inc
Analysis of Explicit vs. Implicit Tasking in OpenMP Using Kripke
Lingling jin
Alibaba Inc
AI Matrix – Synthetic Benchmarks for DNN
Vanessa Job
Los Alamos National Laboratory
Improving Application Resilience by Extending Error Correction with Contextual Information
Hans Johansen
Lawrence Berkeley National Laboratory
Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks
Mikael Johansson
KTH Royal Institute of Technology
Distributed L-Shaped Algorithms in Julia
Calvin Johnson
San Diego State University
Improving MPI Reduction Performance for Manycore Architectures with OpenMP and Data Compression
Chris Johnson
University of Utah
The Age of Data - Visualizing the Revolution
Christopher Johnson
University of Utah
Learning to Lead in HPC - Strategies to Start Your Leadership Journey
Daniel Johnson
Mississippi State University
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Beau Johnston
Australian National University
AIWC: OpenCL-Based Architecture Independent Workload Characterization
Bryan Johnston
Centre for High Performance Computing, South Africa
Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Strategies for Inclusive and Scalable HPC Outreach and Education
J. Travis Johnston
Oak Ridge National Laboratory
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Introduction of Practical Approaches to Data Analytics for HPC with Spark
David Joiner
Kean University
The Wave Equation as a Motivating Example for High Performance Computing
Ana Jokanovic
Barcelona Supercomputing Center
Evaluating SLURM Simulator with Real-Machine SLURM and Vice Versa
Andrew Jones
Numerical Algorithms Group
The Business of HPC: TCO, Funding Models, Metrics, Value, and More
Procurement and Commissioning of HPC Systems
Catherine Jones
Science and Technology Facilities Council, UK
UK Research and Innovation
Software Engineers: Careers in Research
Matthew D. Jones
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Timothy M. Jones
University of Cambridge
Software Prefetching for Unstructured Mesh Applications
William M. Jones
Coastal Carolina University
Improving Application Resilience by Extending Error Correction with Contextual Information
Bálint Joó
Thomas Jefferson National Accelerator Facility
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Kirk E. Jordan
IBM
Astrophysics Applications
Wayne Joubert
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
Guido Juckeland
Helmholtz-Zentrum Dresden-Rossendorf
5th Workshop on Accelerator Programming Using Directives (WACCPD): Closing Remarks
Introduction - Fifth Workshop on Accelerator Programming Using Directives (WACCPD)
Fifth Workshop on Accelerator Programming Using Directives (WACCPD)
Brenden Judson
University of Notre Dame
Compliant Cloud+Campus Hybrid HPC Infrastructure
Luann C. Jung
Massachusetts Institute of Technology
University of Chicago
Measuring Swampiness: Quantifying Chaos in Large Heterogeneous Data Repositories
Christoph Junghans
Los Alamos National Laboratory
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
James Juno
University of Maryland
Kinetic Simulations of Plasma Turbulence Using the Discontinuous Galerkin Finite Element Method
Eulerian Algorithms for the Discretization of Plasma Kinetic Equations
Amy Justice
Yale University
US Department of Veterans Affairs
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
Return to Top
K
Mozhgan Kabiri Chimeh
University of Sheffield
FLAME GPU: Complex System Simulation Framework
Developing Workplace Resilience and Managing Stress
David Kaeli
Northeastern University
PRISM: Predicting Resilience of GPU Applications Using Statistical Methods
Employing Student Retention Strategies for an Introductory GPU Programming Course
Optimization of an Image Processing Algorithm: Histogram Equalization
David Kahaner
Asian Technology Information Program
Welcome and Introduction
Introduction - 2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development
2nd ATIP Workshop on International Next-Generation Computing Programs and Workforce Development
Jim Kahle
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
David Kainer
Oak Ridge National Laboratory
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction
Hartmut Kaiser
Louisiana State University
Runtime for Exascale and Beyond: Convergence or Divergence?
Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX)
Asynchronous Execution of Python Code on Task Based Runtime Systems
Dhiraj Kalamkar
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Laxmikant Kalé
University of Illinois
Charmworks Inc
Charm++ and AMPI: Adaptive and Asynchronous Parallel Programming
Laxmikant (Sanjay) Kale
University of Illinois
Parallel Programming Models for the Extreme Scale Era
Exascale Challenges in Across-Node Parallelism for Languages and Runtimes
Runtime for Exascale and Beyond: Convergence or Divergence?
Rashid Kaleem
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
Rajiv K. Kalia
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Sergei V. Kalinin
Oak Ridge National Laboratory
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Kristy A. Kallback-Rose
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Charu Kalra
Northeastern University
PRISM: Predicting Resilience of GPU Applications Using Statistical Methods
Ananth Kalyanaraman
Washington State University
Scalable Methods for Genome Assembly
miniVite: A Graph Analytics Benchmarking Tool for Massively Parallel Systems
Yasushi Kamata
Railway Technical Research Institute, Japan
Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis
Vinod Kamath
Lenovo
Taming Datacenter Thermodynamics with Lenovo Neptune Technology
Amir Kamil
Lawrence Berkeley National Laboratory
UPC++ and GASNet-EX: PGAS Support for Exascale Applications and Runtimes
Shoaib Kamil
Adobe Research
ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism
Yoshito Kanamori
University of Alaska, Anchorage
Stochastic Computing on Quantum Gates
Joshua Kane
Idaho National Laboratory
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Qiao Kang
Northwestern University
Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies
Ramaseshan Kannan
Arup UK
Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices
Krishna Kant
Temple University
PDC Curriculum Update
Roman Kaplan
Israel Institute of Technology
In-Memory Accelerator Architectures for Machine Learning and Bioinformatics
Processing-in-Storage Architecture for Machine Learning and Bioinformatics
Accelerating DNA Long Read Mapping with Emerging Technologies
Mariia Karabin
Clemson University
Los Alamos National Laboratory
Using Thrill to Process Scientific Data on HPC
Vasileios Karakasis
Swiss National Supercomputing Centre
ReFrame: A Regression Testing and Continuous Integration Framework for HPC systems
Sara Karamati
Georgia Institute of Technology
Modeling Single-Source Shortest Path Algorithm Dynamics to Control Performance and Power Tradeoffs
Sagar Karandikar
University of California, Berkeley
Panel: Open-Source Hardware
FireSim: FPGA-Accelerated Cycle-Exact Scale-Out System Simulation in the Public Cloud
Deepthi Karkada
Intel Corporation
Training Speech Recognition Models on HPC Infrastructure
Ian Karlin
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Exploring Application Performance on Fat-Tree Networks in the Presence of Congestion
Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free
Sven Karlsson
Technical University of Denmark
Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)
Tuomas Karna
Intel Corporation
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Thomas P. Karnowski
Oak Ridge National Laboratory
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Junichi Kato
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Daniel S. Katz
National Center for Supercomputing Applications
University of Illinois
Understanding Software Sustainability: Learning from Parsl and Other Projects
Sustaining Research Software
Software Engineering and Reuse in Computational Science and Engineering
Ohad Katz
State University of New York at Buffalo
Studying Effects of Meltdown and Spectre Patches on the Performance of HPC Applications Using Application Kernel Module of XDMoD
Christos Kavouklis
Lawrence Livermore National Laboratory
A Low-Communicaton Method to Solve Poisson's Equation on Locally-Structured Grids
Kenji Kawai
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Tomohiro Kawanabe
Riken Center for Computational Science
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Engin Kayraklioglu
George Washington University
Productive Data Locality Optimizations in Distributed Memory
Yoshii Kazutomo
Argonne National Laboratory
Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing
Kate Keahey
Argonne National Laboratory
Dynamically Negotiating Capacity Between On-Demand and Batch Clusters
INDIS Afternoon Keynote
Reproducibility as Side Effect
Stephen W. Keckler
Nvidia Corporation
Optimizing Software-Directed Instruction Replication for GPU Error Detection
Christopher Keefe
Northern Arizona University
Pathogen and Microbiome Institute
Enabling Reproducible Microbiome Science through Decentralized Provenance Tracking in QIIME 2
Kimberly Keeton
Hewlett Packard Enterprise
Panel Discussion
Paul Kefer
Wake Forest University
Student Cluster Competition Team Panel Presentation
Kai Keller
Barcelona Supercomputing Center
Toward Ad Hoc Recovery For Soft Errors
Nicholas Kelly
University of Texas
Evaluating and Accelerating High-Fidelity Error Injection for HPC
Sean Kelly
Jet Propulsion Laboratory
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Alison Kennedy
Hartree Centre
Keynote 2: HPC and AI as Drivers for Industrial Engagement
Eimear Kenny
Icahn School of Medicine at Mount Sinai
Population Genetics and Computation in the Area of Precision Medicine
Garrett Kenyon
Los Alamos National Laboratory
Comparing Deep Learning with Quantum Inference on The D-Wave 2X
Ronan Keryell
Xilinx Inc
Reconfigurable Computing for HPC: Will It Make It this Time?
Gokcen Kestor
Oak Ridge National Laboratory
Characterization of the Impact of Soft Errors on Iterative Methods
Rajkumar Kettimuthu
Argonne National Laboratory
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Janis Keuper
Fraunhofer Institute for Industrial Mathematics
Introduction - Machine Learning in HPC Environments
Kurt Keville
Massachusetts Institute of Technology
xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory
Robert Kevis
Atomic Weapons Establishment (AWE), UK
Performance Portability of an Unstructured Hydrodynamics Mini-Application
David Keyes
King Abdullah University of Science and Technology
Panel 2: Arabia's Leap into the Cyber Era
Keynote 3: Hierarchical Algorithms on Hierarchical Architectures
Walid Keyrouz
National Institute of Standards and Technology
Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)
Computational Reproducibility at Exascale 2018 (CRE2018)
M. Garda Khadafi
Telkom University, Indonesia
Student Cluster Competition Team Panel Presentation
Arif Khan
Pacific Northwest National Laboratory
Adaptive Anonymization of Data with b-Edge Covers
Hafiz Khan
Texas Tech University
Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs
Alireza Kheirkhahan
Louisiana State University
Asynchronous Execution of Python Code on Task Based Runtime Systems
Harsh Khetawat
North Carolina State University
Using Darshan and CODES to Evaluate Application I/O Performance
Warren Kibbe
Duke University School of Medicine
The Role of Computing in Predictive and Precision Oncology
Bob Killen
University of Michigan
Cloud Infrastructure Solutions To Run HPC Workloads
Introduction to Kubernetes
Heesoo Kim
Brown University
Effective Performance Portability
Jim Kim
Korea Advanced Institute of Science and Technology
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Jungwon Kim
Oak Ridge National Laboratory
OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing
Implementing Efficient Data Compression and Encryption in a Persistent Key-Value Store for HPC
Haklin Kimm
East Stroudsburg University of Pennsylvania
Introducing Three Basic Concepts in Parallel Computation to 1st Year Computer Science Students in a Simple and Effective Way
Yasuyuki Kimura
ExaScaler Inc
Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems
Heather Kincaid
Jet Propulsion Laboratory
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Volodymyr Kindratenko
University of Illinois
National Center for Supercomputing Applications
Designing and Building Next-Generation Computer Systems for Deep Learning
Michael Kinsner
Intel Corporation
Reconfigurable Computing for HPC: Will It Make It this Time?
Mariam Kiran
Energy Sciences Network (ESnet)
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Lara Kisielewska
Xand Marketing
The Difference Between HPC on Premises and in the Cloud
Joy Kitson
University of Delaware
Los Alamos National Laboratory
Plasma Meets Portability: A Journey to Performance Portability in a Particle-in-Cell Code
Effective Performance Portability
Scott Klasky
Oak Ridge National Laboratory
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows
Feature-Relevant Data Reduction for In Situ Workflows
Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)
High Performance I/O Frameworks 101
Kerstin Kleese van Dam
Brookhaven National Laboratory
Career Development Panel
Michael Klemm
Intel Corporation
Mastering Tasking with OpenMP
Advanced OpenMP: Host Performance and 5.0 Features
OpenMP API Version 5.0 - Getting Ready for Exascale
OpenMP® 5.0 Is Here: Find Out All the Things You Need to Know About It!
Gary Klimowicz
Nvidia Corporation
OpenMP GPU Offload in Flang and LLVM
Tobias Klöffel
University of Erlangen-Nuremberg
Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures
Rich Knepper
Cornell University
Upcoming Events in the HPC Systems Professionals Community
Programmable Education Infrastructure: Cloud Resources as HPC Education Environments
Christian Kniep
Docker Inc
Cloud Infrastructure Solutions To Run HPC Workloads
Christopher Knight
Argonne National Laboratory
Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations
Hiroaki Kobayashi
Tohoku University
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
Greg Koenig
Energy Efficient HPC Working Group
A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
Peter Kogge
University of Notre Dame
17th Graph500 List
Revisiting the 2008 ExaScale Computing Study and Venturing Predictions for 2028
Hidetaka Koie
National Institute of Advanced Industrial Science and Technology (AIST)
FlowOS-RM: Disaggregated Resource Management System
Chandra Kolla
Texas State University
High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem
Hemanth Kolla
Sandia National Laboratories
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows
Chaitanya Kolluru
Case Western Reserve University
Convolutional Neural Networks for Coronary Plaque Classification in Intravascular Optical Coherence Tomography (IVOCT) Images
Yuri Kolomiyets
Corsa Technology Inc
100G SSL/TLS Decryption Is Indeed Possible for High Capacity Links
Kazuhiko Komatsu
Tohoku University
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
Vamsee Reddy Kommareddy
University of Central Florida
Exploring Allocation Policies in Disaggregated Non-Volatile Memories
Masaaki Kondo
University of Tokyo
Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack
Ivan Kondov
Karlsruhe Institute of Technology
The NAStJA Framework: Non-Collective Scalable Global Communications
Non-Collective Scalable Global Network Based on Local Communications
Fande Kong
Idaho National Laboratory
A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations
Alice Koniges
University of Hawaii
Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations
OpenMP Common Core: a “Hands-On” Exploration
Joseph Koning
Lawrence Livermore National Laboratory
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Anton Korzh
Micron Technology Inc
17th Graph500 List
Sokol Kosta
Aalborg University, Copenhagen
DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything
Doug Kothe
Oak Ridge National Laboratory
Delivering on the Exascale Computing Project Mission for the US Department of Energy
Patricia Kovatch
Icahn School of Medicine at Mount Sinai
Panel Discussion on Currents Trends, Needs, and Bottlenecks in Computational Human Phenomics
Introduction - Computational Phenomics @Scale: From Supercomputers to Bedside
Introduction – Fourth Computational Approaches for Cancer Workshop (CAFCW18)
Impacting Cancer with HPC: Opportunities and Challenges
James Kowalkowski
Fermi National Accelerator Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Data-Parallel Python for High Energy Physics Analyses
Quincey Koziol
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Anycast: Rootless Broadcasting with MPI
HDF5: I/O Middleware and Ecosystem for HPC and Experimental and Observational Sciences
Matthew S. Krafczyk
University of Illinois
Assessing Reproducibility: An Astrophysical Example of Computational Uncertainty in the HPC Context
William T. Kramer
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
Michal Kravcenko
Technical University of Ostrava, Czech Republic
Distributed Fast Boundary Element Methods
Nathaniel Kremer-Herman
University of Notre Dame
A Lightweight Model for Right-Sizing Master-Worker Applications
Reduction of Workflow Resource Consumption Using a Density-based Clustering Model
Christopher D. Krieger
Laboratory for Physical Sciences at University of Maryland
Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture
Aravind Krishnamoorthy
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Sriram Krishnamoorthy
Pacific Northwest National Laboratory
Characterization of the Impact of Soft Errors on Iterative Methods
HPC Software Verification in Action: A Case Study with Tensor Transposition
MPI Optimization and Characterization
Vandhana Krishnan
Stanford University
Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud
Bryce Kroencke
American River College
OpeNNdd: Open Neural Networks for Drug Discovery: Creating Free and Easy Methods for Designing Medicine
Martin Kronbichler
Technical University Munich
Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?
Jay Kruemcke
SUSE
Cloud Infrastructure Solutions To Run HPC Workloads
Michael Kruse
Argonne National Laboratory
Argonne Leadership Computing Facility
User-Directed Loop-Transformations in Clang
Bon Woong Ku
Georgia Institute of Technology
Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation
Vladyslav Kucher
University of Münster
Unified Cross-Platform Profiling of Parallel C++ Applications
Andrey Kudryavtsev
Intel Corporation
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Jeff Kuehn
Los Alamos National Laboratory
OpenSHMEM in the Era of Exascale
Unified Communication X (UCX) Community
Mohammad Amin Kuhail
University of Missouri, Kansas City
Lessons from Integrating Parallelism into Undergraduate Curriculum at UMKC
Alexander Kuhn
Nvidia Corporation
Programmable Interactive Visualization of a Core-Collapse Supernova Simulation
Michael Kuhn
University of Hamburg
Toward a HPC Certification Program
Navjot Kukreja
Imperial College, London
A Study on Checkpoints Compression for Adjoint Computation
Igor Kulikov
Institute of Computational Mathematics and Mathematical Geophysics SB RAS
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Bipin Kumar
Indian Institute of Tropical Meteorology
Visualization of Droplet Dynamics in Cloud Turbulence
Nalini Kumar
Intel Corporation
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Kalyan Kumaran
Argonne National Laboratory
Characterization of MPI Usage on a Production Supercomputer
Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications
Manaschai Kunaseth
National Science and Technology Development Agency, Thailand
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Sukhamay Kundu
Louisiana State University
Introducing Three Basic Concepts in Parallel Computation to 1st Year Computer Science Students in a Simple and Effective Way
Julian Kunkel
University of Reading
International HPC Certification Program
The IO-500 and the Virtual Institute of I/O
Analyzing Parallel I/O
Toward a HPC Certification Program
Toward Understanding I/O Behavior in HPC Workflows
Tahsin Kurc
Stony Brook University
Feature-Relevant Data Reduction for In Situ Workflows
Thorsten Kurth
Lawrence Berkeley National Laboratory
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Exascale Deep Learning for Climate Analytics
Deep Learning at Scale
A Case Study for Performance Portability Using OpenMP 4.5
Gregory Kurtzer
Sylabs
The Difference Between HPC on Premises and in the Cloud
Yipkei Kwok
Missouri Western State University
CV Review
CV Review and Career Development Panel
Return to Top
L
Jesus Labarta
Barcelona Supercomputing Center
Polytechnic University of Catalonia
Compiler and Runtime Based Parallelization and Optimization for GPUs
Lev Lafayette
University of Melbourne
Toward a HPC Certification Program
Ignacio Laguna
Lawrence Livermore National Laboratory
FlipTracker: Understanding Natural Error Resilience in HPC Applications
Introduction - 2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)
2nd International Workshop on Software Correctness for HPC Applications (Correctness 2018)
Sumathi Lakshmiranganatha
University of Wyoming
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
Michael O. Lam
James Madison University
Lawrence Livermore National Laboratory
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
Facilitating the Adoption of Correctness Tools in HPC Applications
Automatic Generation of Mixed-Precision Programs
Jacob Lambert
University of Oregon
OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing
David Lambiaso
Environmental Systems Design (ESD)
High Performance Computing (HPC) Data Center Planning and TCO: A Case Study and Roadmap
Haidong Lan
Shandong University
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Sandy Landsberg
US Department of Defense HPC Modernization Program
What the Heck Is HEC?
HPC in the DoD
Michael Lang
Los Alamos National Laboratory
Heterogeneous Memory and Arena-Based Heap Allocation
Thomas Lange
retired
Students@SC Keynote: The Computing Hidden in Everyday Things
Akhil Langer
Intel Corporation
Framework for Scalable Intra-Node Collective Operations Using Shared Memory
Kristi Lanier
Cray Inc
The Power of Storytelling: Exposing User Experiences and Lessons Learned to Inspire and Instruct Technology Adoption
Jeff Larkin
Nvidia Corporation
Session 3: Using OpenMP
James Laros
Sandia National Laboratories
Advanced Architecture Testbeds: A Catalyst for Co-design Collaborations
Power API and Redfish: Standardizing Power Measurement and Control for HPC
Energy Efficiency Considerations for HPC Procurements
Matthew Larsen
Lawrence Livermore National Laboratory
A Flexible System For In Situ Triggers
Jeffrey Larson
Argonne National Laboratory
Hybrid Quantum-Classical Computing Architectures
Robert Latham
Argonne National Laboratory
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
Parallel-IO in Practice
Methodology for the Rapid Development of Scalable HPC Data Services
Scott Lathrop
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education
Software Engineering and Reuse in Computational Science and Engineering
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Fifth SC Workshop on Best Practices for HPC Training and Education
Bruno Lathuilière
EDF Research and Development
Debugging and Optimization of HPC Programs in Mixed Precision with the Verrou Tool
Michael Lau
Texas A&M University
Student Cluster Competition Team Panel Presentation
Jan Laukemann
University of Erlangen-Nuremberg
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures
Erwin Laure
KTH Royal Institute of Technology
Characterizing Deep-Learning I/O Workloads in TensorFlow
Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows
Timothy R. Law
University of Warwick
Performance Portability of an Unstructured Hydrodynamics Mini-Application
Margaret Lawson
University of Illinois
Sandia National Laboratories
Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales
Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales
Valentin Le Fèvre
ENS Lyon
Approximating a Multi-Grid Solver
Franck Le
IBM
Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences
Shaina Le
Texas A&M University
Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level
Anton Lebedev
University of Tübingen
On Advanced Monte Carlo Methods for Linear Algebra on Advanced Accelerator Architectures
Francis Lee
Nanyang Technological University, Singapore
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Craig Lee
Aerospace Corporation
Federated Cloud: An Evolutionary Path from Grid Computing
Dongyoon Lee
Virginia Tech
BESPOKV: Application Tailored Scale-Out Key-Value Stores
Frank Lee
IBM
Experience New Records for Speed and Scale: High Performance Genomics and Imaging
Jason Lee
Los Alamos National Laboratory
Heterogeneous Memory and Arena-Based Heap Allocation
Seyong Lee
Oak Ridge National Laboratory
DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access
OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing
Programming the EMU Architecture: Algorithm Design Considerations for Migratory-Threads-Based Systems
Clacc: Translating OpenACC to OpenMP in Clang
Sunwoo Lee
Northwestern University
Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training
Victor Lee
Intel Corporation
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Understanding Potential Performance Issues Using Resource-Based alongside Time Models
Wonchan Lee
Stanford University
Dynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes
Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems
Miriam Leeser
Northeastern University
Preserving Privacy through Processing Encrypted Data
Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)
Matthew P. Legendre
Lawrence Livermore National Laboratory
Managing HPC Software Complexity with Spack
Gotcha: A Function-Wrapping Interface for HPC Tools
Steve Legensky
Intelligent Light
Invited Talk: Data Science Meets CFD
Arnaud Legrand
University of Grenoble
SMPI Courseware: Teaching Distributed-Memory Computing with MPI in Simulation
Remi Lehe
Lawrence Berkeley National Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Tom Lehman
University of Maryland
Mid-Atlantic Crossroads
SDN for End-to-End Networked Science at the Exascale (SENSE)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Jan-Patrick Lehr
Technical University Darmstadt
Scientific Computing
Compiler-Aided Type Tracking for Correctness Checking of MPI Applications
Olli-Pekka Lehto
Jump Trading LLC
Introduction - 5th International Workshop on HPC User Support Tools: HUST-18
John D. Leidel
Tactical Computing Laboratories
Texas Tech University
Introduction - 4th Workshop for Open Source Supercomputing (OpenSuCo)
xBGAS: Toward a RISC-V ISA Extension for Global, Scalable, Shared Memory
4th Workshop for Open Source Supercomputing (OpenSuCo)
Jason Leigh
University of Hawaii at Manoa
SAGE2 10th Annual International SC BOF: Scalable Amplified Group Environment for Global Collaboration
William Leinberger
General Dynamics Mission Systems
Analytic Based Monitoring of High Performance Computing Applications
Matthew L. Leininger
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Jacques-Bernard Lekien
Atomic Energy and Alternative Energies Commission (CEA)
PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows
John Leonardini
Komprise
Simplifying HPC Data Management at Scale
Man Chong Leong
Rice University
Script of Scripts Polyglot Notebook and Workflow System
Ulf Leser
Humboldt University of Berlin
LOS: Level Order Sampling for Task Graph Scheduling on Heterogeneous Resources
Khaled Ben Letaief
Hong Kong University of Science and Technology
SP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition
Richard Lethin
Reservoir Labs Inc
Fast Detection of Elephant Flows with Dirichlet-Categorical Inference
Mary Ann Leung
Sustainable Horizons Institute
Welcome and Introduction
Building a Career on Your Strengths
Connecting and Thinking Strategically through Your Strengths
Randall LeVeque
University of Washington
Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU)
Dustin Leverman
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Markus Levonyak
University of Vienna
Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods
Scott Levy
Sandia National Laboratories
Lessons Learned from Memory Errors Observed Over the Lifetime of Cielo
Workshop Morning Break
Introduction - Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)
Cannada Lewis
Sandia National Laboratories
Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications
Sven Leyffer
Argonne National Laboratory
Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing
Ang Li
Pacific Northwest National Laboratory
Energy Efficiency of Reconfigurable Caches on FPGAs
Binarized ImageNet Inference in 29us
Cheng Li
University of Illinois
MLModelScope: Evaluate and Measure Machine Learning Models within AI Pipelines
Dong Li
University of California, Merced
Runtime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs
FlipTracker: Understanding Natural Error Resilience in HPC Applications
Understanding Application Recomputability without Crash Consistency in Non-Volatile Memory
Hongbo Li
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Jiajia Li
Georgia Institute of Technology
HiCOO: Hierarchical Storage of Sparse Tensors
Liandeng Li
Tsinghua University
National Supercomputing Center, Wuxi
Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers
Ruipeng Li
Lawrence Livermore National Laboratory
Computing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver
Sherry Li
Lawrence Berkeley National Laboratory
Approximating for Faster, Better and Cheaper Scientific Computing
Sihuan Li
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation
Tonglin Li
Lawrence Berkeley National Laboratory
Anycast: Rootless Broadcasting with MPI
Xiangyu Li
Northeastern University
PRISM: Predicting Resilience of GPU Applications Using Statistical Methods
Xiaoye Sherry Li
Lawrence Berkeley National Laboratory
High Performance Computing in Dynamic Traffic Simulation
Yuxuan Li
Tsinghua University
National Supercomputing Center, Wuxi
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight
Zhen Li
Brown University
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Zhenyu Li
University of Warwick
Optimizing Machine Learning on Apache Spark in HPC Environments
Zhimin li
University of Utah
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption
Xin Liang
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Exploring Best Lossy Compression Strategy By Combining SZ with Spatiotemporal Decimation
Improving Error-Bounded Lossy Compression for Cosmological N-Body Simulation
Chunhua Liao
Lawrence Livermore National Laboratory
Is Data Placement Optimization Still Relevant on Newer GPUs?
Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling
Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free
Wei-keng Liao
Northwestern University
Integration of Burst Buffer in High-Level Parallel I/O Library for Exascale Computing Era
Optimal Algorithms for Half-Duplex Inter-Group All-to-All Broadcast on Fully Connected and Ring Topologies
Communication-Efficient Parallelization Strategy for Deep Convolutional Neural Network Training
Antonio Libri
ETH Zurich
DiG: Enabling Out-of-Band Scalable High-Resolution Monitoring for Data-Center Analytics, Automation, and Control
Seung-Hwan Lim
Oak Ridge National Laboratory
Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Sung-Kyu Lim
Georgia Institute of Technology
Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation
Heng Lin
Tsinghua University
Fma Technology
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds
Pei-Hung Lin
Lawrence Livermore National Laboratory
Is Data Placement Optimization Still Relevant on Newer GPUs?
Data Placement Optimization in GPU Memory Hierarchy Using Predictive Modeling
Using Polyhedral Analysis to Verify OpenMP Applications Are Data Race Free
Peter Lindstrom
Lawrence Livermore National Laboratory
Compression for Scientific Data
John Linford
ARM Ltd
The ARM HPC Experience: From Testbeds to Exascale
Performance Optimization Studies
David Liu
Jet Propulsion Laboratory
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Feng Liu
University of Minnesota
Dynamically Negotiating Capacity Between On-Demand and Batch Clusters
Hang Liu
University of Massachusetts, Lowell
iSpan: Parallel Identification of Strongly Connected Components with Spanning Trees
TriCore: Parallel Triangle Counting on GPUs
Honggao Liu
Texas A&M University
Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level
Jialin Liu
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
Kuang Liu
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Qing Liu
New Jersey Institute of Technology
Workshop Afternoon Break
Introduction - The 4th International Workshop on Data Reduction for Big Scientific Data (DRBSD-4)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
High Performance I/O Frameworks 101
Si Liu
University of Texas
OOOPS: An Innovative Tool for IO Workload Management on Supercomputers
Weiguo Liu
Shandong University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Xin Liu
National Research Centre of Parallel Computer Engineering and Technology
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds
Cross-Layer Group Regularization for Deep Neural Network Pruning
Xing Liu
Intel Corporation
High-Performance Dense Tucker Decomposition on GPU Clusters
Y. Jace Liu
Tongji University
Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences
Yan Liu
University of Illinois
A Massively Parallel Evolutionary Markov Chain Monte Carlo Algorithm for Sampling Complicated Multimodal State SpacesState
Yuanlai Liu
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
Zhengchun Liu
Argonne National Laboratory
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Yarden Livnat
University of Utah
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption
Scott Lloyd
Lawrence Livermore National Laboratory
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
Automatic Generation of Mixed-Precision Programs
Li-Ta Lo
Los Alamos National Laboratory
Using Thrill to Process Scientific Data on HPC
Vicki Lockhart
Red Oak Consulting
Procurement and Commissioning of HPC Systems
Glenn K. Lockwood
Lawrence Berkeley National Laboratory
Evaluation of HPC Application I/O on Object Storage Systems
A Year in the Life of a Parallel File System
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
Parallel-IO in Practice
Jay Lofstead
Sandia National Laboratories
Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales
The IO-500 and the Virtual Institute of I/O
Jeremy Logan
University of Tennessee
Feature-Relevant Data Reduction for In Situ Workflows
Felix Loh
University of Wisconsin
Fault Tolerant Cholesky Factorization on GPUs
Gabriel Loh
Advanced Micro Devices Inc
Challenges of High-Capacity DRAM Stacks and Potential Directions
Johann Lombardi
Intel Corporation
Enabling Data Services for HPC
Victor Lomuller
Codeplay Software Ltd
Challenges of C++ Heterogeneous Programming Using SYCL Implementation Experience: the Four Horsemen of the Apocalypse
Lyle Long
Pennsylvania State University
HPC Impact at TAE Technologies and Pratt & Whitney
HPC Impact at Procter & Gamble, Boeing and GE
HPC Impact at BP and Lockheed Martin
HPC Impact at GM and John Deere
Julia Looney
University of Texas
TACC's Cloud Deployer: Automating the Management of Distributed Software Systems
Burlen Loring
Lawrence Berkeley National Laboratory
Python-Based In Situ Analysis and Visualization
SENSEI Cross-Platform View of In Situ Analytics
Nuria Losada
University of A Coruña, Spain
Toward Ad Hoc Recovery For Soft Errors
Pavel Lougovski
Oak Ridge National Laboratory
Quantum Computing for Scientific Applications
James Low
Keysight Technologies Inc
Make Sure the Network Isn’t the Problem! 400GE Considerations and Best Practices for Testing the Cluster Fabric
Micheal Lowe
Indiana University
Cloud Infrastructure Solutions To Run HPC Workloads
David K. Lowenthal
University of Arizona
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing
Hatem Ltaief
King Abdullah University of Science and Technology
Approximating for Faster, Better and Cheaper Scientific Computing
Toward Smoothing Data Movement Between RAM and Storage
Qiming Lu
Fermi National Accelerator Laboratory
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Xiaoyi Lu
Ohio State University
Designing High-Performance, Resilient, and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters
Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning
Yutong Lu
National Supercomputer Center Guangzhou
Panel 3: Challenge and Chance for Supercomputing Center in China
Thomas Ludwig
German Climate Computing Center
Toward Understanding I/O Behavior in HPC Workflows
Toward a HPC Certification Program
Nathan Luehr
Nvidia Corporation
Exascale Deep Learning for Climate Analytics
Jakob Luettgau
German Climate Computing Center
Argonne National Laboratory
Toward Understanding I/O Behavior in HPC Workflows
Andrew Lumsdaine
Pacific Northwest National Laboratory
17th Graph500 List
Dalton Lunga
Oak Ridge National Laboratory
Ramifications of Evolving Misbehaving Convolutional Neural Network Kernel and Batch Sizes
Lixiang Luo
IBM
A Parallel-Efficient GPU Package for Multiphase Flow in Realistic Nano-Pore Networks
Ye Luo
Argonne National Laboratory
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Ziqing Luo
University of Delaware
Toward Deductive Verification of Message-Passing Parallel Programs
Piotr Luszczek
University of Tennessee
Innovative Computing Laboratory
Batched, Reproducible, and Reduced Precision BLAS
HPCG Benchmark Update
Joseph Lykken
Fermi National Accelerator Laboratory
Quantum Communication Networks and Technologies
Sangkug Lym
University of Texas
Evaluating and Accelerating High-Fidelity Error Injection for HPC
Benjamin Lynch
University of Minnesota
Ceph Applications in HPC Environments
Vickie E. Lynch
Oak Ridge National Laboratory
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Marc Lyonnais
Ciena Corporation
INDIS Showcases Panel: NRE and XNET and Architecture
Return to Top
M
Julie Ma
Massachusetts Green High Performance Computing Center
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Xiaosong Ma
Qatar Computing Research Institute
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds
John MacAuley
Lawrence Berkeley National Laboratory
Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences
SDN for End-to-End Networked Science at the Exascale (SENSE)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Jens Mache
Lewis & Clark College
Jupyter Notebooks and User-Friendly HPC Access
Maciej Maciejewski
Intel Corporation
DAQDB - a Distributed Key-Value Store for Petascale Hot Storage
Lalith Maddegedara
University of Tokyo
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Kamesh Madduri
Pennsylvania State University
Doctoral Showcase II
Kavitha Madhu
Argonne National Laboratory
MPICH: A High Performance Open-Source MPI Implementation
Don Maghrak
Krell Institute
How to Analyze the Performance of Parallel Codes 101
Ashish Mahabal
California Institute of Technology
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Bert Maher
Facebook
Keynote: Glow: An Optimizing Compiler for High-Performance Machine Learning
Ankur Mahesh
Lawrence Berkeley National Laboratory
Exascale Deep Learning for Climate Analytics
Satheesh Maheswaran
Atomic Weapons Establishment (AWE), UK
Performance Portability of an Unstructured Hydrodynamics Mini-Application
Abdulrahman Mahmoud
University of Illinois
Optimizing Software-Directed Instruction Replication for GPU Error Detection
Tom Maiden
Pittsburgh Supercomputing Center
Strategies for Inclusive and Scalable HPC Outreach and Education
Evaluating the Wide Area Classroom after 10,500 HPC Students
Akalanka Mailewa Dissanayaka
Texas Tech University
Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs
Matthias Maiterth
Ludwig Maximilian University of Munich
Intel Corporation
A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
Jun Makino
Kobe University
Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems
Pros and Cons of HPCx benchmarks
Pawel Makowski
Intel Corporation
DAQDB - a Distributed Key-Value Store for Petascale Hot Storage
Preeti Malakar
Indian Institute of Technology Kanpur
Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations
Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications
Nicholas Malaya
Advanced Micro Devices Inc
Keynote: Full Stack Open Source Supercomputing
Tanu Malik
DePaul University
Semantically Organized Containers for Reproducible Research
Allen Malony
University of Oregon
VPA18 Keynote: Not Your Mama’s Angry Fruit Salad: Ruminations on 30 Years of Performance Visualization and Visual Performance Analysis
OpenACC to FPGA: A Directive-Based High-Level Programming Framework for High-Performance Reconfigurable Computing
Allen D. Malony
ParaTools Inc
University of Oregon
Tuning CFD Applications for Intel Xeon Phi with TAU Commander and ParaTools ThreadSpotter
Carlos Maltzahn
University of California, Santa Cruz
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
Semantically Organized Containers for Reproducible Research
Spotting Black Swans With Ease: The Case for a Practical Reproducibility Platform
Joe Mambretti
International Center for Advanced Internet Research (iCAIR)
Northwestern University
Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Anirban Mandal
Renaissance Computing Institute (RenCI)
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Andreas Mang
University of Houston
GPU-Accelerated Interpolation for 3D Image Registration
Filippo Mantovani
Barcelona Supercomputing Center
Teaching HPC Systems and Parallel Programming with Small Scale Clusters of Embedded SoCs
Filling the Gap between Education and Industry: Evidence-Based Methods for Introducing Undergraduate Students to HPC
Don D. March
Oak Ridge National Laboratory
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Alexander Margolin
Hebrew University of Jerusalem
Tree-Based Fault-Tolerant Collective Operations for MPI
Oana Marin
Argonne National Laboratory
Large-Scale PDE-Constrained Optimization
Ivana Marincic
University of Chicago
A Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints
Stefano Markidis
KTH Royal Institute of Technology
Characterizing Deep-Learning I/O Workloads in TensorFlow
Efficient Algorithms for Collective Operations with Notified Communication in Shared Windows
HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making
George Markomanolis
Oak Ridge National Laboratory
The IO-500 and the Virtual Institute of I/O
Aram Markosyan
Sandia National Laboratories
Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications
Pak Markthub
Tokyo Institute of Technology
DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access
Osni Marques
Lawrence Berkeley National Laboratory
Innovative Approaches for Developing Accessible, Productive, Scalable HPC Training
The HPC Best Practices Webinar Series
Chris Marroquin
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Suresh Marru
Indiana University
SciGaP: Apache Airavata Hosted Science Gateways
Nicole Marsaglia
University of Oregon
A Flexible System For In Situ Triggers
David Martin
Argonne National Laboratory
Achieving Performance on Large-Scale Intel Xeon-Based Systems
HPC Impact at TAE Technologies and Pratt & Whitney
HPC Impact at Procter & Gamble, Boeing and GE
HPC Impact at BP and Lockheed Martin
HPC Impact at GM and John Deere
Joanne Martin
Hartman Executive Advisors
SC: The Conference
Jos Martin
MathWorks Inc
Panel Discussion
Steve Martin
Cray Inc
Power API and Redfish: Standardizing Power Measurement and Control for HPC
Maxime Martinasso
Swiss National Supercomputing Centre
RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Richard C. Martineau
Idaho National Laboratory
A General-Purpose Hierarchical Mesh Partitioning Method with Node Balancing Strategies for Large-Scale Numerical Simulations
Dominique Martinet
Atomic Energy and Alternative Energies Commission (CEA)
On the Applicability of PEBS-Based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
Dave Martinez
Sandia National Laboratories
The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification
Data Analytics for System and Facility Energy Management
Energy Efficiency Considerations for HPC Procurements
Jose Maria Martinez
Technical University of Valencia
The MANGO Process for Designing and Programming Multi-Accelerator Multi-FPGA Systems
Daniel A. Martinez
US Army Engineer Research and Development Center
Deep Learning Evolutionary Optimization for Regression of Rotorcraft Vibrational Spectra
Jan Martinovic
Technical University of Ostrava, Czech Republic
HPC-as-a-Service for Life Sciences
Job Simulation for Large-Scale PBS-Based Clusters with the Maui Scheduler
Margaret Martonosi
Princeton University
What Is the Role of Architecture and Software Researchers in Making Quantum Computing Practical?
Xavier Martorell
Barcelona Supercomputing Center
Benchmarking Scientific Reconfigurable / FPGA Computing
Naoya Maruyama
Lawrence Livermore National Laboratory
Benchmarking Scientific Reconfigurable / FPGA Computing
Exascale Machine Learning
Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems
Programming Systems Tools
Michael Mascagni
Florida State University
National Institute of Standards and Technology
Introduction - Computational Reproducibility at Exascale 2018 (CRE2018)
Computational Reproducibility at Exascale 2018 (CRE2018)
Kristyn Maschhoff
Cray Inc
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
George Mason
Mississippi State University
Large Scale MPI-Parallelization of LBM and DEM Systems: Accelerating Research by Using HPC
Matt Masten
Intel Corporation
Function/Kernel Vectorization via Loop Vectorizer
Fabian Mastenbroek
Delft University of Technology
A Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments
Sergi Mateo Bellido
Barcelona Supercomputing Center
Mastering Tasking with OpenMP
Michael Matheson
Oak Ridge National Laboratory
Exascale Deep Learning for Climate Analytics
Gerald Mathias
Leibniz Supercomputing Centre
Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures
Amrita Mathuriya
Intel Corporation
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Yoshimasa Matsumura
University of Tokyo
Multi-GPU Accelerated Non-Hydrostatic Numerical Ocean Model with GPUDirect RDMA Transfers
Satoshi Matsuoka
RIKEN
Tokyo Institute of Technology
DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access
Keynote
Approximating for Faster, Better and Cheaper Scientific Computing
“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities
Introduction - The 3rd International Workshop on Post-Moore Era Supercomputing (PMES)
Exascale Machine Learning
Greg Matthews
NASA Ames Research Center
PBS Pro Open Source Project Community BoF
Marta Mattoso
Federal University of Rio de Janeiro
A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows
Tim Mattson
Intel Corporation
Programming Your GPU with OpenMP: A Hands-On Introduction
OpenMP Common Core: a “Hands-On” Exploration
Zakhar Matveev
Intel Corporation
Performance Tuning of Scientific Codes with the Roofline Model
Curtis Maves
Purdue University
cgroups py : Using Linux Control Groups and Systemd to Manage CPU Time and Memory
John Mawer
University of Manchester
First Steps in Porting the LFRic Weather and Climate Model to the FPGAs of the EuroExa Architecture
Don Maxwell
Oak Ridge National Laboratory
GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Rajiv Mayani
University of Southern California
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Wil Mayers
Alces Flight Limited
The Movement toward HPC Inclusivity: Achieving On-Demand Accessibility of High Performance Computing (HPC) through Ephemeral Projects Utilizing the Alces Gridware Project
John D. McCalpin
University of Texas
Texas Advanced Computing Center
HPL and DGEMM Performance Variability on the Xeon Platinum 8160 Processor
Dave McCarren
US Navy, Oceanographer of the Navy
Purpose-Built HPC: Last Hope for Earth System Prediction?
Jarrod McClean
Google LLC
Quantum Computing for Scientific Applications
Meghan McClelland
Versity Software Inc
Exascale Archiving - Challenges and Opportunities
Patrick McCormick
Los Alamos National Laboratory
Correctness of Dynamic Dependence Analysis for Implicitly Parallel Tasking Systems
Task-Based Programming
Peter McCorquodale
Lawrence Berkeley National Laboratory
A Low-Communicaton Method to Solve Poisson's Equation on Locally-Structured Grids
Ralph McEldowney
US Department of Defense HPC Modernization Program, Air Force Research Laboratory
Learning to Lead in HPC - Strategies to Start Your Leadership Journey
Ken McElvain
University of California, Berkeley
Lawrence Berkeley National Laboratory
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Kenton McHenry
University of Illinois
National Research Infrastructure: Collaborative Session
Suzanne McIntosh
New York University
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
Third Annual Meeting of the SIGHPC - Big Data Chapter
Simon McIntosh-Smith
University of Bristol
The ARM HPC Experience: From Testbeds to Exascale
Programming Your GPU with OpenMP: A Hands-On Introduction
Linda McIver
Australian Data Science Education Institute
Data Science and HPC Education and Outreach
Robert McLay
University of Texas
Getting Scientific Software Installed
Scott McMillan
Nvidia Corporation
Making Container Easier with HPC Container Maker
Donald McMullen
Texas A&M University
Evaluating Active Learning Approaches for Teaching Intermediate Programing at an Early Undergraduate Level
CiSE-ProS - Using Virtual Reality to Enforce Principles of Physical Cybersecurity
Colin McMurtrie
Swiss National Supercomputing Centre
RM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management
Panel 1: A Site-Local View of Creating a Pan-European Federated Research Infrastructure
Stephen McNally
Oak Ridge National Laboratory
GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan
Lawrence Meadows
Intel Corporation
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Ondřej Meca
Technical University of Ostrava, Czech Republic
Workflow for Parallel Processing of Sequential Mesh Databases
Maryam Mehri Dehnavi
University of Toronto
ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism
Susan Mehringer
Cornell University
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Fifth SC Workshop on Best Practices for HPC Training and Education
Mario Melara
Lawrence Berkeley National Laboratory
Managing HPC Software Complexity with Spack
Rami Melhem
University of Pittsburgh
Partial Redundancy in HPC Systems with Non-Uniform Node Reliabilities
John Mellor-Crummey
Rice University
Dynamic Data Race Detection for OpenMP Programs
Ruby Mendenhall
University of Illinois
Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.
A Black Woman’s Sojourn in High Performance Computing: Recovering Lost History
Celso L. Mendes
University of Illinois
National Center for Supercomputing Applications
Best Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience
Pete Mendygral
Cray Inc
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Jintao Meng
Tencent Holdings Ltd
FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures
Xiangxu Meng
Shandong University
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight
Susan Mengel
Texas Tech University
Dynamic and Portable Vulnerability Assessment Testbed with Linux Containers to Ensure the Security of MongoDB in Singularity LXCs
Harshitha Menon
Lawrence Livermore National Laboratory
Error Analysis in HPC Applications Using Algorithmic Differentiation
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption
Automatic Generation of Mixed-Precision Programs
Michael Mercier
Atos
Considering the Development Workflow to Achieve Reproducibility with Variation
Cristin Merritt
Alces Flight Limited
The Movement toward HPC Inclusivity: Achieving On-Demand Accessibility of High Performance Computing (HPC) through Ephemeral Projects Utilizing the Alces Gridware Project
Personalized Medicine and HPC
Michal Merta
Technical University of Ostrava, Czech Republic
Distributed Fast Boundary Element Methods
Jeff Messig
Enablence Technologies Inc
FlexLION: Scalable and Reconfigurable All-to-All Photonic Interconnects
Paul Messina
Argonne National Laboratory
How Can Lessons Learned in the Past Forty Years Guide Future HPC Research Strategies?
Peter Messmer
Nvidia Corporation
Nvidia Corporation
Interactivity in HPC
Martin Meuer
ISC Group
TOP500 Supercomputers
Bernd Meyer
University of Erlangen-Nuremberg
University of Erlangen-Nuremberg
Boosting the Scalability of Car-Parrinello Molecular Dynamics Simulations for Multi- and Manycore Architectures
Dr. Rene Meyer
AMAX
The Next Wave of HPC in the Datacenter
Scott Michael
Indiana University
Machine Learning and AI
HPC Workflow
Marek Michalewicz
Interdisciplinary Center for Mathematical and Computational Modeling
University of Warsaw
Panel 1: Role of Federated Polish HPC Centers in Polish AI Initiatives and EuroHPC Program
Martial Michel
Data Machines Corporation
Federated Cloud: An Evolutionary Path from Grid Computing
Cloud Infrastructure Solutions To Run HPC Workloads
Lauren Milechin
Massachusetts Institute of Technology
The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training
Barton Miller
University of Wisconsin
Secure Coding Practices and Automated Assessment Tools
Julian Miller
RWTH Aachen University
PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications
Michelle Strout
University of Arizona
ParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism
Students@SC: Careers in Industry, Research Labs, and Academia
Navigating the SC Conference Technical Program Submission Process
Josh Milthorpe
Australian National University
AIWC: OpenCL-Based Architecture Independent Workload Characterization
Amanda J. Minnich
Lawrence Livermore National Laboratory
Safety, Reproducibility, Performance: Accelerating Cancer Drug Discovery with Cloud, ML, and HPC Technologies
Ron Minnich
Google LLC
Google LLC
Students@SC: Careers in Industry, Research Labs, and Academia
Tiffany Mintz
Oak Ridge National Laboratory
Shortest Path and Neighborhood Subgraph Extraction on a Spiking Memristive Neuromorphic Implementation
Marco Minutoli
Pacific Northwest National Laboratory
Enabling High-Level Graph Processing via Dynamic Tasking
Tommy Minyard
University of Texas
Texas Advanced Computing Center
The New NSF-Funded Resource: Frontera - Towards a Leadership Class Computing Facility
Azalia Mirhoseini
Google LLC
Morning Keynote – Azalia Mirhoseini (Google)
Vladimir Mironov
Lomonosov Moscow State University
MPI/OpenMP parallelization of the Fragment Molecular Orbitals Method in GAMESS
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Sanchit Misra
Intel Corporation
Parallel Computing Lab
Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting
Konstantina Mitropoulou
Intel Corporation
Function/Kernel Vectorization via Loop Vectorizer
Fumiaki Miura
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Takaaki Miyajima
RIKEN
Stream Computing of Lattice-Boltzmann Method on Intel Programmable Accelerator Card
Daniel Mlakar
Graz University of Technology
faimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU
Susan Mniszewski
Los Alamos National Laboratory
Community Detection Across Emerging Quantum Architectures
Non-Neural Network Applications for Spiking Neuromorphic Hardware
Bernd Mohr
Forschungszentrum Juelich
SC: The Conference
Kathryn Mohror
Lawrence Livermore National Laboratory
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
Students@SC: Careers in Industry, Research Labs, and Academia
Introduction - PDSW-DISCS: Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption
Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems
VeloC: Very Low Overhead Checkpointing System
Multi-Level Memory and Storage for HPC and Data Analytics
Diana Moise
Cray Inc
CosmoFlow: Using Deep Learning to Learn the Universe at Scale
Cleve Moler
MathWorks Inc
Panel: Interactivity in Supercomputing
Shintaro Momose
Tohoku University
NEC Corporation
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
Inder Monga
Lawrence Berkeley National Laboratory
Energy Sciences Network (ESnet)
SDN for End-to-End Networked Science at the Exascale (SENSE)
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Laura Monroe
Los Alamos National Laboratory
Improving Application Resilience by Extending Error Correction with Contextual Information
Raffaele Montella
Parthenope University of Naples
DagOn*: Executing Direct Acyclic Graphs as Parallel Jobs on Anything
David Montoya
Los Alamos National Laboratory
Energy and Power Aware Job Scheduling and Resource Management: Global Survey --- An In-Depth Analysis
How to Analyze the Performance of Parallel Codes 101
Adam Moody
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Multi-Client DeepIO for Large-Scale Deep Learning on HPC Systems
VeloC: Very Low Overhead Checkpointing System
Logan Moody
Lawrence Livermore National Laboratory
James Madison University
Automatic Generation of Mixed-Precision Programs
Tim Moon
Lawrence Livermore National Laboratory
Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems
Scalable Deep Ensemble Learning for Cancer Drug Discovery
Sebastien Morais
Atomic Energy and Alternative Energies Commission (CEA)
PaDaWAn: a Python Infrastructure for Loosely Coupled In Situ Workflows
Nicolas Morales
Sandia National Laboratories
Distributed Memory Futures for Compile-Time, Deterministic-by-Default Concurrency in Distributed C++ Applications
José Moreira
IBM
HPC Graph Toolkits and the GraphBLAS Forum
Kenneth Moreland
Sandia National Laboratories
ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
Miquel Moretó
Barcelona Supercomputing Center
Polytechnic University of Catalonia
Runtime-Assisted Cache Coherence Deactivation in Task Parallel Programs
Kazutaka Morita
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Vitali Morozov
Argonne National Laboratory
Benchmarking Machine Learning Methods for Performance Modeling of Scientific Applications
Karla Morris
Sandia National Laboratories
Introduction - PAW-ATM: Parallel Applications Workshop - Alternatives to MPI
Alexander Moskovsky
RSC Group
Panel 2: Russian HPC Trends: a View From a Local Vendor Trench
Evaluation of Intel Memory Drive Technology Performance for Scientific Applications
Daniel Mossé
University of Pittsburgh
Supporting Thorough Artifact Evaluation with Occam
Philipp Mösta
University of California, Berkeley
Programmable Interactive Visualization of a Core-Collapse Supernova Simulation
Michael Motley
University of Washington
Accelerating Wave-Propagation Algorithms with Adaptive Mesh Refinement Using the Graphics Processing Unit (GPU)
Charles Moulinec
Science and Technology Facilities Council, UK
GPU Acceleration at Scale with OpenPower Platforms in Code_Saturne
Irene Moulitsas
Cranfield University
Development and Performance Comparison of MPI and Fortran Coarrays within an Atmospheric Research Model
Misbah Mubarak
Argonne National Laboratory
Introduction - Women in HPC: Diversifying the HPC Community
Evaluating the Impact of Spiking Neural Network Traffic on Extreme-Scale Hybrid Systems
Women in HPC: Diversifying the HPC Community
Gihan Mudalige
University of Warwick
Swiss Army Programming: Performance and Portability from Modern Tools
OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling
Heterogeneous CPU-GPU Execution of Stencil Applications
Mayur Mudigonda
Lawrence Berkeley National Laboratory
Exascale Deep Learning for Climate Analytics
Leonie Mueck
Public Library of Science
Quantum Communication Networks and Technologies
Frank Mueller
North Carolina State University
Doomsday: Predicting Which Node Will Fail When on Supercomputers
Using Darshan and CODES to Evaluate Application I/O Performance
Hummingbird: Efficient Performance Prediction for Executing Genomics Applications in the Cloud
Benson Muite
University of Tartu, Estonia
Best Practices for Scaling-Up and Sustaining HPC Education, Outreach and Training
Diptajyoti Mukherjee
Allegheny College
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
Daichi Mukunoki
Tokyo Woman's Christian University
High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme
Julia Mullen
Massachusetts Institute of Technology
Introduction - Fifth SC Workshop on Best Practices for HPC Training and Education
Strategies for Inclusive and Scalable HPC Outreach and Education
The Impact of MOOC Methodology on the Scalability, Accessibility and Development of HPC Education and Training
Fifth SC Workshop on Best Practices for HPC Training and Education
Matthias Müller
RWTH Aachen University
PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications
Yannik Müller
RWTH Aachen University
PInT: Pattern Instrumentation Tool for Analyzing and Classifying HPC Applications
Todd Munson
Argonne National Laboratory
Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations
Omar Mures
Universidade da Coruña
In-Transit Molecular Dynamics Analysis with Apache Flink
Kohei Murotani
Railway Technical Research Institute, Japan
Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis
Richard Murphy
Micron Technology Inc
17th Graph500 List
Akihiro Musa
Tohoku University
NEC Corporation
Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA
Burcu Mutlu
Pacific Northwest National Laboratory
Polytechnic University of Catalonia
Characterization of the Impact of Soft Errors on Iterative Methods
Erdal Mutlu
Pacific Northwest National Laboratory
HPC Software Verification in Action: A Case Study with Tensor Transposition
Andrew Myers
Lawrence Berkeley National Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Python-Based In Situ Analysis and Visualization
Return to Top
N
Santosh Nagarakatte
Rutgers University
A Parallelism Profiler with What-If Analyses for OpenMP Programs
Ahmad Turan Naimey
Northern Arizona University
Pathogen and Microbiome Institute
Enabling Reproducible Microbiome Science through Decentralized Provenance Tracking in QIIME 2
Koji Nakade
Railway Technical Research Institute, Japan
Development of Numerical Coupled Analysis Method by Air Flow Analysis and Snow Accretion Analysis
Kengo Nakajima
University of Tokyo
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Takashi Nakamura
PEZY Computing
Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems
Aiichiro Nakano
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Kouta Nakashima
Fujitsu Laboratories Ltd
DeepSim-HiPAC: Deep Learning High Performance Approximate Calculation for Interactive Design and Prototyping
Yuji Nakatsukasa
National Institute of Informatics, Japan
Performance Evaluation of the Shifted Cholesky QR Algorithm for Ill-Conditioned Matrices
Raymond Namyst
University of Bordeaux
Runtime for Exascale and Beyond: Convergence or Divergence?
Sai Narasimhamurthy
Seagate Systems UK
Characterizing Deep-Learning I/O Workloads in TensorFlow
Sri Hari Krishna Narayanan
Argonne National Laboratory
A Study on Checkpoints Compression for Adjoint Computation
Joseph Nardi
Carleton College
A Statistical Analysis of Compressed Climate Model Data
Akira Naruse
Nvidia Corporation
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing
Saber Naserifar
California Institute of Technology
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Maxim Naumov
Facebook
A Block-Oriented, Parallel, and Collective Approach to Sparse Indefinite Preconditioning on GPUs
Rob Neely
Lawrence Livermore National Laboratory
P3HPC Community Discussion and Next Steps
Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)
International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)
Henry J. Neeman
University of Oklahoma
Sustaining Research Software
Workloads and Benchmarks for System Acquisition
Gianina Alina Negoita
Iowa State University
Deep Learning: Extrapolation Tool for Computational Nuclear Physics
David H. Neill Asanza
Grinnell College
Los Alamos National Laboratory
Challenges of Performance Portability for Fortran Unstructured Mesh Codes
Effective Performance Portability
Performance Portability Challenges for Fortran Applications
Richard Nelson
Faucet Foundation
University of Waikato
Faucet: SDN made Easy
CJ Newburn
Nvidia Corporation
Meeting HPC Container Challenges as a Community
Brett Newman
Microway Inc
The Difference Between HPC on Premises and in the Cloud
Harvey Newman
California Institute of Technology
Fine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences
SDN for End-to-End Networked Science at the Exascale (SENSE)
Cho Ng
SLAC National Accelerator Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Huy Cu Ngo
Japan Telegraph and Telephone Corporation
Large-Message Size Allreduce at Wire Speed for Distributed Deep Learning
Linh B. Ngo
West Chester University
Using CloudLab as a Scalable Platform for Teaching Cluster Computing
An Nguyen
University of Texas
Institute for Computational Engineering and Sciences
Arctic Ocean-Sea Ice Interactions
Nga Nguyen
Los Alamos National Laboratory
Comparing Deep Learning with Quantum Inference on The D-Wave 2X
Tan Nguyen
Lawrence Berkeley National Laboratory
Phase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows
Trung Nguyen
University of Massachusetts
Toward Developing a Repository of Logical Errors Observed in Parallel Code for Teaching Code Correctness
Vinh T. Nguyen
Texas Tech University
HPCViz: Monitoring Health Status of High Performance Computing Systems
Amy Nicholson
University of North Carolina
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
Bogdan Nicolae
Argonne National Laboratory
A Study on Checkpoints Compression for Adjoint Computation
VeloC: Very Low Overhead Checkpointing System
Marc Nienhaus
Nvidia Corporation
Programmable Interactive Visualization of a Core-Collapse Supernova Simulation
Daisuke Nishiura
Japan Agency for Marine-Earth Science and Technology
Massively Parallel Stress Chain Characterization for Billion Particle DEM Simulation of Accretionary Prism Formation
Keigo Nitadori
Riken Center for Computational Science
Automatic Generation of High-Order Finite-Difference Code with Temporal Blocking for Extreme-Scale Many-Core Systems
Bill Nitzberg
Altair Engineering
PBS Pro Open Source Project Community BoF
Anderson Braulio Nobrega da Silva
Federal Institute of Paraíba
Federal University of Rio Grande do Norte
PaScal Viewer: A Tool for the Visualization of Parallel Scalability Trends
Seo-young Noh
Korea Advanced Institute of Science and Technology
BigData Express: Toward Schedulable, Predictable, and High-Performance Data Transfer
Kelly Nolan
Talent Strategy Institute
Women in HPC: the Importance of Male Allies
Jean-Philippe Nominé
European Technology Platform for High Performance Computing (ETP4HPC)
Atomic Energy and Alternative Energies Commission (CEA)
Consolidating the European Exascale Effort
Ken-ichi Nomura
University of Southern California
Shift-Collapse Acceleration of Generalized Polarizable Reactive Molecular Dynamics for Machine Learning-Assisted Computational Synthesis of Layered Materials
Jorji Nonaka
Riken Center for Computational Science
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Andrew Norman
Fermi National Accelerator Laboratory
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Michael L. Norman
San Diego Supercomputer Center
University of California, San Diego
Computational Cosmology and Astrophysics on Adaptive Meshes Using Charm++
Ali Akbar Nosrati
Texas Tech University
Simulating Data Centers with Redfish-Enabled Equipment
Marziyeh Nourian
North Carolina State University
A Compiler Framework for Fixed-Topology Non-Deterministic Finite Automata on SIMD Platforms
Javier Novo Rodríguez
Appentra Solutions
Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.
Clara Novoa
Texas State University
High-Accuracy Scalable Solutions to the Dynamic Facility Layout Problem
Lucy Nowell
US Department of Energy Office of Advanced Scientific Computing Research
Hot Topics Discussion II: Thriving at Work
Perspectives on Data Reduction from ASCR
Panel Discussion – Best Practices from Organizations on Improving Workplace Diversity.
Tranquility Amidst Turbulence: A Vision for Advancing Scientific Discovery in the Era of Extreme Heterogeneity
Understanding the Reader
Karina Nunez
Pawsey Supercomputing Centre
Strategies for Inclusive and Scalable HPC Outreach and Education
HPC Education and Training: An Australian Perspective
Dorit Nuzman
Intel Corporation
Compiler Optimization for Heterogeneous Locality and Homogeneous Parallelism in OpenCL and LLVM
Marguerite Nyhan
United Nations Global Pulse
HPC Inspires Plenary: HPC and AI: Helping to Solve Humanity’s Grand Challenges
Return to Top
O
Kathryn O'Brien
IBM
Workshop Morning Break
Introduction - International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)
Patrick O'Leary
Kitware Inc
Introduction - ISAV 2018: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization
SENSEI Cross-Platform View of In Situ Analytics
Jared O'Neal
Argonne National Laboratory
Better Scientific Software
Victor Ocaña
University of Texas
Institute for Computational Engineering and Sciences
Arctic Ocean-Sea Ice Interactions
James Oeth
University of Southern California
WRENCH: A Framework for Simulating Workflow Management Systems
Takeshi Ogita
Tokyo Woman's Christian University
High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme
Martin Ohlerich
Leibniz Supercomputing Centre
Which Architecture Is Better Suited for Matrix-Free Finite-Element Algorithms: Intel Skylake or Nvidia Volta?
Martin Ohmacht
IBM
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Dusan Okanovic
University of Stuttgart
Visual Analytics Challenges in Analyzing Calling Context Trees
Kentaro Oku
Kashika Inc
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Stephan Olbrich
University of Hamburg
Toward a HPC Certification Program
Dan Olds
Gabriel Consulting Group
Introduction to Student Cluster Competitions
Katia Oleinik
Boston University
On Launching Ask.CI, a Q&A Platform for Research Computing, Using StackExchange and Discourse
Leonid Oliker
Lawrence Berkeley National Laboratory
Extreme Scale De Novo Metagenome Assembly
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
Daniel Oliveira
Fluminense Federal University, Fluminense Federal University, Brazil
A Practical Roadmap for Provenance Capture and Data Analysis in Spark-Based Scientific Workflows
Luís Oliveira
University of Pittsburgh
Supporting Thorough Artifact Evaluation with Occam
Vyacheslav Olshevsky
KTH Royal Institute of Technology
HPC Meets Real-Time Data: Interactive Supercomputing for Urgent Decision Making
Hensley Omorodion
University of Benin
Special Interest Group on HPC in Resource Constrained Environments (SIGHPC-RCE)
Kenji Ono
Kyushu University
RIKEN
HIVE: A Cross-Platform, Modular Visualization Ecosystem for Heterogeneous Computational Environments
Naoyuki Onodera
Japan Atomic Energy Agency
Communication Reduced Multi-Timestep Algorithm for Real-Time Wind Simulation on GPU-Based Supercomputers
Communication Avoiding Multigrid Preconditioned Conjugate Gradient Method for Extreme Scale Multiphase CFD Simulations
Colin Ophus
Lawrence Berkeley National Laboratory
Automated Labeling of Electron Microscopy Images Using Deep Learning
Sarp H. Oral
Oak Ridge National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
LUSTRE Community BOF: Lustre in HPC and Emerging Data Markets: Roadmap, Features and Challenges
Kostas Orginos
College of William & Mary
Thomas Jefferson National Accelerator Facility
Simulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing
James Osborn
Argonne National Laboratory
Hybrid Quantum-Classical Computing Architectures
Daniel Osei-Kuffuor
Lawrence Livermore National Laboratory
ADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning
Marcin Ostasz
Barcelona Supercomputing Center
Personalized Medicine and HPC
Consolidating the European Exascale Effort
Daniel Otero
Appentra Solutions
Parallelware Analyzer: Speeding Up the Parallel Software Development Lifecycle.
Michael Ott
Leibniz Supercomputing Centre
A Look Ahead: Energy and Power Aware Job Scheduling and Resource Management
Frederick Oullet
University of Florida
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
Kaiming Ouyang
University of California, Riverside
Fault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs
John D. Owens
University of California, Davis
Linear Algebra Is the Right Way to Think About Graphs
Katsuhisa Ozaki
Shibaura Institute of Technology
High Performance Implementation of Reproducible BLAS Routines with Tunable Accuracy Using Ozaki Scheme
Guray Ozen
Barcelona Supercomputing Center
Polytechnic University of Catalonia
Compiler and Runtime Based Parallelization and Optimization for GPUs
OpenMP GPU Offload in Flang and LLVM
Jonathan Ozik
Argonne National Laboratory
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
Return to Top
P
Hans Pabst
Intel Corporation
Anatomy of High-Performance Deep Learning Convolutions on SIMD Architectures
Carlos Pachajoa
University of Vienna
Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods
Emilio Padrón
Universidade da Coruña
In-Transit Molecular Dynamics Analysis with Apache Flink
Francesco Paesani
University of California, San Diego
Parallel Implementation of Machine Learning-Based Many-Body Potentials on CPU and GPU
Scott Pakin
Los Alamos National Laboratory
Navigating the SC Conference Technical Program Submission Process
Introduction to Quantum Computing
Miguel Palacios
Procter and Gamble Company
HPC Enables Simulation-Led Innovation in Places You Would Not Expect
Krishna Palem
Rice University
Doing Moore with Less – Leapfrogging Moore’s Law with Inexactness for Supercomputing
Sudhakar Pamidighantam
Indiana University
SciGaP: Apache Airavata Hosted Science Gateways
Brian Pan
H3 Platform Inc
A Cost-Effective Flexible System Optimized for DNN and ML
Tony C. Pan
Georgia Institute of Technology
School of Computational Science and Engineering
Optimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting
D. K. Panda
Ohio State University
Cooperative Rendezvous Protocols for Improved Performance and Overlap
ESPM2 2018: Closing Remarks
The Next Wave of HPC in the Datacenter
Introduction - ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware
High Performance Middlewares for Next Generation Architectures: Challenges and Solutions
Designing High-Performance, Resilient, and Heterogeneity-Aware Key-Value Storage for Modern HPC Clusters
Unified Communication X (UCX) Community
Scalable and Distributed DNN Training on Modern HPC Systems
InfiniBand, Omni-Path, and High-Speed Ethernet: Advanced Features, Challenges in Designing HEC Systems, and Usage
Exploiting HPC Technologies for Accelerating Big Data Processing and Associated Deep Learning
InfiniBand, Omni-Path, and High-Speed Ethernet for Beginners
ESPM2 2018: Fourth International Workshop on Extreme Scale Programming Models and Middleware
Suraj Pandey
University of Hawaii at Manoa
WRENCH: A Framework for Simulating Workflow Management Systems
Ramesh Pankajakshan
Lawrence Livermore National Laboratory
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems
Ajay Panyala
Pacific Northwest National Laboratory
HPC Software Verification in Action: A Case Study with Tensor Transposition
Jean-Pierre Panziera
European Technology Platform for High Performance Computing (ETP4HPC)
Atos
Consolidating the European Exascale Effort
George Papadimitriou
University of Southern California
End-to-End Online Performance Data Capture and Analysis for Scientific Workflows
Tom Papatheodore
Oak Ridge National Laboratory
Application Porting and Optimization on GPU-Accelerated POWER Architectures
Michael E. Papka
Argonne National Laboratory
Northern Illinois University
Topology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations
Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows
libIS: A Lightweight Library for Flexible In Transit Visualization
Ketan Paranjape
Roche - Diagnostic Information Solutions
Morning Keynote – Computational Approaches in Clinical Applications
Manish Parashar
National Science Foundation
The Future of NSF Supported Advanced Cyberinfrastructure
Stacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows
Enabling Data Services for HPC
TCHPC Career Panel
Scaling Deep Learning for Cancer with Advanced Workflow Storage Integration
Leveraging Scalable Event Distribution to Enable Data-Driven In Situ Scientific Workflows
High Performance I/O Frameworks 101
Ojas Parekh
Sandia National Laboratories
Quantum Computing for Scientific Applications
Byung Hoon Park
Oak Ridge National Laboratory
A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform
Jaehong Park
Lawrence Berkeley National Laboratory
WarpX: Toward Exascale Modeling of Plasma Particle Accelerators
Scott Parker
Argonne National Laboratory
Characterization of MPI Usage on a Production Supercomputer
Scott Paschke
University of Michigan
Introduction to Kubernetes
Valerio Pascucci
University of Utah
SpotSDC: an Information Visualization System to Analyze Silent Data Corruption
A Task-Based Abstraction Layer for User Productivity and Performance Portability in Post-Moore’s Era Supercomputing
libIS: A Lightweight Library for Flexible In Transit Visualization
Marc Paterno
Fermi National Accelerator Laboratory
Methodology for the Rapid Development of Scalable HPC Data Services
Enabling Neutrino and Antineutrino Appearance Observation Measurements with HPC Facilities
Data-Parallel Python for High Energy Physics Analyses
Tapasya Patki
Lawrence Livermore National Laboratory
“If you can’t measure it, you can’t improve it” -- Software Improvements from Power/Energy Measurement Capabilities
Collaboration Toward a Software Stack for System Power Optimization: The HPC PowerStack
Flux: Overcoming Scheduling Challenges for Exascale Workflows
Understanding Simultaneous Impact of Network QoS and Power on HPC Application Performance
Gopal Patnaik
US Naval Research Laboratory
The ARM HPC Experience: From Testbeds to Exascale
Aristides Patrinos
Novim Group
Precision and Personalized Medicine: The Time Has Arrived
Christos Patriotis
National Cancer Institute
Toward a Pre-Cancer Image Atlas through Crowdsourcing and Machine Learning
Josh Patterson
Nvidia Corporation
Panel: Interactivity in Supercomputing
Michael Patterson
Intel Corporation
The Facility Perspective on Liquid Cooling: Experiences and Proposed Open Specification
Robert M. Patton
Oak Ridge National Laboratory
Exploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines
Introduction - Machine Learning in HPC Environments
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation
Md Mostofa Ali Patwary
Baidu USA
Graph Algorithms and Systems
Anmol Paudel
Marquette University
OpenACC-Based GPU Parallelization of Plane Sweep Algorithm for Geometric Intersection
Sri Raj Paul
Rice University
A Unified Runtime for PGAS and Event-Driven Programming
David Paulsen
Viking Enterprise Solutions
Cassandra in Dockers Deployment Using an NVMe Fabric
Robert Pavel
Los Alamos National Laboratory
Optimizing Next Generation Hydrodynamics Code for Exascale Systems
Nicholas Pavini
American River College