SC23 Proceedings

SC Technical Program Archives

Workshops

High Performance Python for Science at Scale


  1. High Performance Python for Science at Scale

  2. HPPSS – Introduction

  3. HPPSS – Invited Speaker

  4. HPPSS – Morning Break

  5. Demonstration of Portable Performance of Scientific Machine Learning on High Performance Computing Systems

  6. WisdomWombat: A Polyglot Dataflow CFD Code Using Python and Dragon

  7. Maximizing Data Utility for HPC Python Workflow Execution

  8. Dragon Proxy Runtimes and Multi-System Workflows

  9. HPPSS – Panel Discussion

  10. HPPSS – Concluding Remarks


Back to Workshop Archive Listing


HPC Systems Professionals Workshop (HPCSYSPROS23)


  1. HPCSYSPROS 23 – Opening Remarks

  2. Clushible: Tidal Wave-Like Configuration with Ansible

  3. Embracing Batch on Kubernetes

  4. Self-Service Monitoring of HPC and Openstack Jobs for Users

  5. ICE 2.0: Restructuring and Growing an Instructional HPC Cluster

  6. HPCSYSPROS 23 – Morning Break

  7. MareNostrum 5: Site Report from BSC

  8. Democratizing Remote HPC Storage Access

  9. What a GReaT Scheduling Opportunity

  10. Overcoming Active Directory Woes with Plain Text Caches and Replacing Passwords

  11. Heterogeneous Syslog Analysis: There Is Hope

  12. Report on Adaptable Open-Source Disaster Recovery Solution for Multi-Petabyte Storage Systems

  13. HPCSYSPROS 23 – Closing Remarks


Back to Workshop Archive Listing


RSDHA: Redefining Scalability for Diversely Heterogeneous Architectures


  1. Invited Talk: Scaling Computing for Concurrent Data Structures Using Near-Memory Processing Architectures

  2. Value-Based Resource Management at SoC Scale

  3. NVMe-Backed GNN Training on GPU Leveraging a Paged UVM Memory System

  4. RSDHA – Morning Break

  5. FFTX-IRIS: Toward Performance Portability and Heterogeneity for SPIRAL Generated Code

  6. CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types

  7. Vertical Scaling of Variational Multiscale Modeling for Fluid Dynamics: Successes, Challenges, and Opportunities

  8. Accelerator Integration in a Tile-Based SoC: Lessons Learned with a Hardware Floating Point Compression Engine

  9. Evaluating Primitives in Deep Neural Network Libraries: A Case Study with the Softmax Functions

  10. RSDHA – Panel Discussion


Back to Workshop Archive Listing


Third International Symposium on Quantitative Codesign of Supercomputers


  1. Third International Symposium on Quantitative Codesign of Supercomputers

  2. Welcome and Workshop Logistics

  3. Co-design at system and component level: examples from the DEEP and EPI projects

  4. Toward the Development of a Comprehensive Digital Twin of an Exascale Supercomputer

  5. SQCS – Morning Break

  6. Enabling Codesign in the Software Tools Ecosystem Project (STEP)

  7. SQCS'23 – Panel

  8. SQCS'23 – Moderated Discussion

  9. SQCS'23 – Closing Remarks


Back to Workshop Archive Listing


Research Software Engineers in HPC (RSE-HPC-2023)


  1. Research Software Engineers in HPC (RSE-HPC-2023)

  2. RSE-HPC-2023 - Welcome and Overview

  3. RSE-HPC-2023 – Featured Talk: UNIVERSE-HPC – Toward a Sustainable RSE Training Ecosystem

  4. Elevating the Undergraduate Internship: Five Strategies for Putting the “R” in RSE

  5. International RSE Collaboration with the Institute of Computing for Climate Science and the Virtual Earth System Research Institute

  6. Years as a Trustee of SocRSE UK: A Retrospective

  7. Life as an RSE at the University of Birmingham, UK

  8. RSE-HPC-2023 – Morning Break

  9. RSE-HPC-2023 – Panel: RSE Training and Mentoring

  10. Starting at the Bottom, Now We’re Here: Building an African RSE Community

  11. Catalyzing Research Software Engineering (RSE) Adoption in Underrepresented Regions: Harnessing the Power of Bioinformatics Communities

  12. RSE-HPC-2023 – Breakout Discussions

  13. RSE-HPC-2023 – Report Back from Breakouts

  14. RSE-HPC-2023 – Wrapup


Back to Workshop Archive Listing


LLVM-HPC2023: The Ninth Workshop on the LLVM Compiler Infrastructure in HPC


  1. LLVM-HPC2023: The Ninth Workshop on the LLVM Compiler Infrastructure in HPC

  2. OpenMP Kernel Language Extensions for Performance Portable GPU Codes

  3. DPU Offloading Programming with the OpenMP API

  4. LLVM-HPC2023 – Morning Break

  5. Fortran Performance Optimisation and Auto-Parallelization by Leveraging MLIR-Based Domain Specific Abstractions in Flang

  6. Precision and Performance Analysis of C Standard Math Library Functions on GPUs

  7. Lightning Talk - Cppless: Productive and Performant Serverless Programming in C++

  8. Lightning Talk – Automating Loop Optimization with Code Samples and AST Matching

  9. Lightning Talk – Just-in-Time Autotuning

  10. Lightning Talk – META: A Toolkit for Template Metaprogramming Performance Analysis

  11. Panel Discussion

  12. Closing Remarks


Back to Workshop Archive Listing


Sustainable Supercomputing


  1. Sustainable Supercomputing

  2. Sustainability in HPC: Vision and Opportunities

  3. Evaluating Total Environmental Impact for a Computing Infrastructure

  4. Comparing Power Signatures of HPC Workloads: Machine Learning vs Simulation

  5. Accurate Measurement of Application-Level Energy Consumption for Energy-Aware Large-Scale Simulations

  6. Sustainable Supercomputing – Morning Break

  7. Keys to Sustainable Leadership Supercomputing for 2025+: Location, Power, and Flexibility

  8. Energy Efficiency of Quantum Statevector Simulation at Scale

  9. Reducing HPC Energy Footprint for Large Scale GPU Accelerated Workloads

  10. Emissions and Energy Efficiency on Large-Scale High Performance Computing Facilities: ARCHER2 UK National Supercomputing Service Case Study

  11. ReAPER: Region Aware Power and Energy Regulator

  12. Wrap-Up Discussion


Back to Workshop Archive Listing


HUST-23: 10th International Workshop on HPC User Support Tools


  1. HUST-23: 10th International Workshop on HPC User Support Tools

  2. HUST-23 Introduction

  3. REMORA Resource Monitor: Usability, Performance, and User Interface Improvements

  4. NPAT - A Power Analysis Tool at NERSC

  5. Centralized Provisioning of Large Language Models for a Research Community

  6. HUST-23 – Morning Break

  7. ZeroSum: User Space Monitoring of Resource Utilization and Contention on Heterogeneous HPC Systems

  8. A Fast and Responsive Web-Based Framework for Visualizing HPC Application Usage

  9. CaRV – Accelerating Program Optimization through Capture, Replay, Validate

  10. Introducing Open OnDemand to Supercomputer Fugaku

  11. MSR-genie: Navigating Model Specific Registers across Processor Generations

  12. PEAK: A Light-Weight Profiler for HPC Systems

  13. BaRRT: Buildtime and Runtime Reproducibilty Tool for Software Development and Testing

  14. PTI-GPU: Kernel Profiling and Assessment on Intel GPUs

  15. HUST-23 – Conclusion


Back to Workshop Archive Listing


1st Workshop on Enabling Predictive Science with Optimization and Uncertainty Quantification in HPC


  1. Welcome and Introduction

  2. Keynote Speaker

  3. Efficient Probabilistic Tuning of Ensemble Forecasting Method

  4. Uncertainty Quantification of Reduced-Precision Time Series in Turbulent Channel Flow

  5. EPSOUQ-HPC – Morning Break

  6. Optimized Uncertainty Estimation for Vision Transformers: Enhancing Adversarial Robustness and Performance Using Selective Classification

  7. Localization of Gamma-Ray Bursts in a Balloon-Borne Telescope

  8. Automatic Search Guided Code Optimization Framework for Mixed-Precision Scientific Applications

  9. Uncertainty Quantification of Metal Additive Manufacturing Processing Conditions Through the Use of Exascale Computing

  10. Closing Remarks


Back to Workshop Archive Listing


10th Annual International Workshop on Innovating the Network for Data Intensive Science (INDIS) Final


  1. INDIS and SCinet Introduction

  2. INDIS Esteemed Guest Talk: Professor Eylem Ekici (Ohio State University)

  3. Morning Break

  4. INDIS Paper 1: Enhancing perfSONAR Measurement Capabilities Using P4 Programmable Data Planes

  5. INDIS Paper 2: Experimental Study of TCP Throughput Profiles and Dynamics Over Dedicated Connections

  6. INDIS Paper 3: Elephants Sharing the Highway – Studying TCP Fairness in Large Transfers Over High Throughput Links

  7. INDIS Paper 4: Evaluation of SCION for User-Driven Path Control – A Usability Study

  8. INDIS Paper 5: Throughput Optimization with a NUMA-Aware Runtime System for Efficient Scientific Data Streaming

  9. Lightning Talks


Back to Workshop Archive Listing


13th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS)


  1. 13th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS)

  2. ROSS – Welcome and Introduction

  3. ROSS – Opening Panel: Is Accelerator Firmware the New HPC OS? Opportunities and Challenges for the OS/R Research Community

  4. ROSS – Morning Break

  5. RDARuntime: An OS for AI Accelerators

  6. GPU Acceleration in Unikernels Using Cricket GPU Virtualization

  7. CARAT KOP: Toward Protecting the Core HPC Kernel from Linux Kernel Modules

  8. Fine-Grained Accelerator Partitioning for Machine Learning and Scientific Computing in Function as a Service Platform

  9. Analysis and Characterization of Performance Variability for OpenMP Runtime


Back to Workshop Archive Listing


XLOOP 2023: The 5th Annual Workshop on Extreme-Scale Experiment-in-the-Loop Computing


  1. XLOOP 2023: The 5th Annual Workshop on Extreme-Scale Experiment-in-the-Loop Computing

  2. XLOOP – Introduction

  3. Demonstrating Cross-Facility Data Processing at Scale with Laue Microdiffraction

  4. Linking the Dynamic PicoProbe Analytical Electron-Optical Beam Line / Microscope to Supercomputers

  5. Speeding Up Charge Exchange Recombination Spectroscopy Analysis in Support of NERSC/DIII-D Realtime Workflow

  6. XLOOP – Morning Break

  7. DLSIA: Deep Learning for Scientific Image Analysis

  8. Exploring Benchmarks for Self-Driving Labs Using Color Matching

  9. Empowering Scientific Discovery through Computing at the Advanced Photon Source

  10. Cross-Facility Orchestration of Electrochemistry Experiments and Computations

  11. Streaming Data from Experimental Facilities to Supercomputers for Real-Time Data Processing

  12. Workflows Are the New Applications – So What?

  13. DevOps Approaches for Interconnected Science Ecosystems

  14. XLOOP – Awards Ceremony


Back to Workshop Archive Listing


5th Workshop on Programming and Performance Visualization Tools (ProTools 2023)


  1. Invited Talk: Using XDMoD for HPC Performance and Quality-of-Service Analysis

  2. Enabling Agile Analysis of I/O Performance Data with PyDarshan

  3. ProTools 2023 – Morning Break

  4. An Event Model for Trace-Based Performance Analysis of MPI Partitioned Point-to-Point Communication

  5. FROOM: A Framework of Operators for OTF2 Modification

  6. GPUscout: Locating Data Movement-Related Bottlenecks on GPUs

  7. Filtering and Ranking of Code Regions for Parallelization via Hotspot Detection and OpenMP Overhead Analysis

  8. Extra-Deep: Automated Empirical Performance Modeling for Distributed Deep Learning


Back to Workshop Archive Listing


The 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23) - Part 1 of 2


  1. The 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23)

  2. Welcome – Part I

  3. Workflow Building Blocks: The Success Story of Environmental Modeling, HPC, and AI for Predicting Farmed Seafood Bacteria Contamination

  4. End-to-End Workflows for Climate Science: Integrating HPC Simulations, Big Data Processing, and Machine Learning

  5. WORKS23 – Afternoon Break

  6. Accelerating Data-Intensive Seismic Research Through Parallel Workflow Optimization and Federated Cyberinfrastructure

  7. A Systematic Mapping Study of Italian Research on Workflows

  8. Transcriptomics Atlas Pipeline: Cloud vs HPC

  9. Patterns and Anti-Patterns in Migrating from Legacy Workflows to Workflow Management Systems

  10. Scale Composite BaaS Services with AFCL Workflows

  11. Laminar: A New Serverless Stream-Based Framework with Semantic Code Search and Code Completion

  12. Optimization Toward Efficiency and Stateful of dispel4py

  13. Wrap Up – Part I


Back to Workshop Archive Listing


2nd International Workshop on Cyber Security in High Performance Computing (S-HPC 2023)


  1. Welcome

  2. Distinguished Speaker

  3. S-HPC 2023 – Afternoon Break

  4. Invited Talk: Thoughts on Security for CXL-3.x-GFAM Clusters with Embedded Computing

  5. Invited Talk: Information Security Controls Prioritization – SABSA for HPC

  6. Analyzing the Performance Impact of HPC Workloads with Gramine+SGX on 3rd Generation Xeon Scalable Processors

  7. RMF for HPC and RDT&E

  8. Closing Remarks


Back to Workshop Archive Listing


6th International Workshop on Emerging Parallel Distributed Runtime Systems and Middleware


  1. Welcome

  2. DeepSpeed4Science: Enabling Future Large-Scale Scientific Discovery through Sophisticated AI System Technologies

  3. IPDRM’2023 – Afternoon Break

  4. HPC Software Scaling for ML Using CXL 3.0 GFAM

  5. Dask-Extended External Tasks for HPC/ML In Transit Workflows

  6. Enabling Large Dynamic Neural Network Training with Learning-Based Runtime Memory Management

  7. MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators

  8. A gem5 Implementation of the Sequential Codelet Model: Reducing Overhead and Expanding the Software Memory Interface

  9. Closing Remarks


Back to Workshop Archive Listing


Ninth Computational Approaches for Cancer Workshop (CAFCW23)


  1. Overcoming the Challenges to Democratizing Precision Medicine: HPC Infrastructure, Health Equity Training Sets, Training a Diverse Workforce, and Mitigating Fears

  2. AI/ML-Derived Whole-Genome Predictor Prospectively and Clinically Predicts Survival and Response to Treatment in Brain Cancer

  3. CAFCW Announcements

  4. CAFCW23 – Afternoon Break

  5. Panel: Diversity, Equity, and Inclusion – from Data to Workforce

  6. Deep Semi-Supervised Transfer Learning for Fully Automated Whole-Body Tumor Quantification and Prognosis of Cancer on PET/CT

  7. Optimized Patient-Specific Catheter Placement for Convection-Enhanced Nanoparticle Delivery in Recurrent Glioblastoma

  8. Environmental Factors and Lung Cancer: A Predictive Spatial Approach

  9. Constructing a Large-Scale Biomedical Knowledge Graph and Its Applications in Drug Discovery

  10. Scalable Lead Prediction with Transformers Using HPC Resources

  11. Entropy-Based Regularization on Deep Learning Models for Anti-Cancer Drug Response Prediction


Back to Workshop Archive Listing


The 9th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-9)


  1. The 9th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-9)

  2. Opening Remarks and Welcome

  3. Invited Talk: Challenges and Opportunities for Data Democratization

  4. LibPressio-Predict: Flexible and Fast Infrastructure For Inferring Compression Performance

  5. What Operations Can Be Performed Directly on Compressed Arrays and with What Error?

  6. Analyzing Impact of Data Reduction Techniques on Visualization for AMR Applications Using AMReX Framework

  7. Fast 2D Bicephalous Convolutional Autoencoder for Compressing 3D Time Projection Chamber Data

  8. Streaming Hardware Compressor Generator Framework

  9. Lossy and Lossless Compression for BioFilm Optical Coherence Tomography (OCT)

  10. Closing Remarks


Back to Workshop Archive Listing


The First Workshop on Democratizing High-Performance Computing (D-HPC)


  1. D-HPC Opening: “LLMs and Democratizing HPC”

  2. The History and Future of Making HPC Technologies Accessible to the wider community

  3. Democratizing HPC by Building a Diverse and Inclusive Workforce

  4. D-HPC – Afternoon Break

  5. Democratizing HPC Access and Use with Knowledge Graphs

  6. Democratizing Science Through Equitable Access to Computing and Data

  7. S4PST: Stewardship of Programming Systems and Tools

  8. D-HPC: Closing Remarks


Back to Workshop Archive Listing


The 1st International Workshop on the Environmental Sustainability of High-Performance Software


  1. Opening Remarks

  2. Energy Consumption Comparison of Parallel Linear Systems Solver Algorithms on HPC Infrastructure

  3. Domain-Specific Energy Modeling for Drug Discovery and Magnetohydrodynamics Applications

  4. An End-to-End HPC Framework for Dynamic Power Objectives

  5. Automatic Energy-Efficient Job Scheduling in HPC: A Novel SLURM Plugin Approach

  6. PM100: A Job Power Consumption Dataset of a Large-Scale Production HPC System

  7. Augmenting ML-Based Predictive Modelling with NLP to Forecast a Job's Power Consumption

  8. Closing Remarks and Best Paper


Back to Workshop Archive Listing


13th Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2023)


  1. FTXS 2023 – Opening Remarks

  2. FTXS 2023 : Invited Speaker (Paolo Rech, "Quantum Computing Reliability: Problems, Tools, and Potential Solutions")

  3. FTXS 2023 – Afternoon Break

  4. Optimizing Write Performance for Checkpointing to Parallel File Systems Using LSM-Trees

  5. Recovery from Silent Data Corruption via Spatial Data Prediction

  6. Disk Failure Trends in Alpine Storage System

  7. Using Benford's Law to Identify Unusual Failure Regions

  8. Dynamic Selective Protection of Sparse Iterative Solvers via ML Prediction of Soft Error Impacts

  9. Evaluating the Resiliency of Posits for Scientific Computing

  10. When to Checkpoint at the End of a Fixed-Length Reservation?

  11. FTXS 2023 – Closing Remarks


Back to Workshop Archive Listing


PDSW23: 8th International Parallel Data Systems Workshop


  1. Welcome and Opening Remarks

  2. Invited Talk

  3. Toward Standardized, Open Object-Based Computational Storage For Large-Scale Scientific Data Analytics

  4. DAOS as HPC Storage: Exploring Interfaces

  5. Toward a Peer-to-Peer Data Distribution Layer for Efficient and Collaborative Resource Optimization of Distributed Dataflow Applications

  6. PDSW – Afternoon Break

  7. Enhancing Metadata Transfer Efficiency: Unlocking the Potential of DAOS in the ADIOS Context

  8. IOMax: Maximizing Out-of-Core I/O Analysis Performance on HPC Systems

  9. Domain-Aware Performant AI-Based Compression

  10. The I/O Trace Initiative: Building a Collaborative I/O Archive to Advance HPC

  11. GrIOt: Graph-Based Modeling of HPC Application I/O Call Stacks for Predictive Prefetch

  12. Advancing Automated I/O Analysis with Multi-Perspective Views

  13. PoliMOR: A Policy Engine "Made-to-Order" for Automated and Scalable Data Management in Lustre

  14. Accelerate Stage-Out in Single Shared Files from Node-Local Burst-Buffers

  15. DAOS Project Update

  16. Compression of Scientific Simulation Data by Stochastic Basis Expansion – Example on Multiple Computer Systems

  17. Closing Remarks


Back to Workshop Archive Listing


Workshop on Software and Hardware Co-Design of Deep Learning Systems in Accelerators (SHDA)


  1. Welcome to SC Workshop SHDA 2023

  2. Keynote: Design of Efficient and Privacy Preserving Machine Learning

  3. SHDA – Afternoon Break

  4. Accuracy-Constrained Efficiency Optimization and GPU Profiling of CNN Inference for Detecting Drainage Crossing Locations

  5. Invited Talk: I/O Profiling and Benchmarking for AI Applications

  6. Benchmarking and In-Depth Performance Study of Large Language Models on Habana Gaudi Processors

  7. Invited Talk: When Optimizing Software produces Optimized Hardware: A Case for Statically-Interpretable Control-Flow Programs

  8. Pareto Optimization of CNN Models via Hardware-Aware Neural Architecture Search for Drainage Crossing Classification on Resource-Limited Devices

  9. Accelerating Hyperparameter Optimization Algorithms with Mixed Precision

  10. Workshop SHDA23 Wrap-Up


Back to Workshop Archive Listing


7th International Workshop on Software Correctness for HPC Applications (Correctness '23)


  1. Correctness Workshop Opening Remarks

  2. HPC Bugs Fest Introduction

  3. Mapping High-Level Concurrency from OpenMP and MPI to ThreadSanitizer Fibers

  4. Rethinking Data Race Detection in MPI-RMA Programs

  5. Correctness '23 – Afternoon Break

  6. RMARaceBench: A Microbenchmark Suite to Evaluate Race Detection Tools for RMA Programs

  7. Data Race Detection Using Large Language Models

  8. Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores

  9. Toward Correctness Checking of MPI Partitioned Communication in MUST

  10. Adding Microbenchmarks with SIMD Data Race to DataRaceBench

  11. Investigating the Real-World Applicability of MPI Correctness Benchmarks

  12. Improve and Stabilize Classification Results of DataRaceBench

  13. Highlighting PARCOACH Improvements on MBI


Back to Workshop Archive Listing


Fourth International Symposium on Checkpointing for Supercomputing (SuperCheck-SC23)


  1. Welcome to SuperCheck-SC23

  2. AI-Augmented SWARM Based Resilience for Integrate Research Infrastructures

  3. Lightning Talk: Diaspora – Resilient Event Processing for Irregular, Distributed Scientific Applications

  4. SuperCheck-SC23 – Afternoon Break

  5. Checkpoint/Restart for CUDA Kernels

  6. Implementation-Oblivious Transparent Checkpoint-Restart for MPI

  7. Asynchronous Multi-Level Checkpointing: An Enabler of Reproducibility using Checkpoint History Analytics

  8. Lightning Talk: Update on Checkpointing and Localized Recovery for Nested Fork-Join Programs

  9. Lightning Talk: Toward Efficient Asynchronous Checkpointing for Large-Language Models

  10. Lightning Talk: Inherent Checkpointing Properties of Nested Parallelism

  11. Lightning Talk: Trade-Offs For Developing File Aggregated I/O For Asynchronous Checkpointing

  12. Lightning Talk: Datastates for Debugging – Using Productive Checkpointing for Improved Debugging


Back to Workshop Archive Listing


IA^3 2023 - 13th Workshop on Irregular Applications: Architectures & Algorithms


  1. IA^3 – Welcome and Introduction

  2. IA^3 – Invited Talk

  3. A New Sparse GEneral Matrix-Matrix Multiplication Method for Long Vector Architecture by Hierarchical Row Merging

  4. IA^3 2023 – Afternoon Break

  5. Towards a Massive-Scale Distributed Neighborhood Graph Construction

  6. A Parallel Algorithm for Updating a Multi-Objective Shortest Path in Large Dynamic Networks

  7. cuAlign: Scalable Network Alignment on GPU Accelerators

  8. TANGO: A GPU-Optimized Traceback Approach for Sequence Alignment Algorithms

  9. Filtering Wasteful Vertex Visits in Breadth-First Search

  10. Accelerating Deep Neural Network Guided MCTS Using Adaptive Parallelism

  11. IA^3 – Concluding Remarks


Back to Workshop Archive Listing


ISAV23: In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization


  1. ISAV23 – Introduction

  2. ISAV23 Invited Keynote – Progress in In-Situ Analysis and Visualization in the Fusion Exascale Code XGC

  3. Information Entropy-Based Camera Focus Point and Zoom Level Adjustment for Smart In-Situ Visualization

  4. Toward a Scalable In Situ Fast Fourier Transform

  5. Enabling In Situ Visualization of Large-Scale Cellular Simulations

  6. ISAV23 – Morning Break

  7. Extensions to the SENSEI In situ Framework for Heterogeneous Architectures

  8. A General Purpose Interface for Interactive Computational Steering Instrumentation Using Ascent

  9. Design of a Framework for Combined Flexible and Efficient Simulation and In Situ Processing

  10. Trigger Smart Data Saving Applied to CO2 Capture in Metal-Organic Frameworks

  11. Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting

  12. Using Umpire In-Situ for Improved Memory Performance

  13. State of In Situ Visualization in Simulations: We are fast. But are we inspiring?

  14. Scaling Computational Fluid Dynamics: In Situ Visualization of NekRS using SENSEI

  15. ISAV23 – Best Paper Award and Closing Remarks


Back to Workshop Archive Listing


Tenth Workshop on Accelerator Programming and Directives (WACCPD 2023)


  1. Tenth Workshop on Accelerator Programming Using Directives (WACCPD 2023)

  2. Porting and Optimizing Meso-NH to AMD MI250X GPUs

  3. Comparing a Naive and a Tree-Based N-Body Algorithm Using Different Standard SYCL Implementations on Various Hardware

  4. Specialized Kernels for Optimizing GPU Offload in OpenMP

  5. WACCPD 2023 – Morning Break

  6. Invited Talk

  7. Performance-Portable GPU Acceleration of the EFIT Tokamak Plasma Equilibrium Reconstruction Code

  8. Characterizing the Performance of Triangle Counting on Graphcore's IPU Architecture

  9. Memory Transfer Decomposition: Exploring Smart Data Movement through Architecture-Aware Strategies

  10. Analysis of MURaM – A Solar Physics Application, for Scalability, Performance, and Portability

  11. Tenth Workshop on Accelerator Programming and Directives (WACCPD2023) – Closing Remarks


Back to Workshop Archive Listing


AI Assisted Software Development for HPC (AI4DEV)


  1. Opening Remarks

  2. AI-Driven Performance Metaprogramming

  3. AI4DEV – Morning Break

  4. MPI-RICAL: Data-Driven MPI Distributed Parallelism Assistance with Transformers

  5. VSCuda: LLM-Based CUDA Extension for Visual Studio Code

  6. LLVM in the age of LLMs: Machine Learning for IR and optimization and more

  7. Unlocking the Potential of Large Language Models for High-Performance Computing Code


Back to Workshop Archive Listing


2023 International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)


  1. P3HPC – Welcome and Introduction

  2. A Performance-Portable SYCL Implementation of CRK-HACC for Exascale

  3. Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic Simulations

  4. Performance Portability Evaluation of Blocked Stencil Computations on GPUs

  5. P3HPC – Morning Break

  6. Benchmarking a Portable Lattice Quantum Chromodynamics Kernel Written in Kokkos and MPI

  7. MatRIS: Multilevel Math Library Abstraction for Heterogeneity and Performance Portability Using IRIS Runtime

  8. Porting Batched Iterative Solvers onto Intel GPUs with SYCL

  9. Evaluating the Performance of One-Sided Communication on CPUs and GPUs

  10. Performance Portability of Programming Strategies for Nearest-Neighbor Communication with GPU-Aware MPI

  11. Evaluating the Performance Portability of SYCL across CPUs and GPUs on Bandwidth-Bound Applications

  12. CuPBoP-AMD: Extending CUDA to AMD Platforms

  13. High-Level GPU Code: A Case Study Examining JAX and OpenMP

  14. Many Cores, Many Models: GPU Programming Model vs. Vendor Compatibility Overview

  15. P3HPC – Wrapup


Back to Workshop Archive Listing


5th International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC)


  1. Canopie-HPC

  2. CANOPIE-HPC– Introduction and Welcome

  3. Survey of Adaptive Containerization Architectures for HPC

  4. HPC Container Conformance

  5. Kubeflow-as-a-Service on HPC clusters – First Experiences

  6. Preemptive Scheduling of Stateful GPU-Intensive HPC Applications in Kubernetes

  7. Enabling Performance for NGC Containers on the Slingshot 11 Interconnect

  8. Lightweight Isolation for HPC Applications

  9. Canopie-HPC – Morning Break

  10. Charliecloud’s Layer-Free, Git-Based Container Build Cache

  11. New Root Emulation Mode for Charliecloud Using seccomp

  12. eBPF-Based Performance Fingerprint of Containerized HPC Applications

  13. Understanding Energy Performance of Containers Deployment on HPC-Based Post-Moore Platforms

  14. Perspectives and Experiences Supporting Containers for Research Computing at the Texas Advanced Computing Center

  15. Early Experiences with Charliecloud for HPC

  16. Computing-as-a-Service Infrastructure for Accelerating Digital Engineering

  17. The Story of Spin: Five Years Supporting Science with Container-Based Services at NERSC

  18. CANOPIE-HPC Community Discussion/Open Q&A


Back to Workshop Archive Listing


The 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23) - Part 2 of 2


  1. The 18th Workshop on Workflows in Support of Large-Scale Science (WORKS23)

  2. Welcome - Part II

  3. FAIRIST of Them All: Meeting Researchers Where They Are With Just-in-Time, FAIR Implementation Advice

  4. A data science pipeline synchronisation method for edge-fog-cloud continuum

  5. WORKS23 – Morning Break

  6. TaskVine: Managing In-Cluster Storage for High-Throughput Data Intensive Workflows

  7. Leveraging Large Language Models to Build and Execute Computational Workflows

  8. Delivering Rules-Based Workflows for Science

  9. Julia as a Unifying End-to-End Workflow Language on the Frontier Exascale System

  10. Scaling on Frontier: Uncertainty Quantification Workflow Applications Using ExaWorks to Enable Full System Utilization

  11. Distributed Data Locality-Aware Job Allocation

  12. Fluxion: A Scalable Graph-Based Resource Model for HPC Scheduling Challenges

  13. The Common Workflow Scheduler Interface: Status Quo and Future Plans

  14. Wrap Up – Part II


Back to Workshop Archive Listing


Workshop on Artificial Intelligence and Machine Learning for Scientific Applications (AI4S)


  1. AI4S – Keynote

  2. AI4S – Morning Break

  3. AI4S – Invited Talk

  4. A Comparison of Mesh-Free Differentiable Programming and Data-Driven Strategies for Optimal Control under PDE Constraints

  5. Toward Foundation Models for Materials Science: The Open MatSci ML Toolkit

  6. Protein Generation via Genome-Scale Language Models with Bio-Physical Scoring

  7. AI4S – Lunch Break

  8. Accelerating Particle and Fluid Simulations with Differentiable and Interpretable Graph Networks for Solving Forward and Inverse Problems

  9. Enabling Performant Thermal Conductivity Modeling with DeePMD and LAMMPS on CPUs

  10. Machine Learning Applied to Single-Molecule Activity Prediction

  11. AI4S – Afternoon Break

  12. Tournament-Based Pretraining to Accelerate Federated Learning

  13. Elastic Deep Learning through Resilient Collective Operations

  14. Toward Rapid Autonomous Electron Microscopy with Active Meta-Learning

  15. Autotuning Apache TVM-Based Scientific Applications Using Bayesian Optimization

  16. Enhancing Heterogeneous Federated Learning with Knowledge Extraction and Multi-Model Fusion

  17. Entropy-Driven Optimal Sub-Sampling of Fluid Dynamics for Developing Machine-Learned Surrogates

  18. Tencoder: Tensor-Product Encoder-Decoder Architecture for Predicting Solutions of PDEs with Variable Boundary Data


Back to Workshop Archive Listing


PMBS23: The 14th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems


  1. PMBS23: The 14th International Workshop on Performance Modeling, Benchmarking, and Simulation of High-Performance Computer Systems

  2. PMBS23 – Welcome

  3. Physical Oscillator Model for Supercomputing

  4. Comparative Evaluation of Bandwidth-Bound Applications on the Intel Xeon CPU MAX Series

  5. PMBS23 – Morning Break

  6. SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study

  7. Reducing Memory Requirements for the IPU Using Butterfly Factorizations

  8. Verifying Performance Guidelines for MPI Collectives at Scale

  9. A Performance Model for Estimating the Cost of Scaling to Practical Quantum Advantage

  10. Hardware Specialization: Estimating Monte Carlo Cross-Section Lookup Kernel Performance and Area

  11. PMBS23 – Lunch Break

  12. Power Analysis of NERSC Production Workloads

  13. Adaptive Stopping Rule for Performance Measurements

  14. Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top 500 List

  15. PMBS23 – Afternoon Break

  16. Risk-Aware Scheduling Algorithms for Variable Capacity Resources

  17. A Reinforcement Learning-Based Backfilling Strategy for HPC Batch Jobs

  18. Evaluating the Potential of Elastic Jobs in HPC Systems

  19. Modeling Data Locality of Sparse Matrix-Vector Multiplication on the A64FX


Back to Workshop Archive Listing


WHPC@SC23: 16th International Women in HPC Workshop


  1. WHPC@SC23: 16th International Women in HPC Workshop

  2. WHPC@SC23 – Introduction

  3. WHPC@SC23 – Invited Speaker: When to Jump – Managing Your Career and Maximizing Your Impact

  4. WHPC@SC23 – Morning Break

  5. WHPC@SC23: Surviving and Thriving as an ‘Outsider' – with the Help of Allies

  6. WHPC@SC23 – Strength in Unity: Fostering Tech Career Persistence

  7. WHPC@SC23 – WHPC Lyceum

  8. WHPC@SC23 – Lunch Break

  9. Investigating Linear Solvers for Power Grid Analysis with Exascale Computing: A Journey of Learning and Collaboration

  10. Potential of Cryogenics Electronics for Future Computing Systems

  11. fAsyLex: Accelerating Legal NLP through Comparative Analysis of Multi-GPU Approaches

  12. An Analysis of Change Point Detection in High Performance Computing

  13. Scalable Graph Analytics and HPC Operational Enhancement: Parallel Computing and ML/DL Innovations

  14. OpenGPT-X: Advancements, Challenges, Exploration, and Future Goals

  15. Accelerating the HPC I/O for Low Latency and High Throughput with 16-Nanometer FPGA-Based Hardware Accelerators

  16. Exploring the Potential of GPU-initiated Communications in HPC Applications

  17. Simulating Quantum Chemistry on Heterogeneous Architectures

  18. Operationalizing HPC Tasks for Space Weather Forecasting Using Celery and Django: Making Automated, HPC-Powered Scientific Results Accessible in Near-Real Time.

  19. Queue Wait Time Prediction in Supercomputers

  20. Spatiotemporal Analysis and Prediction of Laboratory-Generated Turbulence

  21. WHPC@SC23 – Networking Breakout

  22. WHPC@SC23 – Afternoon Break

  23. Fostering Diversity, Equity, and Inclusion (DEI) at Big Tech Firms

  24. WHPC@SC23

  25. WHPC@SC23 – Leading from the Middle

  26. Our Success Case of Full Remote Working

  27. WHPC@SC23 – Conclusion


Back to Workshop Archive Listing


Fourth International Workshop on Quantum Computing Software


  1. Fourth International Workshop on Quantum Computing Software

  2. Fast Simulation of High-Depth QAOA Circuits

  3. Prototype of a Batched Quantum Circuit Simulator for the Vector Engine

  4. Enabling Quantum Computer Simulations on AMD GPUs: A HIP Backend for Google's qsim

  5. Quantum Computing Software – Morning Break

  6. Enabling Scalable VQE Simulation on Leading HPC Systems

  7. MEMQSim: Highly Memory-Efficient and Modularized Quantum State-Vector Simulation

  8. BGLS: A Python Package for the Gate-by-Gate Sampling Algorithm to Simulate Quantum Circuits

  9. TISCC: A Surface Code Compiler and Resource Estimator for Trapped-Ion Processors

  10. JuliQAOA: Fast, Flexible QAOA Simulation

  11. SimuQ: A Domain-Specific Language for Quantum Simulation with Analog Compilation

  12. Quantum Computing Software – Lunch Break

  13. Using Azure Quantum Resource Estimator to Evaluate Performance of Quantum Algorithms

  14. Making QIR Executable

  15. QASMTrans: A QASM Quantum Transpiler Framework for NISQ Devices

  16. Quantum Computing Software – Afternoon Break

  17. QArchSearch: A Scalable Quantum Architecture Search Package

  18. Towards an Expressive Python-Native Interface for Quantum Program Development

  19. An Ising-Based Model for Qubit Mapping

  20. A Reference Implementation for a Quantum Message Passing Interface

  21. Distributing Circuits Over Heterogeneous, Modular Quantum Computing Network Architectures

  22. Open Q&A Session


Back to Workshop Archive Listing


The 6th Annual Parallel Applications Workshop, Alternatives to MPI+X (PAW-ATM)


  1. The 6th Annual Parallel Applications Workshop, Alternatives to MPI+X (PAW-ATM)

  2. Introduction to The 6th Annual Parallel Applications Workshop, Alternatives to MPI+X

  3. Survey of Technologies for Developers of Parallel Applications: SHMEM

  4. Survey of Technologies for Developers of Parallel Applications: Swift/T

  5. Survey of Technologies for Developers of Parallel Applications: Julia

  6. Survey of Technologies for Developers of Parallel Applications: Legate and cuNumeric

  7. Survey of Technologies for Developers of Parallel Applications: Q&A

  8. PAW-ATM – Morning Break

  9. Implementing Scalable Matrix-Vector Products for the Exact Diagonalization Methods in Quantum Many-Body Physics

  10. High-Performance Programming and Execution of a Coral Biodiversity Mapping Algorithm Using Chapel

  11. Design and Analysis of the Network Software Stack of an Asynchronous Many-Task System – The LCI Parcelport of HPX

  12. shmem4py: High-Performance One-Sided Communication for Python Applications

  13. Pure: Evolving Message Passing To Better Leverage Shared Memory within Nodes

  14. PAW-ATM – Lunch Break

  15. PAW-ATM Distinguished Speaker: Ethan Gutmann – National Center for Atmospheric Research: Trials and Tribulations and Joys of Developing with Alternative Parallel Frameworks

  16. PAW-ATM – Afternoon Break

  17. symPACK: A GPU-Capable Fan-Out Sparse Cholesky Solver

  18. PAW-ATM Panel Discussion: Charting Paths to Success with Alternatives to MPI+X


Back to Workshop Archive Listing


ESPM2 2023: Eighth International Workshop on Extreme Scale Programming Models and Middleware


  1. Featured Talk: Aurora Exascale Architecture

  2. ESPM2 2023 – Morning Break

  3. Challenge on Extreme-Hetero Application Programming

  4. The MI300 APU: Programming for CPUs and GPUs on a Single Package

  5. Cross-Stack System Techniques for Trillion-Parameter Scale Model Inference

  6. Performance Portability in the Age of Extreme Heterogeneity

  7. ESPM2 2023 – Lunch Break

  8. Programming Model for Habana/Gaudi2 Accelerators and Its Impact on Deep Learning Inference/Training Performance at Scale

  9. An Autonomous Execution Model for GPUs: When CPUs Take a Back Seat

  10. ESPM2 – Afternoon Break

  11. Who's Winning the Performance Portability Race on GPU Platforms?

  12. Domain-Specific Programming Methodologies for Domain-Specific and Emerging Computing Systems

  13. Top 5 Challenges  in Programming Models and Runtimes for Large Language Models Training/Inference


Back to Workshop Archive Listing


14th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH'23)


  1. Welcome

  2. Invited Talk 1: The Legacy of ECP Software Efforts, Realized, and to Come

  3. ScalAH'23 – Morning Break

  4. Invited Talk 2: Living in a Heterogenous World – How Scientific Workflows Bridge Diverse Cyberinfrastructure and What Can We Do Better?

  5. GPU-Based LU Factorization and Solve on Batches of Matrices with Band Structure

  6. Massively Distributed Finite-Volume Flux Computation

  7. Parallel Symbolic Cholesky Factorization

  8. Advancing the Distributed Multi-GPU ChASE Library through Algorithm Optimization and NCCL Library

  9. ScalAH'23 – Lunch Break

  10. Invited Talk 3: The Pursuit of the Brain’s Ubiquitous Stochasticity

  11. Optimization of Ported CFD Kernels on Intel Data Center GPU Max 1550 Using oneAPI ESIMD

  12. ScalAH'23 – Afternoon Break

  13. Invited Talk 4: Innovative Supercomputing by Integrations of Simulations/Data/Learning on Oakforest-PACS II

  14. Invited Talk 5: Building Quantum Machine Learning for Real-World Applications

  15. Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators

  16. Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs


Back to Workshop Archive Listing


EduHPC-23: Workshop on Education for High Performance Computing


  1. EduHPC-23: Workshop on Education for High Performance Computing

  2. EduHCP23 – Welcome Remarks

  3. EduHPC23 – Invited Talk by Kathy Yelick: Educating Post Exascale HPC Leaders

  4. Teaching Heterogeneous and Parallel Computing with Google Colab and Raspberry Pi Clusters

  5. Infrastructure for Writing Fork-Join Tests

  6. Data-Driven Discovery of Anchor Points for PDC Content

  7. AutoLearn: Learning in the Edge to Cloud Continuum

  8. EduHPC23: Panel Q&A Paper Session I

  9. EduHPC-23 – Afternoon Break

  10. Next Generation Pathways to Computing: Bridging the Diversity Gap in High-Performance Computing Education

  11. Training Experiences by Skills for HPC Ecosystems

  12. Teaching Non-Determinism in High Performance Applications

  13. ML Movie Night: A Pilot Machine Learning Course for High-School Students and Implications for Undergraduate Adaptation

  14. The World's Worst Optical NIC

  15. Composable HPC Curricula: Embracing the UNIX Development Paradigm and Leveraging Core Practices from Linux Kernel Development in HPC Training Marterial Development

  16. Adding Sustainability to Parallel Programming Assignments

  17. EduHPC23 – Panel Q&A: Lightning Talks

  18. The Wide Area Classroom: 24,000 HPC Students and Growing

  19. Faculty Development Workshops for Integrating PDC in Early Undergraduate Curricula: An Experience Report

  20. An NSF REU Site Based on Trust and Reproducibility of Intelligent Computation: Experience Report

  21. Performance Engineering for Graduate Students: a View from Amsterdam

  22. EduHPC23 – Panel Q&A: Paper Session II

  23. 1D Heat Equation in Chapel

  24. Program Your Favorite Data Science Pipeline in Spark

  25. Parallelizing a 1-Dim Nagel-Schreckenberg Traffic Model

  26. Using MPI For Distributed Hyper-Parameter Optimization and Uncertainty Evaluation

  27. k-Nearest Neighboor with Map Reduce MPI

  28. K-means Clustering: An Assignment for OpenMP, MPI, and CUDA/OpenCL

  29. EduHPC23 – Panel Q&A: Peachy Assignments

  30. CDER Announcements and Closing


Back to Workshop Archive Listing


Second International Workshop on RISC-V for HPC


  1. Introduction and Welcome

  2. RISC-V Everywhere

  3. Lightning Vendor Talk: Esperanto Technologies ET-SoC for AI and ML Workloads

  4. Lightning Vendor Talk: The InspireSemi next gen Thunderbird compute accelerator for HPC, AI, and graph analytics

  5. Lightning Vendor Talk: SG2042 Empowering RISC-V in High-Performance Computing

  6. Lightning Vendor Talk: E4 Experience with RISC-V in HPC

  7. RISC-V for HPC – Afternoon Break

  8. An Empirical Comparison of the RISC-V and AArch64 Instruction Sets

  9. Evaluating HPX and Kokkos on RISC-V Using an Astrophysics Application Octo-Tiger

  10. Is RISC-V Ready for HPC Prime-Time: Evaluating the 64-Core Sophon SG2042 RISC-V CPU

  11. Short Reasons for Long Vectors in HPC CPUs: A Study Based on RISC-V

  12. Automatic Generation of Micro-Kernels for Performance Portability of Matrix Multiplication on RISC-V Vector Processors

  13. Challenges and Opportunities in the Co-Design of Convolutions and RISC-V Vector Processors

  14. Second International Workshop on RISC-V for HPC


Back to Workshop Archive Listing


Workshop on Memory Technologies, Systems, and Applications


  1. Workshop on Memory Technologies, Systems, and Applications

  2. Keynote

  3. MTSA – Afternoon Break

  4. Accelerating In Situ Analysis Using Non-volatile Memory

  5. CXL Memory as Persistent Memory for Disaggregated HPC: A Practical Approach

  6. GPU Graph Processing on CXL-Based Microsecond-Latency External Memory

  7. Dynamic Memory Provisioning on Disaggregated HPC Systems


Back to Workshop Archive Listing


ExaMPI: Workshop on Exascale MPI


  1. Welcome and Introduction

  2. Distinguished Speaker: GPU Centric Communication – Is MPI Missing Out?

  3. ExaMPI – Afternoon Break

  4. Optimizing Irregular Communication with Neighborhood Collectives and Locality-Aware Parallelism

  5. Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical Architectures

  6. Embedding Rust within Open MPI

  7. A Statistical Analysis of HPC Network Tuning

  8. OpenSHMEM Queues: An Abstraction for Enhancing Message Rate, Bandwidth Utilization, and Reducing Tail Latency in OpenSHMEM Applications

  9. Efficient Data Redistribution for Malleable Applications

  10. ExaMPI: Workshop on Exascale MPI


Back to Workshop Archive Listing


Tenth SC Workshop on Best Practices for HPC Training and Education


  1. Tenth SC Workshop on Best Practices for HPC Training and Education

  2. Emerging Technologies and HPC Education, Outreach, and Training

  3. Expanding Horizons: Advancing HPC Education in Colombia through CyberColombia's Summer Schools

  4. The BEAST LAB: A Practical Course on Experimental Evaluation of Diverse Modern HPC Architectures and Accelerators

  5. Using Unity for Scientific Visualization as a Course-Based Undergraduate Research Experience

  6. Best Practices for HPC Training and Education – Afternoon Break

  7. Scaling HPC Education

  8. Intro to HPC Bootcamp: Engaging New Communities through Energy Justice Projects

  9. Data Analytics Program in Community Colleges in Preparation for STEM and HPC Careers

  10. The Code-a-Thon, Improving Student Engagement through Community Coding

  11. Let’s Get Our Heads Out of the Clouds (A Scalable and Sustainable Approach to HPC Training Labs for Resource Constrained Environments and Anyone Else Stuck in the Clouds)

  12. Q&A and Discussion

  13. Bridging the Quantum Gap: Addressing Challenges in Training Individuals in Quantum Computing Using Self-Guided Learning Resources

  14. HPC Carpentry – A Scalable, Peer-Reviewed Training Pprogram to Democratize HPC Access

  15. Understanding Community Perspectives on HPC Skills and Training Pathways

  16. Cross-Institutional Research Engagement Network (CIREN): Initial Project Goals and Objectives in Support of Training, Mentoring, and Research Facilitation

  17. Exascale and Beyond – Required Competences for the Computational Scientists

  18. Q&A and Discussion


Back to Workshop Archive Listing


Workshop on Machine Learning with Graphs in High Performance Computing Environments


  1. Welcome Machine Learning with Graphs in High Performance Computing Environment

  2. Invited talk: Practical Machine Learning on Biological Knowledge Graphs

  3. MLG-HPCE – Afternoon Break

  4. Addressing Stale Gradients in Scalable Federated Deep Reinforcement Learning

  5. An Efficient Distributed Graph Engine for Deep Learning on Graphs

  6. HPC-GPT: Integrating Large Language Model for High-Performance Computing

  7. DDStore: Distributed Data Store for Scalable Training of Graph Neural Networks on Large Atomistic Modeling Datasets

  8. An Analysis of Graph Neural Network Memory Access Patterns


Back to Workshop Archive Listing


Digital Twins: Practices and Principles for High Performance Computing


  1. Digital Twins: Practices and Principles for High Performance Computing


Back to Workshop Archive Listing


Future Is Sparse: Methods and Tools for Sparse Computations


  1. Welcome & Introduction by SparCity

  2. Dynamic Data Structures on the GPU

  3. Consider studying all-pairs shortest paths

  4. Coffee Break

  5. Tensor cores for matrix multiplication are on the rise - is there any hope for sparse operations?

  6. The Future of Machine Learning is Sparse

  7. Future is Sparse Panel


Back to Workshop Archive Listing


Ninth International Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC 2023)


  1. Opening Remarks

  2. Invited Talk

  3. Chameleon: A Disaggregated CPU, GPU, and FPGA System for Retrieval-Augmented Language Models

  4. Enabling Communication with FPGA-Based Network-Attached Accelerators for HPC Workloads

  5. H2RC'23 – Morning Break

  6. Tydi-lang: A Language for Typed Streaming Hardware

  7. Altis-SYCL: Migrating Altis Benchmarking Suite from CUDA to SYCL for GPUs and FPGAs

  8. OctoRay: Framework for Scalable FPGA Cluster Acceleration of Python Big Data Applications

  9. Stencil-HMLS: A Multi-Layered Approach to the Automatic Optimization of Stencil Codes on FPGA

  10. Closing Remarks


Back to Workshop Archive Listing


Fourth Workshop on Heterogeneous Memory Systems (HMEM)


  1. Fourth Workshop on Heterogeneous Memory Systems (HMEM)

  2. HMEM – Welcome

  3. Keynote: Empowering Large AI Models Based on Heterogeneous Memory

  4. Persistent Snapshot Isolation with Unlimited Reads on Commodity Hardware Transactional Memory

  5. HMEM – Morning Break

  6. DAOS Beyond Persistent Memory: Architecture and Initial Performance Results

  7. CachedArrays: API and Framework to Optimize Data Movement for Heterogeneous Memory Systems

  8. Evaluating the Latest Optane Memory: A Glorious Swansong?


Back to Workshop Archive Listing


3rd International Workshop on RESource DISaggregation in High Performance Computing (RESDIS)


  1. Welcome and Introduction

  2. Keynote

  3. Sunfish: An Open Centralized Composable HPC Management Framework

  4. Morning Break

  5. RISA: Round-Robin Intra-Rack Friendly Scheduling Algorithm for Disaggregated Datacenters

  6. Resource Disaggregation in Practice – Industry Session

  7. Panel Discussion


Back to Workshop Archive Listing