Accelerating Actor-Based Distributed Triangle Counting.
Aniruddha Mysore (Georgia Institute of Technology), Kaushik Ravichandran (Georgia Institute of Technology), Youssef Elmougy (Georgia Institute of Technology), Akihiro Hayashi (Georgia Institute of Technology), Vivek Sarkar (Georgia Institute of Technology)
Balancing Latency and Throughput of Distributed Inference by Interleaved Parallelism.
Jiangsu Du (Sun Yat-Sen University), Jinhui Wei (Sun Yat-Sen University), Jiazhi Jiang (Sun Yat-Sen University), Shenggan Cheng (National University of Singapore), Zhiguang Chen (Sun Yat-Sen University), Dan Huang (Sun Yat-Sen University), Yutong Lu (Sun Yat-Sen University)
Characterizing GPU Effectiveness on NRP for IceCube fp32 Compute.
David Schultz (University of Wisconsin, Madison), Igor Sfiligoi (University of California, San Diego (UCSD)), Benedikt Riedel (University of Wisconsin, Madison), Frank Würthwein (University of California, San Diego (UCSD))
Delivering Digital Skills Across the Digital Divide: Creating an Accessible On-Demand Self-Paced HPC Virtual Training Lab.
Bryan Johnston (Council for Scientific and Industrial Research (CSIR), South Africa; ACE Lab, Centre for High Performance Computing, South Africa), Lara Timm (Council for Scientific and Industrial Research (CSIR), South Africa; ACE Lab, Centre for High Performance Computing, South Africa), Mabatho Hashatsi (Council for Scientific and Industrial Research (CSIR), South Africa; ACE Lab, Centre for High Performance Computing, South Africa)
An Early Case Study with Multi-Tenancy Support in SPDK’s NVMe-over-Fabric Designs.
Darren Ng (University of California, Merced), Charles Parkinson (University of California, Merced), Andrew Lin (University of California, Merced), Arjun Kashyap (University of California, Merced), Xiaoyi Lu (University of California, Merced)
EE-HPC – A Framework for Energy Efficient HPC System Operation.
Jan Eitzinger (Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen National High-Performance Computing Center (NHR@FAU)), Thomas Gruber (Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen National High-Performance Computing Center (NHR@FAU))
Exploring Green Cryptographic Hashing Algorithms for Eco-Friendly Blockchains.
Aahad Abubaker (DePaul University), Tanmay Anand (Illinois Institute of Technology), Sonal Gaikwad (Illinois Institute of Technology), Mahad Haider (Illinois Institute of Technology), Jacklyn McAninch (Illinois Institute of Technology), Lan Nguyen (Illinois Institute of Technology), Alexandru Orhean (DePaul University), Ioan Raicu (Illinois Institute of Technology)
Exploring Julia as a Unifying End-to-End Workflow Language for HPC on Frontier.
William F. Godoy (Oak Ridge National Laboratory (ORNL)), Pedro Valero-Lara (Oak Ridge National Laboratory (ORNL)), Caira Anderson (Oak Ridge National Laboratory (ORNL); Cornell University), Katrina W. Lee (Oak Ridge National Laboratory (ORNL); University of Texas, Dallas), Ana Gainaru (Oak Ridge National Laboratory (ORNL)), Rafael Ferreira da Silva (Oak Ridge National Laboratory (ORNL)), Jeffrey S. Vetter (Oak Ridge National Laboratory (ORNL))
Exploring the Impacts of Multiple I/O Metrics in Identifying I/O Bottlenecks.
Izzet Yildirim (Illinois Institute of Technology), Hariharan Devarajan (Lawrence Livermore National Laboratory), Anthony Kougkas (Illinois Institute of Technology), Xian-He Sun (Illinois Institute of Technology), Kathryn Mohror (Lawrence Livermore National Laboratory)
Exploring Userspace Memory Mapping for RDMA-Enabled Network-Attached Memory.
Jacob Wahlgren (KTH Royal Institute of Technology, Sweden), Jennifer Faj (KTH Royal Institute of Technology, Sweden), Eric Green (Lawrence Livermore National Laboratory), Maya Gokhale (Lawrence Livermore National Laboratory), Ivy Peng (KTH Royal Institute of Technology, Sweden)
GPU-Accelerated Dense Covariance Matrix Generation for Spatial Statistics Applications.
Zipei Geng (King Abdullah University of Science and Technology (KAUST)), Sameh Abdulah (King Abdullah University of Science and Technology (KAUST)), Hatem Ltaief (King Abdullah University of Science and Technology (KAUST)), Ying Sun (King Abdullah University of Science and Technology (KAUST)), Marc Genton (King Abdullah University of Science and Technology (KAUST)), David Keyes (King Abdullah University of Science and Technology (KAUST))
High-Performance PMEM-Aware Collective I/Os.
Keegan Sanchez (Washington State University, Vancouver), Alex Gavin (Washington State University, Vancouver), Suren Byna (Ohio State University), Kesheng Wu (Lawrence Berkeley National Laboratory (LBNL)), Xuechen Zhang (Washington State University, Vancouver)
The Impact of Process Topology on RMA Programming Models: A Study on NERSC Perlmutter.
Nikodemos Koutsoheras (Pacific Northwest National Laboratory (PNNL)), Sayan Ghosh (University of Maryland), Nathan Tallent (Pacific Northwest National Laboratory (PNNL)), Joshua Suetterlein (Pacific Northwest National Laboratory (PNNL)), Abhinav Bhatele (University of Maryland)
Improving Memory Interfacing in HLS-Generated Accelerators with Custom Caches.
Claudio Barone (Pacific Northwest National Laboratory (PNNL)), Giovanni Gozzi (Politecnico di Milano), Michele Fiorito (Politecnico di Milano), Ankur Limaye (Pacific Northwest National Laboratory (PNNL)), Antonino Tumeo (Pacific Northwest National Laboratory (PNNL)), Fabrizio Ferrandi (Politecnico di Milano)
Investigating Anomalies in Compute Clusters: An Unsupervised Learning Approach.
Yiyang Lu (College of William & Mary), Jie Ren (College of William & Mary), Yasir Alanazi (Thomas Jefferson National Accelerator Facility), Ahmed Mohammed (Thomas Jefferson National Accelerator Facility), Diana McSpadden (Thomas Jefferson National Accelerator Facility), Laura Hild (Thomas Jefferson National Accelerator Facility), Mark Jones (Thomas Jefferson National Accelerator Facility), Wesley Moore (Thomas Jefferson National Accelerator Facility), Malachi Schram (Thomas Jefferson National Accelerator Facility), Bryan Hess (Thomas Jefferson National Accelerator Facility), Evgenia Smirni (College of William & Mary)
Modeling Parallel Programs Using Large Language Models.
Daniel Nichols (University of Maryland), Aniruddha Marathe (Lawrence Livermore National Laboratory), Harshitha Menon (Lawrence Livermore National Laboratory), Todd Gamblin (Lawrence Livermore National Laboratory), Abhinav Bhatele (University of Maryland)
MPI Performance Analysis in Vlasiator: Unraveling Communication Bottlenecks.
Jennifer Faj (KTH Royal Institute of Technology, Sweden), Jeremy J. Williams (KTH Royal Institute of Technology, Sweden), Ivy B. Peng (KTH Royal Institute of Technology, Sweden), Urs Ganse (University of Helsinki), Markus Battarbee (University of Helsinki), Yann Pfau-Kempf (University of Helsinki), Leo Kotipalo (University of Helsinki), Minna Palmroth (University of Helsinki), Stefano Markidis (KTH Royal Institute of Technology, Sweden)
Optimizing Workflow Performance by Elucidating Semantic Data Flow.
Meng Tang (Illinois Institute of Technology), Nathan R. Tallent (Pacific Northwest National Laboratory (PNNL)), Anthony Kougkas (Illinois Institute of Technology), Xian-He Sun (Illinois Institute of Technology)
PanSim: A Performance-Portable Agent Based Model.
Istvan Z. Reguly (Pázmány Péter Catholic University, Hungary), Bence Keömley-Horváth (Pázmány Péter Catholic University, Hungary), Gábor Szederkényi (Pázmány Péter Catholic University, Hungary), Attila Csikász-Nagy (Pázmány Péter Catholic University, Hungary)
ParLeiden: Boosting Parallelism of Distributed Leiden Algorithm on Large-Scale Graphs.
Yongmin Hu (Douyin Vision Co., Ltd), Jing Wang (Shanghai Jiao Tong University), Cheng Zhao (Douyin Vision Co., Ltd), Yibo Liu (Shanghai Jiao Tong University), Cheng Chen (Douyin Vision Co., Ltd), Xiaoliang Cong (Douyin Vision Co., Ltd), Chao Li (Shanghai Jiao Tong University)
Performant Low-Order Matrix-Free Finite Element Kernels on GPU Architectures.
Randolph Settgast (Lawrence Livermore National Laboratory), William Tobin (Lawrence Livermore National Laboratory), Nicola Castelletto (Lawrence Livermore National Laboratory), Yohann Dudouit (Lawrence Livermore National Laboratory), Sergey Klevtsov (Stanford University), Ben Corbett (Lawrence Livermore National Laboratory)
Pipit: Simplifying Analysis of Parallel Execution Traces.
Alexander Movsesyan (University of Maryland), Rakrish Dhakal (University of Maryland), Aditya Ranjan (University of Maryland), Jordan Marry (University of Maryland), Onur Cankur (University of Maryland), Abhinav Bhatele (University of Maryland)
Preserving Data Locality in Multidimensional Variational Quantum Classification.
Mingyoung Jeng (University of Kansas), Alvir Nobel (University of Kansas), Vinayak Jha (University of Kansas), David Levy (University of Kansas), Dylan Kneidel (University of Kansas), Manu Chaudhary (University of Kansas), Ishraq Islam (University of Kansas), Esam El-Araby (University of Kansas)
Quantum Task Offloading with the OpenMP API.
Joseph K. L. Lee (Edinburgh Parallel Computing Centre (EPCC)), Martin Ruefenacht (Leibniz Supercomputing Centre), Johannes Doerfert (Lawrence Livermore National Laboratory), Oliver Thomson Brown (Edinburgh Parallel Computing Centre (EPCC)), Mark Bull (Edinburgh Parallel Computing Centre (EPCC)), Michael Klemm (AMD Research), Martin Schulz (Technical University of Munich)
Real-Time Change Point Detection in Molecular Dynamics Streaming Data.
Vijayalakshmi Saravanan (University of South Dakota), Shinjae Yoo (Brookhaven National Laboratory), Hubertus Van Dam (Brookhaven National Laboratory), Christopher Kelly (Brookhaven National Laboratory), Thomas Flynn (Brookhaven National Laboratory), Perry Siehien (Brookhaven National Laboratory), Kalyan Muppudojo (Brookhaven National Laboratory), Aniket Kumar Ramesh (Brookhaven National Laboratory)
SCALABLE – Scalable Lattice Boltzmann Leaps to Exascale.
Jayesh Badwaik (Jülich Supercomputing Center), Lubomír Říha (IT4Innovations National Supercomputing Center, VŠB – Technical University of Ostrava), Radim Vavřík (IT4Innovations National Supercomputing Center, VŠB – Technical University of Ostrava), Ondřej Vysocký (IT4Innovations National Supercomputing Center, VŠB – Technical University of Ostrava), Kristian Kadlubiak (IT4Innovations National Supercomputing Center, VŠB – Technical University of Ostrava), Gabriel Staffelbach (CERFACS, France), Markus Holzer (CERFACS, France; Friedrich-Alexander-University of Erlangen-Nuremberg), Philipp Suffa (Friedrich-Alexander-University of Erlangen-Nuremberg), Romain Cuidard (CS GROUP), Denis Ricot (CS GROUP)
Scalable Algorithms for Analyzing Large Dynamic Networks Using CANDY.
Aashish Pandey (University of North Texas), Arindam Khanda (Missouri University of Science and Technology), Sriram Srinivasan (University of Oregon), Sudharshan Srinivasan (University of Oregon), S. M. Shovan (Missouri University of Science and Technology), Farahnaz Hosseini (University of North Texas), Sajal Das (Missouri University of Science and Technology), Boyana Norris (University of Oregon), Sanjukta Bhowmick (University of North Texas)
Scalable Reduced-Order Modeling for Three-Dimensional Turbulent Flow.
Kazuto Ando (RIKEN Center for Computational Science, Kobe University), Rahul Bale (RIKEN Center for Computational Science, Kobe University), Akiyoshi Kuroda (RIKEN Center for Computational Science), Makoto Tsubokura (RIKEN Center for Computational Science, Kobe University)
Scaling K-Path Centrality Using Optimized Distributed Data Structure.
Lance Fletcher (Texas A&M University, Lawrence Livermore National Laboratory), Trevor Steil (Lawrence Livermore National Laboratory), Roger Pearce (Lawrence Livermore National Laboratory, Texas A&M University)
Temporal Classification of Allocations for Reduced Memory Usage.
Kristi Belcher (Lawrence Livermore National Laboratory), David Beckingsale (Lawrence Livermore National Laboratory), Sam Schwartz (University of Oregon), Marty McFadden (Lawrence Livermore National Laboratory)
That's Right – The Same C++ STL Asynchronous Parallel Code Runs on CPUs and GPUs.
Muhammad Haseeb (Lawrence Berkeley National Laboratory (LBNL), National Energy Research Scientific Computing Center (NERSC)), Weile Wei (Lawrence Berkeley National Laboratory (LBNL), National Energy Research Scientific Computing Center (NERSC)), Jack Deslippe (Lawrence Berkeley National Laboratory (LBNL), National Energy Research Scientific Computing Center (NERSC)), Brandon Cook (Lawrence Berkeley National Laboratory (LBNL), National Energy Research Scientific Computing Center (NERSC))
Towards Enabling Digital Twins Capabilities for a Cloud Chamber.
Jiaqi Yang (Emory University), Mohammad Atif (Brookhaven National Laboratory), Vanessa Lopez-Marrero (Brookhaven National Laboratory), Tao Zhang (Brookhaven National Laboratory), Kwang Min Yu (Brookhaven National Laboratory), Meifeng Lin (Brookhaven National Laboratory), Lingda Li (Brookhaven National Laboratory), Fan Yang (Brookhaven National Laboratory), Yangang Liu (Brookhaven National Laboratory), Abdullahalmut Sharfuddin (Stony Brook University), Foluso Ladeinde (Stony Brook University)
Unleashing CGRA Potential for HPC.
Boma Adhi (RIKEN Center for Computational Science (R-CCS)), Emanuele Del Sozzo (RIKEN Center for Computational Science (R-CCS)), Carlos Cortes (RIKEN Center for Computational Science (R-CCS)), Xinyuan Wang (University of Toronto, RIKEN Center for Computational Science (R-CCS)), Tomohiro Ueno (RIKEN Center for Computational Science (R-CCS)), Kentaro Sano (RIKEN Center for Computational Science (R-CCS))