2022 IEEE High Performance Extreme Computing
Virtual Conference

19 - 23 September 2022

IEEE HPEC Preliminary Agenda 2022

All times are EDT (UTC/GMT -04 hours)

Day | Monday | Tuesday | Wednesday | Thursday | Friday
10:30-11:00am | Session 1-V: Sponsor Showcase – Lenovo | Session 2-V: Keynote | Session 3-V | Session 4-V | Session 5-V: Sponsor Showcase – Dell
11:00am-12:15pm | Session 1-1: Advanced Multicore Software Technologies | Session 2-1: Graph Analytics & Network Science 1 | Session 3-1: AI / Machine Learning 1 | Session 4-1: ASIC and FPGA Advances 1 | Session 5-1: High Performance Data Analysis 1
12:15-12:30pm | Break; Poster Session 1-P (12:15-15:45) | Break; Poster Session 2-P (12:15-15:45) | Break; Tutorial Session 3-T (12:15-15:45): SPIRAL Tutorial | Break; Tutorial Session 4-T (12:15-15:45): Exploring Graph Analysis for HPC with Near-Memory Accelerators | Break
12:30-1:45pm | Session 1-2: Cloud HPEC | Session 2-2: Graph Analytics & Network Science 2 | Session 3-2: AI / Machine Learning 2 | Session 4-2: ASIC and FPGA Advances 2 | Session 5-2: High Performance Data Analysis 2
1:45-2:15pm | Break | Break | Break | Break | Break
2:15-3:30pm | Session 1-3: Quantum and Non-Deterministic Computing | Session 2-3: Data Intensive Computing | Session 3-3: AI / Machine Learning 3 | Session 4-3: ASIC and FPGA Advances 3 | Session 5-3: Big Data and Distributed Computing 1
3:30-3:45pm | Break | Break | Break | Break | Break
3:45-5:00pm | Session 1-4: BRAIDS – Boosting Resilience through Artificial Intelligence and Decision Support | Session 2-4: Scaling HPC Education | Session 3-4: General Purpose GPU Computing 1 | Session 4-4: General Purpose GPU Computing 2 | Session 5-4: Big Data and Distributed Computing 2
5:00-5:30pm | Break | Break | Break | Break | Break
5:30-7:30pm | Session 1-S1: Sky Computing – Toward Efficient Computing on the Cloud | Session 2-S1: GraphBLAS BoF | Session 3-S1: AI Challenges; Session 3-S2: Emerging Technologies | Session 4-S1: Graph Challenge |

Monday, September 19

1-V: Sponsor Showcase – Lenovo Session (10:30-11:00)

Co-Chairs: Albert Reuther

TBD
Scott Tease (Lenovo)

1-1: Advanced Multicore Software Technologies Session (11:00-12:15)

Co-Chairs: Franz Franchetti & Brian Sroka

Performance speedup of Quantum Espresso using optimized AOCL-FFTW
S. Biplab Raut (AMD)
Task-Parallel Programming with Constrained Parallelism
Tsung-Wei Huang (University of Utah); Leslie Hwang (Synopsys)
HashTag: Fast Lookup in a Persistent Memory Filesystem
Matthew Curtis-Maury; Yash Trivedi (NetApp)
Computing In-Place FFTs with SIMD Lane Slicing
Benoît Dupont de Dinechin (Kalray)
Applying the Midas Touch of Reproducibility to High-Performance Computing
Austin Minor; Wu-chun Feng (Virginia Tech)

Poster Session: 1-P (12:15-15:45): Poster Session 1

Chair(s)/Host(s): Yehia Arafa & TBD

Resource-Constrained Optimizations For Synthetic Aperture Radar On-Board Image Processing [Outstanding Paper Award]
Maron Schlemon (German Aerospace Center); Martin Schulz (TU Munich); Rolf Scheiber (German Aerospace Center)
Enhancing the Performance Portability of Heterogeneous Circuit Analysis Programs
Tsung-Wei Huang (University of Utah)
Kv2vec: A Distributed Representation Method for Key-value Pairs from Metadata Attributes
Chenxu Niu; Wei Zhang (Texas Tech Univ.); Suren Byna (LBNL); Yong Chen (Texas Tech Univ.)
Unsupervised Adaptation of Spiking Networks in a Gradual Changing Environment
Zaidao Mei (Syracuse Univ.); Mark Barnell (Air Force Research Laboratory); Qinru Qiu (Syracuse Univ.)
Predicting Ankle Moment Trajectory with Adaptive Weighted Ensemble of LSTM Networks
Emilia A Grzesiak; Ho Chit Siu; Jennifer Sloboda (MIT Lincoln Laboratory)
Interval Arithmetic-based FFT for Large Integer Multiplication
Zibo Gong; Nathan Zhu; Matt Ngaw (Carnegie Mellon Univ.); Joao Rivera (ETH Zurich); Larry Tang; Eric Tang; Het Mankad; Franz Franchetti (Carnegie Mellon Univ.)
Machine Learning for Accurate and Fast Bandgap Prediction of Solid-State Materials
Shomik Verma; Shivam Kajale; Rafael Gomez-Bombarelli (MIT)
Systolic Array based FPGA accelerator for Yolov3-tiny
Prithvi Velicheti; Sivani Pentapati; Suresh Purini (IIIT Hyderabad)
Epigenetics and Transcriptomics Quality Control Pipelines in a HPC Environment
Darrell O Ricke (MIT Lincoln Laboratory); Derek Ng (Northeastern Univ.); Philip Fremont-Smith; Adam Michaleas; Rafael Jaimes (MIT Lincoln Laboratory)

1-2: Cloud HPEC Session (12:30-13:45)

Co-Chairs: Brian Sroka & Laura Brattain

Invited Talk: HPC Matters! How Supercomputing Supports NASA’s Mission
Dr. Piyush Mehrotra (NASA)
Scalable Interactive Autonomous Navigation Simulations on HPC
Wesley Brewer; Joel Bretheim (HPCMP PET/GDIT); John Kaniarz (DEVCOM Ground Vehicle Systems Center); Peilin Song; Burhman Gates (Engineer Research & Development Center)
Parallelizing Explicit and Implicit Extrapolation Methods for Ordinary Differential Equations
Utkarsh (IIT Kanpur); Chris Elrod; Yingbo Ma; Christopher Rackauckas (Julia Computing)
SuperCloud Lite in the Cloud – Lightweight, Secure, Self-Service, On-Demand Mechanisms for Creating Customizable Research Computing Environments
Kelsie Edie (US Military Academy); Kurt Keville; Lauren Milechin; Chris N Hill (MIT)
Site-Wide HPC Data Center Demand Response
Daniel C Wilson; Ioannis Paschalidis; Ayse K. Coskun (Boston Univ.)

1-3: Quantum and Non-Deterministic Computing Session (14:15-15:30)

Co-Chairs: Patrick Dreher & Donato Kava

C2QA – Bosonic Qiskit [Outstanding Paper Award]
Timothy Stavenger (PNNL); Eleanor Crane (JQI, QuICS); Kevin Smith (Brookhaven National Laboratory, Yale Univ.); Christopher T Kang (Univ. of Washington); Steven Girvin (Yale Univ.); Nathan Wiebe (Univ. of Toronto, PNNL)
Constructing Optimal Contraction Trees for Tensor Network Quantum Circuit Simulation [Outstanding Student Paper Award]
Cameron A Ibrahim (Univ. of Delaware); Danylo Lykov (Argonne National Laboratory); Zichang He (UC Santa Barbara); Yuri Alexeev (Argonne National Laboratory); Ilya Safro (Univ. of Delaware)
Quantum Netlist Compiler (QNC)
Shamminuj Aktar; Abdel-Hameed A. Badawy (New Mexico State Univ.); Nandakishore Santhi (Los Alamos National Laboratory)
Hardware Design and Implementation of Classic McEliece Post-Quantum Cryptosystem Based on FPGA
Shaofen Chen; Haiyan Lin; Wenjin Huang; Yihua Huang (Sun Yat-sen Univ.)
Hardware Design and Implementation of Post-Quantum Cryptography Kyber
Qingru Zeng; Quanxin Li; Baoze Zhao; Han Jiao; Yihua Huang (Sun Yat-sen Univ.)

1-4: BRAIDS – Boosting Resilience through Artificial Intelligence and Decision Support Session (15:45-17:00)

Co-Chairs: Courtland VanDam & Sandeep Pisharody

Invited Talk: Welcome to CyberWar: Long Term Ramifications Unleashed by Russia’s War
Barry Greene (Akamai)
Distributed Hardware Accelerated Secure Joint Computation on the COPA Framework [Outstanding Student Paper Award]
Rushi Patel; Pouya Haghi (Boston Univ.); Shweta Jain; Andriy Kot; Venkata Krishnan (Intel); Mayank Varia; Martin Herbordt (Boston Univ.)
Large Scale Enrichment and Statistical Cyber Characterization of Network Traffic
Ivan Kawaminami; Arminda Estrada; Youssef Elsakkary (Univ. of Arizona); Hayden Jananthan (MIT LLSC); Aydin Buluc (LBNL); Tim Davis (Texas A&M Univ.); Daniel Grant (GreyNoise); Michael Jones (MIT LLSC); Chad Meiners (MIT Lincoln Laboratory); Andrew Morris (GreyNoise); Sandeep Pisharody (MIT Lincoln Laboratory); Jeremy Kepner (MIT LLSC)
Edge Computing Security for a Multi-Agent System
Alice Lee; Karen Gettings; Matthias Beebe; Paul Monticciolo; Michael Vai (MIT Lincoln Laboratory)
Invited Talk: Proposed Empirical Assessment of Remote Workers’ Cyberslacking and Computer Security Posture to Assess Organizational Cybersecurity Risks
Ariel Luna; Yair Levy; Greg Simco; Wei Li (Nova Southeastern University)

1-S1: Sky Computing – Toward Efficient Computing on the Cloud Special (17:30-19:30)

Organizers: Marco Montes de Oca, Luna Xu, Erica Lin, Suraj Bramhavar, Jeffrey Chou (Sync Computing)

Deep Dive in Uber’s Remote Shuffle Service for Apache Spark
Bo Yang (Stealth Startup)
Taming High-Performance Computing Platform Heterogeneity with Machine Learning
Prasanna Balaprakash (Argonne National Laboratory)
Optimizing Heterogeneous Computing Resources Based Only on Cost and Time
Suraj Bramhavar (Sync Computing)
AI-Powered Acceleration of Deep Learning Inference on the Cloud
Glenn Ko (Stochastic)
Cost-Effective Batch Scheduling in the Cloud
Chaoran Yu (Apple)

Tuesday, September 20

2-V: Keynote Session (10:30-11:00)

Co-Chairs: Jeremy Kepner & Albert Reuther

Reflections on a Career in Computer Science
Prof. Barbara Liskov (MIT CSAIL)

2-1: Graph Analytics & Network Science 1 Session (11:00-12:15)

Co-Chairs: John Gilbert & Chris Long

Invited Talk: The NSF Computing and Information Science and Engineering Landscape: A Look Forward
Dr. Almadena Chtchelkanova (NSF)
GraphBLAS on the Edge: Anonymized High Performance Streaming of Network Traffic [Outstanding Paper Award]
Michael S Jones; Jeremy Kepner (MIT LLSC); Daniel Andersen (CAIDA); Aydın Buluç (LBNL); Chansup Byun (MIT LLSC); K Claffy (CAIDA); Timothy Davis (Texas A&M); William Arcand (MIT LLSC); Jonathan Bernays (MIT Lincoln Laboratory); David Bestor; William Bergeron; Vijay Gadepally; Michael Houle; Matthew Hubbell; Hayden Jananthan; Anna Klein (MIT LLSC); Chad Meiners (MIT Lincoln Laboratory); Lauren Milechin (MIT); Julie Mullen (MIT LLSC); Sandeep Pisharody (MIT Lincoln Laboratory); Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi (MIT LLSC); Jon Sreekanth (Accolade Technology); Doug Stetson (MIT Lincoln Laboratory); Charles Yee; Peter Michaleas (MIT LLSC)
Analyzing Multi-trillion Edge Graphs on Large GPU Clusters: A Case Study with PageRank [Outstanding Paper Award]
Seunghwa Kang; Joseph Nke; Brad Rees (NVIDIA)
Achieving Speedups for Distributed Graph Biconnectivity
Ian Bogle; George M. Slota (RPI)
Generating Permutations Using Hash Tables
Oded Green; Corey Nolet; Joe Eaton (NVIDIA)

Poster Session: 2-P (12:15-15:45): Poster Session 2

Chair(s)/Host(s): Yehia Arafa & TBD

ProtoX: A First Look
Het Mankad; Sanil Rao (Carnegie Mellon Univ.); Phillip Colella; Brian Van Straalen (Lawrence Berkeley National Laboratory); Franz Franchetti (Carnegie Mellon Univ.)
Magic Memory: A Programming Model For Big Data Analytics
Eric Tang; Franz Franchetti (Carnegie Mellon Univ.)
Approximating Manifolds and Geodesics with Curved Surfaces
Peter Oostema; Franz Franchetti (Carnegie Mellon Univ.)
Network Automation in Lab Deployment Using Ansible and Python
V Andal Priyadharshini; Anumalasetty Yashwanth Nath (SRM Institute of Science and Technology)
Optimizations to Increase JDBC Driver Performance in Spark
Deeptaanshu Kumar (Carnegie Mellon Univ.); Suxi Li (Univ. of Miami)
Arachne: An Arkouda Package for Large-Scale Graph Analytics
Oliver A Alvarado Rodriguez; Zhihui Du; Joseph T Patchett; Fuhuan Li; David Bader (New Jersey Inst. of Tech.)
Triangle Centrality in Arkouda
Joseph T Patchett; Zhihui Du; Fuhuan Li; David Bader (New Jersey Inst. of Tech.)
Image Recognition Using Machine Learning For Forbidden Items Detection In Airports
Alaa Atef; Abde-ljalil Naser; Mahmoud Mohamed; Mariam Safwat; Menna Tulla Ayman; Mohamed Mostafa; Salma Hesham (Ain Shams University); Khaled Salah (Siemens)

2-2: Graph Analytics & Network Science 2 Session (12:30-13:45)

Co-Chairs: John Gilbert & Chris Long

Hypersparse Network Flow Analysis of Packets with GraphBLAS
Tyler Trigg; Chad Meiners; Sandeep Pisharody (MIT Lincoln Laboratory); Hayden Jananthan; Michael Jones (MIT LLSC); Adam Michaleas (MIT Lincoln Laboratory); Timothy Davis (Texas A&M Univ.); Erik Welch (NVIDIA); William Arcand; David Bestor; William Bergeron; Chansup Byun; Vijay Gadepally; Michael Houle; Matthew Hubbell; Anna Klein; Peter Michaleas (MIT LLSC); Lauren Milechin (MIT); Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi (MIT LLSC); Doug Stetson (MIT Lincoln Laboratory); Charles Yee; Jeremy Kepner (MIT LLSC)
SHARP: Software Hint-Assisted Memory Access Prediction for Graph Analytics
Pengmiao Zhang (Univ. of Southern California); Rajgopal Kannan (US Army Research Lab-West); Xiangzhi Tong (Xi’an Jiaotong-Liverpool Univ.); Anant V Nori (Intel Labs); Viktor K Prasanna (Univ. of Southern California)
Fast Graph Algorithms for Superpixel Segmentation
Dimitris Floros (Aristotle Univ. of Thessaloniki); Tiancheng Liu (Duke Univ.); Nikos P Pitsianis (Aristotle Univ. of Thessaloniki, Duke Univ.); Xiaobai Sun (Duke Univ.)
Explicit Ordering Refinement for Accelerating Irregular Graph Analysis
Michael Mandulak; Ruochen Hu; George M Slota (RPI)
Efficient Calculation of Triangle Centrality in Big Data Networks
Wali Mohammad Abdullah; David Awosoga; Shahadat Hossain (Univ. of Lethbridge)

2-3: Data Intensive Computing Session (14:15-15:30)

Co-Chairs: Xiaobai Sun & Nikos Pitsianis

Enabling Novel In-Memory Computation Algorithms to Address Next-Generation Throughput Constraints on SWaP-Limited Platforms
Jessica M Ray; Chad Meiners (MIT Lincoln Laboratory)
Towards Fast Crash-Consistent Cluster Checkpointing
Andrew E Wood (Boston Univ.); Moshik Hershcovitch (IBM Research); Ilias Ennmouri (IBM); Weiyu Zong; Saurav Chennuri (Boston Univ.); Sarel Cohen (The Academic College of Tel Aviv-Yaffo); Swaminathan Sundararaman (IBM); Daniel G Waddington (IBM Research); Peter Chin (Dartmouth Univ.)
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
James Nguyen; Sanil Rao (Carnegie Mellon Univ.); Jose Moreira (IBM); Franz Franchetti (Carnegie Mellon Univ.)
Towards Hardware Accelerated Garbage Collection with Near-Memory Processing
Samuel Thomas; Jiwon Choe (Brown Univ.); Ofir Gordon; Erez Petrank (Technion Inst.); Tali Moreshet (Boston Univ.); Maurice Herlihy (Brown Univ.); Ruth Iris Bahar (Colorado School of Mines)
RaiderSTREAM: Adapting the STREAM Benchmark to Modern HPC Systems
Michael Beebe; Brody Williams; Stephen Devaney (Texas Tech Univ.); John Leidel (Tactical Computing Laboratories); Yong Chen (Texas Tech Univ.); Steve Poole (Los Alamos National Lab)

2-4: Scaling HPC Education Session (15:45-17:00)

Co-Chairs: Julie Mullen, Lauren Milechin & Hayden Jananthan

Invited Talk: TBD
Dr. Dhruva Chakravorty (Texas A&M Univ.)
Invited Talk: TBD
Dennis Milechin (Boston Univ.)
Invited Talk: TBD
Dr. Eric Coulter (Georgia Tech)

2-S1: GraphBLAS BoF Special (17:30-19:30)

Organizers: Tim Mattson & Scott McMillan

Wednesday, September 21

3-V Session (10:30-11:00)

Co-Chairs: Albert Reuther

3-1: AI / Machine Learning 1 Session (11:00-12:15)

Co-Chairs: Ashok Krishnamurthy & Julie Mullen

Invited Talk: Making AI Real: Insights from the Lab to Operations
Maj. Michael Kanaan (USAF)
Benchmarking Resource Usage for Efficient Distributed Deep Learning [Outstanding Paper Award]
Nathan C Frey (MIT LLSC); Baolin Li (Northeastern Univ.); Joseph P McDonald; Dan Zhao; Michael S Jones; David Bestor (MIT LLSC); Devesh Tiwari (Northeastern Univ.); Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Ultra Low-Power Deep Learning Applications at the Edge with Jetson Orin AGX Hardware
Mark Barnell (AFRL); Courtney Raymond (AFRL); Steven Smiley; Darrek Isereau; Daniel Brown (SRC, Inc.)
Performance Estimation for Efficient Image Segmentation Training of Weather Radar Algorithms
Joseph P McDonald (MIT LLSC); James Kurdzo; Phillip Stepanian; Mark Veillette (MIT Lincoln Laboratory); David Bestor; Michael Jones; Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Deep Gaussian Process with Multitask and Transfer Learning for Performance Optimization
Wissam Sid-Lakhdar (Univ. of Tennessee Knoxville); Mohsen Aznaveh (Texas A&M Univ.); Piotr Luszczek (Univ. of Tennessee Knoxville); Jack Dongarra (Univ. of Tennessee Knoxville, ORNL)

Tutorial Session: 3-T (12:15-15:45): SPIRAL Tutorial

Organizer(s): Franz Franchetti & Mike Franusich

3-2: AI / Machine Learning 2 Session (12:30-13:45)

Co-Chairs: Janice Gordon & Sanmukh Rao Kuppannagari

Invited Talk: Challenges in Geospatial Computing
Prof. Taylor Perron (MIT Geology)
A High-performance Deployment Framework for Pipelined CNN Accelerators with Flexible DSE Strategy
Conghui Luo; Wenjin Huang; Dehao Xiang; Yihua Huang (Sun Yat-sen Univ.)
Enabling Transformers to Understand Low-Level Programs
Zifan Guo; William S Moses (MIT)
Online Detection and Classification of State Transitions of Multivariate Shock and Vibration Data
Nicklaus Przybylski; William M Jones (Coastal Carolina Univ.); Nathan DeBardeleben (Los Alamos National Laboratory)
Surrogate ML/AI Model Benchmarking for FAIR Principles’ Conformance
Piotr Luszczek; Cade E Brown (Univ. of Tennessee Knoxville)

3-3: AI / Machine Learning 3 Session (14:15-15:30)

Co-Chairs: Janice Gordon & Sanmukh Rao Kuppannagari

Invited Talk: Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications
Albert Reuther (MIT LLSC); Sadasivan Shankar (Stanford Univ.)
Walker Activity Tracking Using Machine Learning
Maxwell A. Huang; Edward A. Clancy (Worcester Polytechnic Institute)
An Evaluation of Low Overhead Time Series Preprocessing Techniques for Downstream Machine Learning
Matthew L. Weiss; Joseph McDonald; David Bestor; Charles Yee (MIT LLSC); Daniel Edelman (MIT); Michael S Jones (MIT LLSC); Andrew Prout (MIT LLSC); Andrew Bowne; Lindsey McEvoy (US Air Force); Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Deep Learning For Tissue Classification
Kimberly Robasky (RENCI/UNC Chapel Hill)

3-4: General Purpose GPU Computing 1 Session (15:45-17:00)

Co-Chairs: Sadas Shankar & Hameed Badawy

Invited Talk: New Frontiers in Performance at Wafer Scale
Dr. Rob Schreiber (Cerebras)
AI and ML Accelerator Survey and Trends
Albert Reuther; Peter Michaleas; Michael S Jones; Vijay Gadepally; Siddharth Samsi; Jeremy Kepner (MIT LLSC)
A Multi-GPU Parallel Genetic Algorithm For Large-Scale Vehicle Routing Problems
Marwan Abdelatti; Manbir S Sodhi; Resit Sendag (Univ. of Rhode Island)
DASH: Scheduling Deep Learning Workloads on Multi-Generational GPU-Accelerated Clusters
Baolin Li; Tirthak Patel (Northeastern Univ.); Vijay Gadepally (MIT LLSC); Karen Gettings (MIT Lincoln Laboratory); Siddharth Samsi (MIT LLSC); Devesh Tiwari (Northeastern Univ.)
Evaluation of a Novel Scratchpad Memory Through Compiler Supported Simulation
Essa Imhmed (Eastern New Mexico Univ.); Jonathan Cook; Hameed Badawy (New Mexico State Univ.)

3-S1: AI Challenges Special (17:30-19:30)

Organizers: Vijay Gadepally

AIA Challenges – Status Update
Vijay Gadepally (MIT LLSC); Andy Bowne (US Air Force)
SEVIR Challenge
Mark Veillette; Esther Wolff (MIT Lincoln Laboratory)
Datacenter Challenge
Siddharth Samsi; Matthew Weiss (MIT LLSC)
Rainforest Challenge
Miriam Cha (MIT Lincoln Laboratory)
ManeuverID Challenge
Kaira Samuel (MIT)
MagNav Challenge
Jonathan Taylor (MIT Lincoln Laboratory)
CogPilot Challenge
Sophia Yuditskaya; Laura Brattain (MIT Lincoln Laboratory)

3-S2: Emerging Technologies Special (17:30-19:30)

Organizers: Kurt Keville, Donato Kava, Po Hao Chen

An HPC Watershed – Next Generation Arithmetic
John Gustafson (Arizona State Univ.)

Thursday, September 22

4-V Session (10:30-11:00)

Co-Chairs: Albert Reuther

4-1: ASIC and FPGA Advances 1 Session (11:00-12:15)

Co-Chairs: Sadas Shankar & Ken Cain

Invited Talk: GNN Acceleration on FPGAs
Prof. Viktor Prasanna (USC)
Flexible Hardware Accelerator Design Generation with SPIRAL
Guanglin Xu; James Hoe; Franz Franchetti (Carnegie Mellon Univ.)
On the Characterization of the Performance-Productivity Gap for FPGA
Atharva Gondhalekar; Thomas Twomey; Wu-chun Feng (Virginia Tech)
Optimizing Designs Using Several Types of Memories on Modern FPGAs
Mehmet Gungor; Kai Huang; Stratis Ioannidis; Miriam Leeser (Northeastern Univ.)
Performance Modeling Sparse MTTKRP Using Optical Static Random Access Memory on FPGA
Sasindu Wijeratne (Univ. of Southern California); Akhilesh Jaiswal; Ajey Jacob (USC-ISI); Bingyi Zhang; Viktor K Prasanna (Univ. of Southern California)

Tutorial Session: 4-T (12:15-15:45): Exploring Graph Analysis for HPC with Near-Memory Accelerators

Organizer(s): Jeffrey Young (Georgia Tech), Patrick Lavin (Georgia Tech), Jason Riedy (Lucata), Srinivas Eswar (Georgia Tech)

4-2: ASIC and FPGA Advances 2 Session (12:30-13:45)

Co-Chairs: Darrell Ricke & TBD

A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Larry Tang; Siyuan Chen; Keshav Harisrikanth; Guanglin Xu; Ken Mai; Franz Franchetti (Carnegie Mellon Univ.)
Modeling the Energy Efficiency of GEMM using Optical Random Access Memory
Bingyi Zhang; Akhilesh Jaiswal; Clynn Mathew; Ravi Teja Lakkireddy; Ajey Jacob; Sasindu Wijeratne; Viktor K. Prasanna (Univ. of Southern California)
Edge-Connected Jaccard Similarity for Graph Link Prediction on FPGA
Paul Sathre; Atharva Gondhalekar; Wu-chun Feng (Virginia Tech)
Design and Implementation of a Real-Time Parallel FFT for a Wideband Radar System on an FPGA
Lakshmi Pradeep Bheema; Rishu Anand; Pavan Vadakattu; Syed Azemuddin (IIIT Hyderabad); Aquibuddin Ahmed (RCI-DRDO)
Challenges Designing for FPGAs Using High-Level Synthesis
Clayton J Faber; Steven D Harris; Zhili Xiao; Roger Chamberlain (Washington Univ. in St. Louis); Anthony M Cabrera (Oak Ridge National Laboratory)

4-3: ASIC and FPGA Advances 3 Session (14:15-15:30)

Co-Chairs: Sanmukh Rao Kuppannagari & TBD

FPGA Acceleration of Fully Homomorphic Encryption over the Torus
Tian Ye (Univ. of Southern California); Rajgopal Kannan (Army Research Lab-West); Viktor K. Prasanna (Univ. of Southern California)
Optimizing Open-source FPGA CAD Tools
Shachi Vaman Khadilkar; Martin Margala (UMass Lowell)
LIMA: Hardware for FFT based large integer multiplication
James Nguyen; Michael Cai; Ziyi Zuo; Larry Tang; Ken Mai; Franz Franchetti (Carnegie Mellon Univ.)
Towards a Generic UVM
Kholoud Mahmoud; Randa Ahmed; Karim Ayman; Mostafa Ayman; Waleed Taie; Yasser Ibrahim; Hassan Mostafa (Cairo University); Khaled Salah (Siemens)
How to Prevent a Sick ASIC
William F. Ellersick (Analog Circuit Works)

4-4: General Purpose GPU Computing 2 Session (15:45-17:00)

Co-Chairs: Hameed Badawy & Mark Barnell

Optimal GPU Frequency Selection using Multi-Objective Approaches for HPC Systems
Ghazanfar Ali (Texas Tech Univ.); Sridutt Bhalachandra; Nicholas Wright (Lawrence Berkeley National Laboratory); Mert Side; Yong Chen (Texas Tech Univ.)
GPU-Accelerated High-Bandwidth Radar Centroiding
David J Brigada; Maximilian Merfeld; Kara Warner (MIT Lincoln Laboratory)
A Hierarchical Jacobi Iteration for Structured Matrices on GPUs using Shared Memory
Mohammad Shafaet Islam; Qiqi Wang (MIT)
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis
Hamdy Abdelkhalik (New Mexico State Univ.); Yehia Arafa (Qualcomm Technologies); Nandakishore Santhi (Los Alamos National Laboratory); Hameed Badawy (New Mexico State Univ.)
Apple Silicon Performance in Scientific Computing
Connor Kenyon; Collin Capano (UMass Dartmouth)

4-S1: Graph Challenge Special (17:30-19:30)

Organizers: Jeremy Kepner

Sparse Deep Neural Network Inference Using Different Programming Models
Hyungro Lee; Milan Jain; Sayan Ghosh (PNNL)
Kalman Filter Driven Estimation of Community Structure in Time Varying Graphs
Lisa JK Durbeck; Peter Athanas (Virginia Tech)
Improved Distributed-Memory Triangle Counting by Exploiting the Graph Structure
Sayan Ghosh (PNNL, Washington State Univ.)
HTC: Hybrid Vertex-parallel and Edge-parallel Triangle Counting
Li Zeng; Kang Yang; Haoran Cai; Jinhua Zhou; Rongqian Zhao; Xin Chen (Huawei Technologies)
FAST: A Scalable Subgraph Matching Framework over Large Graphs
Jiezhong He; Zhouyang Liu; Yixin Chen; Hengyue Pan; Zhen Huang; Dongsheng Li (NUDT)
Towards Fast GPU-based Sparse DNN Inference: A Hybrid Compute Model
Shaoxian Xu; Minkang Wu; Long Zheng; Zhiyuan Shao; Xiangyu Ye; Xiaofei Liao; Hai Jin (Huazhong Univ. of Science and Technology)
Accelerating Sparse Deep Neural Network Inference Using GPU Tensor Cores
Yufei Sun; Long Zheng; Qinggang Wang; Xiangyu Ye; Yu Huang; Pengcheng Yao; Xiaofei Liao; Hai Jin (Huazhong Univ. of Science and Technology)

Friday, September 23

5-V: Sponsor Showcase – Dell Session (10:30-11:00)

Co-Chairs: Albert Reuther

How to Wrestle with Global Scale ML and Win
Dr. Ben Fauber (Dell Technologies)

5-1: High Performance Data Analysis 1 Session (11:00-12:15)

Co-Chairs: Darrell Ricke & Ken Cain

Optimizing Performance and Storage of Memory-Mapped Persistent Data Structures [Outstanding Student Paper Award]
Karim Youssef (Virginia Tech); Abdullah Al Raqibul Islam (Univ. of North Carolina at Charlotte); Keita Iwabuchi (Lawrence Livermore National Laboratory); Wu-chun Feng (Virginia Tech); Roger Pearce (Lawrence Livermore National Laboratory)
Processing Particle Data Flows with SmartNICs [Outstanding Student Paper Award]
Jianshen Liu; Carlos Maltzahn (UC Santa Cruz); Matthew Curry; Craig Ulmer (Sandia National Laboratories)
AUTOPAGER: Auto-tuning Memory-Mapped I/O Parameters in Userspace [Outstanding Student Paper Award]
Karim Youssef; Niteya Shah (Virginia Tech); Maya B Gokhale; Roger Pearce (Lawrence Livermore National Laboratory); Wu-chun Feng (Virginia Tech)
An SSD-Based Accelerator for Singular Value Decomposition Recommendation Algorithm on Edge
Wei Wu; Letian Zhao; Qizhe Wu; Xiaotian Wang; Teng Tian; Xi Jin (Univ. of Science and Technology of China)
Hardware Software Codesign of Applications on the Edge: Accelerating Digital PreDistortion for Wireless Communications
Zhaoyang Han; Yiyue Jiang (Northeastern Univ.); Rahul Mushini; John Dooley (Maynooth Univ.); Miriam Leeser (Northeastern Univ.)

5-2: High Performance Data Analysis 2 Session (12:30-13:45)

Co-Chairs: Darrell Ricke & David Cousins

Im2win: Memory Efficient Convolution On SIMD Architectures
Shuai Lu; Jun Chu (Nanchang Hangkong Univ.); Xu T. Liu (Univ. of Washington)
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Naifeng Zhang (Carnegie Mellon Univ.); Homer Gamil (New York Univ.); Patrick Brinich (Drexel Univ.); Benedict Reynwar (USC ISI); Ahmad Al Badawi (Duality Technologies); Negar Neda; Deepraj Soni (New York Univ.); Yuriy Polyakov (Duality Technologies); Patrick Broderick (SpiralGen, Inc.); Michail Maniatakos (New York Univ.); Andrew Schmidt (USC ISI); Mike Franusich (SpiralGen, Inc.); Jeremy Johnson (Drexel Univ.); Brandon Reagen (New York Univ.); David Bruce Cousins (Duality Technologies); Franz Franchetti (Carnegie Mellon Univ.)
Python Implementation of the Dynamic Distributed Dimensional Data Model
Hayden R Jananthan (MIT LLSC); Lauren Milechin (MIT); Michael Jones; William Arcand; William Bergeron; David Bestor; Chansup Byun; Michael Houle; Matthew Hubbell; Vijay Gadepally; Anna Klein; Peter Michaleas; Guillermo Morales; Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi; Charles Yee; Jeremy Kepner (MIT LLSC)
Powering Practical Performance: Accelerated Numerical Computing in Pure Python
Matthew Penn; Christopher Milroy (NVIDIA)
Parallel Computing with DNA Forensics Data
Adam Michaleas; Philip Fremont-Smith; Chelsea Lennartz; Darrell O. Ricke (MIT Lincoln Laboratory)

5-3: Big Data and Distributed Computing 1 Session (14:15-15:30)

Co-Chairs: Sadas Shankar & Chansup Byun

Invited Talk: Data-Driven Precision Neuroscience
Dr. John Reynders (Neumora)
Distributed Out-of-Memory SVD on CPU/GPU Architectures [Outstanding Paper Award]
Ismael Boureima; Manish Bhattarai; Maksim E Eren; Nick Solovyev; Hristo Djidjev; Boian Alexandrov (Los Alamos National Laboratory)
HuGraph: Acceleration of GCN Training on Heterogeneous FPGA Clusters with Quantization [Outstanding Student Paper Award]
Letian Zhao; Qizhe Wu; Xiaotian Wang; Teng Tian; Wei Wu; Xi Jin (Univ. of Science and Technology of China)
A Scalable Inference Pipeline for 3D Axon Tracing Algorithms
Benjamin M Fenelon; Lars Gjesteby (MIT Lincoln Laboratory); Webster Guan; Juhyuk Park; Kwanghun Chung (MIT); Laura Brattain (MIT Lincoln Laboratory)
Exploring the Impacts of Software Cache Configuration for In-line Data Compression
Sansriti Ranjan; Dakota Fulp; Jon C Calhoun (Clemson Univ.)

5-4: Big Data and Distributed Computing 2 Session (15:45-17:00)

Co-Chairs: Rich Vuduc & Nikos Pitsianis

Invited Talk: HPC Graphs in the AWS Cloud
Roger Pearce (LLNL)
pPython for Parallel Python Programming
Chansup Byun; William Arcand; David Bestor; Bill Bergeron; Vijay Gadepally; Michael Houle; Matthew Hubbell; Hayden Jananthan; Michael Jones (MIT LLSC); Kurt Keville (MIT); Anna Klein; Peter Michaleas (MIT LLSC); Lauren Milechin (MIT); Guillermo Morales; Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi; Charles Yee; Jeremy Kepner (MIT LLSC)
The Viability of Using Online Prediction to Perform Extra Work while Executing BSP Applications
Po Hao Chen; Pouya Haghi; Jae Yoon Chung (Boston Univ.); Tong Geng (Univ. of Rochester); Richard West (Boston Univ.); Anthony Skjellum (UTC); Martin Herbordt (Boston Univ.)
Real-Time Software Architecture for EM-Based Radar Signal Processing and Tracking
Alan W Nussbaum (Georgia Tech, GTRI); Byron Keel (GTRI); William Dale Blair (GTRI, Georgia Tech); Umakishore Ramachandran (Georgia Tech)