2022 IEEE High Performance Extreme Computing
Virtual Conference

19 - 23 September 2022

IEEE HPEC Preliminary Agenda

All times are EDT (UTC/GMT -04 hours)

| Day | Monday | Tuesday | Wednesday | Thursday | Friday |
| 10:30-11:00am | Session 1-V: Sponsor Showcase – Lenovo | Session 2-V: Keynote | | | Session 5-V: Sponsor Showcase – Dell |
| 11:00am-12:15pm | Session 1-1: Advanced Multicore Software Technologies | Session 2-1: Graph Analytics & Network Science 1 | Session 3-1: AI / Machine Learning 1 | Session 4-1: ASIC and FPGA Advances 1 | Session 5-1: High Performance Data Analysis 1 |
| 12:15-12:30pm | Break; Poster Session 1-P (12:15-14:15) | Break; Poster Session 2-P (12:15-14:15) | Break; Tutorial Session 3-T (12:15-15:45): SPIRAL Tutorial (http://www.spiral.net/tutorial-spiral.html) | Break; Tutorial Session 4-T (12:15-15:45): Exploring Graph Analysis for HPC with Near-Memory Accelerators | Break |
| 12:30-1:45pm | Session 1-2: Cloud HPEC | Session 2-2: Graph Analytics & Network Science 2 | Session 3-2: AI / Machine Learning 2 | Session 4-2: ASIC and FPGA Advances 2 | Session 5-2: High Performance Data Analysis 2 |
| 1:45-2:15pm | Break | Break | Break | Break | Break |
| 2:15-3:30pm | Session 1-3: Quantum and Non-Deterministic Computing | Session 2-3: Data Intensive Computing | Session 3-3: AI / Machine Learning 3 | Session 4-3: ASIC and FPGA Advances 3 | Session 5-3: Big Data and Distributed Computing 1 |
| 3:30-3:45pm | Break | Break | Break | Break | Break |
| 3:45-5:00pm | Session 1-4: BRAIDS – Boosting Resilience through Artificial Intelligence and Decision Support | Session 2-4: Scaling HPC Education | Session 3-4: General Purpose GPU Computing 1 | Session 4-4: General Purpose GPU Computing 2 | Session 5-4: Big Data and Distributed Computing 2 |
| 5:00-5:30pm | Break | Break | Break | Break | Break |
| 5:30-7:30pm | Session 1-S1: Sky Computing – Toward Efficient Computing on the Cloud | Session 2-S1: GraphBLAS BoF | Session 3-S1: AI Challenges; Session 3-S2: Emerging Technologies | Session 4-S1: Best Paper and Graph Challenge | |

Monday, September 19

 

1-V: Sponsor Showcase – Lenovo Session (10:30-11:00)

Co-Chairs: Albert Reuther  
Carbon Footprint Aware Computing
Scott Tease (Lenovo)
 

1-1: Advanced Multicore Software Technologies Session (11:00-12:15)

Co-Chairs: Franz Franchetti & Brian Sroka  
Performance speedup of Quantum Espresso using optimized AOCL-FFTW
S. Biplab Raut (AMD)
Task-Parallel Programming with Constrained Parallelism
Tsung-Wei Huang (University of Utah); Leslie Hwang (Synopsys)
HashTag: Fast Lookup in a Persistent Memory Filesystem
Matthew Curtis-Maury; Yash Trivedi (NetApp)
Computing In-Place FFTs with SIMD Lane Slicing
Benoît Dupont de Dinechin (Kalray)
Applying the Midas Touch of Reproducibility to High-Performance Computing
Austin Minor; Wu-chun Feng (Virginia Tech)
 

Poster Session: 1-P (12:15-14:15): Poster Session 1

Chair(s)/Host(s): TBD & TBD  
Resource-Constrained Optimizations For Synthetic Aperture Radar On-Board Image Processing [Outstanding Paper Award]
Maron Schlemon (German Aerospace Center); Martin Schulz (TU Munich); Rolf Scheiber (German Aerospace Center)
Enhancing the Performance Portability of Heterogeneous Circuit Analysis Programs
Tsung-Wei Huang (University of Utah)
Kv2vec: A Distributed Representation Method for Key-value Pairs from Metadata Attributes
Chenxu Niu; Wei Zhang (Texas Tech Univ.); Suren Byna (LBNL); Yong Chen (Texas Tech Univ.)
Unsupervised Adaptation of Spiking Networks in a Gradual Changing Environment
Zaidao Mei (Syracuse Univ.); Mark Barnell (Air Force Research Laboratory); Qinru Qiu (Syracuse Univ.)
Predicting Ankle Moment Trajectory with Adaptive Weighted Ensemble of LSTM Networks
Emilia A Grzesiak; Ho Chit Siu; Jennifer Sloboda (MIT Lincoln Laboratory)
Interval Arithmetic-based FFT for Large Integer Multiplication
Zibo Gong; Nathan Zhu; Matt Ngaw (Carnegie Mellon Univ.); Joao Rivera (ETH Zurich); Larry Tang; Eric Tang; Het Mankad; Franz Franchetti (Carnegie Mellon Univ.)
Machine Learning for Accurate and Fast Bandgap Prediction of Solid-State Materials
Shomik Verma; Shivam Kajale; Rafael Gomez-Bombarelli (MIT)
Systolic Array based FPGA accelerator for Yolov3-tiny
Prithvi Velicheti; Sivani Pentapati; Suresh Purini (IIIT Hyderabad)
Epigenetics and Transcriptomics Quality Control Pipelines in a HPC Environment
Darrell O Ricke (MIT Lincoln Laboratory); Derek Ng (Northeastern Univ.); Philip Fremont-Smith; Adam Michaleas; Rafael Jaimes (MIT Lincoln Laboratory)
 

1-2: Cloud HPEC Session (12:30-13:45)

Co-Chairs: Brian Sroka & Laura Brattain  
Invited Talk: HPC Matters! How Supercomputing Supports NASA’s Mission
Dr. Piyush Mehrotra (NASA)
Scalable Interactive Autonomous Navigation Simulations on HPC
Wesley Brewer; Joel Bretheim (HPCMP PET/GDIT); John Kaniarz (DEVCOM Ground Vehicle Systems Center); Peilin Song; Burhman Gates (Engineer Research & Development Center)
Parallelizing Explicit and Implicit Extrapolation Methods for Ordinary Differential Equations
Utkarsh (IIT Kanpur); Chris Elrod; Yingbo Ma; Christopher Rackauckas (Julia Computing)
SuperCloud Lite in the Cloud – Lightweight, Secure, Self-Service, On-Demand Mechanisms for Creating Customizable Research Computing Environments
Kelsie Edie (US Military Academy); Kurt Keville; Lauren Milechin; Chris N Hill (MIT)
Site-Wide HPC Data Center Demand Response
Daniel C Wilson; Ioannis Paschalidis; Ayse K. Coskun (Boston Univ.)
 

1-3: Quantum and Non-Deterministic Computing Session (14:15-15:30)

Co-Chairs: Patrick Dreher & Donato Kava  
C2QA – Bosonic Qiskit [Outstanding Paper Award]
Timothy Stavenger (PNNL); Eleanor Crane (JQI, QuICS); Kevin Smith (Brookhaven National Laboratory, Yale Univ.); Christopher T Kang (Univ. of Washington); Steven Girvin (Yale Univ.); Nathan Wiebe (Univ. of Toronto, PNNL)
Constructing Optimal Contraction Trees for Tensor Network Quantum Circuit Simulation [Best Student Paper Award]
Cameron A Ibrahim (Univ. of Delaware); Danylo Lykov (Argonne National Laboratory); Zichang He (UC Santa Barbara); Yuri Alexeev (Argonne National Laboratory); Ilya Safro (Univ. of Delaware)
Quantum Netlist Compiler (QNC)
Shamminuj Aktar; Abdel-Hameed A. Badawy (New Mexico State Univ.); Nandakishore Santhi (Los Alamos National Laboratory)
Hardware Design and Implementation of Classic McEliece Post-Quantum Cryptosystem Based on FPGA
Shaofen Chen; Haiyan Lin; Wenjin Huang; Yihua Huang (Sun Yat-sen Univ.)
Hardware Design and Implementation of Post-Quantum Cryptography Kyber
Qingru Zeng; Quanxin Li; Baoze Zhao; Han Jiao; Yihua Huang (Sun Yat-sen Univ.)
 

1-4: BRAIDS – Boosting Resilience through Artificial Intelligence and Decision Support Session (15:45-17:00)

Co-Chairs: Courtland VanDam & Sandeep Pisharody  
Invited Talk: Welcome to CyberWar: Long Term Ramifications Unleashed by Russia’s War
Barry Greene (Akamai)
Distributed Hardware Accelerated Secure Joint Computation on the COPA Framework [Outstanding Student Paper Award]
Rushi Patel; Pouya Haghi (Boston Univ.); Shweta Jain; Andriy Kot; Venkata Krishnan (Intel); Mayank Varia; Martin Herbordt (Boston Univ.)
Large Scale Enrichment and Statistical Cyber Characterization of Network Traffic
Ivan Kawaminami; Arminda Estrada; Youssef Elsakkary (Univ. of Arizona); Hayden Jananthan (MIT LLSC); Aydin Buluc (LBNL); Tim Davis (Texas A&M Univ.); Daniel Grant (GreyNoise); Michael Jones (MIT LLSC); Chad Meiners (MIT Lincoln Laboratory); Andrew Morris (GreyNoise); Sandeep Pisharody (MIT Lincoln Laboratory); Jeremy Kepner (MIT LLSC)
Edge Computing Security for a Multi-Agent System
Alice Lee; Karen Gettings; Matthias Beebe; Paul Monticciolo; Michael Vai (MIT Lincoln Laboratory)
Invited Talk: Proposed Empirical Assessment of Remote Workers’ Cyberslacking and Computer Security Posture to Assess Organizational Cybersecurity Risks
Ariel Luna; Yair Levy; Greg Simco; Wei Li (Nova Southeastern University)
 

1-S1: Sky Computing – Toward Efficient Computing on the Cloud Special (17:30-19:30)

Organizers: Marco Montes de Oca, Luna Xu, Erica Lin, Suraj Bramhavar, Jeffrey Chou (Sync Computing)  
Running Spark Applications In Large Scale On K8s: Challenges and Solutions
Bo Yang (Stealth Startup)
Taming High-Performance Computing Platform Heterogeneity with Machine Learning
Prasanna Balaprakash (Argonne National Laboratory)
Optimizing Heterogeneous Computing Resources Based Only on Cost and Time
Suraj Bramhavar (Sync Computing)
AI-Powered Acceleration of Deep Learning Inference on the Cloud
Glenn Ko (Stochastic)
Cost-Effective Batch Scheduling in the Cloud
Chaoran Yu (Apple)
   

Tuesday, September 20

 

2-V: Keynote Session (10:30-11:00)

Co-Chairs: Jeremy Kepner & Albert Reuther  
Reflections on a Career in Computer Science
Prof. Barbara Liskov (MIT CSAIL)
 

2-1: Graph Analytics & Network Science 1 Session (11:00-12:15)

Co-Chairs: John Gilbert & Chris Long  
Invited Talk: The NSF Computing and Information Science and Engineering Landscape: A Look Forward
Dr. Almadena Chtchelkanova (NSF)
GraphBLAS on the Edge: Anonymized High Performance Streaming of Network Traffic [Best Paper Award]
Michael S Jones; Jeremy Kepner (MIT LLSC); Daniel Andersen (CAIDA); Aydın Buluç (LBNL); Chansup Byun (MIT LLSC); K Claffy (CAIDA); Timothy Davis (Texas A&M); William Arcand (MIT LLSC); Jonathan Bernays (MIT Lincoln Laboratory); David Bestor; William Bergeron; Vijay Gadepally; Michael Houle; Matthew Hubbell; Hayden Jananthan; Anna Klein (MIT LLSC); Chad Meiners (MIT Lincoln Laboratory); Lauren Milechin (MIT); Julie Mullen (MIT LLSC); Sandeep Pisharody (MIT Lincoln Laboratory); Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi (MIT LLSC); Jon Sreekanth (Accolade Technology); Doug Stetson (MIT Lincoln Laboratory); Charles Yee; Peter Michaleas (MIT LLSC)
Analyzing Multi-trillion Edge Graphs on Large GPU Clusters: A Case Study with PageRank [Outstanding Paper Award]
Seunghwa Kang; Joseph Nke; Brad Rees (NVIDIA)
Achieving Speedups for Distributed Graph Biconnectivity
Ian Bogle; George M. Slota (RPI)
Generating Permutations Using Hash Tables
Oded Green; Corey Nolet; Joe Eaton (NVIDIA)
 

Poster Session: 2-P (12:15-14:15): Poster Session 2

Chair(s)/Host(s): Siddharth Samsi & Yehia Arafa
ProtoX: A First Look
Het Mankad; Sanil Rao (Carnegie Mellon Univ.); Phillip Colella; Brian Van Straalen (Lawrence Berkeley National Laboratory); Franz Franchetti (Carnegie Mellon Univ.)
Magic Memory: A Programming Model For Big Data Analytics
Eric Tang; Franz Franchetti (Carnegie Mellon Univ.)
Approximating Manifolds and Geodesics with Curved Surfaces
Peter Oostema; Franz Franchetti (Carnegie Mellon Univ.)
Network Automation in Lab Deployment Using Ansible and Python
Deeptaanshu Kumar (Carnegie Mellon Univ.); Suxi Li (Univ. of Miami)
Triangle Centrality in Arkouda
Joseph T Patchett; Zhihui Du; Fuhuan Li; David Bader (New Jersey Inst. of Tech.)
Image Recognition Using Machine Learning For Forbidden Items Detection In Airports
Alaa Atef; Abde-ljalil Naser; Mahmoud Mohamed; Mariam Safwat; Menna Tulla Ayman; Mohamed Mostafa; Salma Hesham (Ain Shams University); Khaled Salah (Siemens)
 

2-2: Graph Analytics & Network Science 2 Session (12:30-13:45)

Co-Chairs: John Gilbert & Chris Long  
Hypersparse Network Flow Analysis of Packets with GraphBLAS
Tyler Trigg; Chad Meiners; Sandeep Pisharody (MIT Lincoln Laboratory); Hayden Jananthan; Michael Jones (MIT LLSC); Adam Michaleas (MIT Lincoln Laboratory); Timothy Davis (Texas A&M Univ.); Erik Welch (NVIDIA); William Arcand; David Bestor; William Bergeron; Chansup Byun; Vijay Gadepally; Michael Houle; Matthew Hubbell; Anna Klein; Peter Michaleas (MIT LLSC); Lauren Milechin (MIT); Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi (MIT LLSC); Doug Stetson (MIT Lincoln Laboratory); Charles Yee; Jeremy Kepner (MIT LLSC)
SHARP: Software Hint-Assisted Memory Access Prediction for Graph Analytics
Pengmiao Zhang (Univ. of Southern California); Rajgopal Kannan (US Army Research Lab-West); Xiangzhi Tong (Xi’an Jiaotong-Liverpool Univ.); Anant V Nori (Intel Labs); Viktor K Prasanna (Univ. of Southern California)
Fast Graph Algorithms for Superpixel Segmentation
Dimitris Floros (Aristotle Univ. of Thessaloniki); Tiancheng Liu (Duke Univ.); Nikos P Pitsianis (Aristotle Univ. of Thessaloniki, Duke Univ.); Xiaobai Sun (Duke Univ.)
Explicit Ordering Refinement for Accelerating Irregular Graph Analysis
Michael Mandulak; Ruochen Hu; George M Slota (RPI)
Efficient Calculation of Triangle Centrality in Big Data Networks
Wali Mohammad Abdullah; David Awosoga; Shahadat Hossain (Univ. of Lethbridge)
 

2-3: Data Intensive Computing Session (14:15-15:30)

Co-Chairs: Xiaobai Sun & Nikos Pitsianis  
Enabling Novel In-Memory Computation Algorithms to Address Next-Generation Throughput Constraints on SWaP-Limited Platforms
Jessica M Ray; Chad Meiners (MIT Lincoln Laboratory)
Towards Fast Crash-Consistent Cluster Checkpointing
Andrew E Wood (Boston Univ.); Moshik Hershcovitch (IBM Research); Ilias Ennmouri (IBM); Weiyu Zong; Saurav Chennuri (Boston Univ.); Sarel Cohen (The Academic College of Tel Aviv-Yaffo); Swaminathan Sundararaman (IBM); Daniel G Waddington (IBM Research); Peter Chin (Dartmouth Univ.)
Automatic Generation of Matrix-Vector Code Using SPIRAL for the Power10 ISA
James Nguyen; Sanil Rao (Carnegie Mellon Univ.); Jose Moreira (IBM); Franz Franchetti (Carnegie Mellon Univ.)
Towards Hardware Accelerated Garbage Collection with Near-Memory Processing
Samuel Thomas; Jiwon Choe (Brown Univ.); Ofir Gordon; Erez Petrank (Technion Inst.); Tali Moreshet (Boston Univ.); Maurice Herlihy (Brown Univ.); Ruth Iris Bahar (Colorado School of Mines)
RaiderSTREAM: Adapting the STREAM Benchmark to Modern HPC Systems
Michael Beebe; Brody Williams; Stephen Devaney (Texas Tech Univ.); John Leidel (Tactical Computing Laboratories); Yong Chen (Texas Tech Univ.); Steve Poole (Los Alamos National Lab)
 

2-4: Scaling HPC Education Session (15:45-17:00)

Co-Chairs: Julie Mullen, Lauren Milechin & Hayden Jananthan  

Focus: Education and Training for New Domains

Talk 1: Supporting GIS Workflows at Boston University
Dennis Milechin (Boston Univ.)
Talk 2: Machine Learning and GPU Computing Training via User Groups, Workshops, and Consultations
Dr. John Halverson (Princeton Univ.)
Discussion (15 mins): Education and Training for New Domains

Focus: Education and Training for New Communities

Talk 3: Science Gateways: Smoothing the Edges of Diverse Resources
Dr. Eric Coulter (Georgia Tech)
Talk 4: Engaging with Two-year Institutions – The Journey so Far
Dr. Dhruva Chakravorty (Texas A&M Univ.)
Discussion (15 mins): Education and Training for New Communities
 

2-S1: GraphBLAS BoF Special (17:30-19:30)

Organizers: Tim Mattson & Scott McMillan  
   

Wednesday, September 21

 

3-1: AI / Machine Learning 1 Session (11:00-12:15)

Co-Chairs: Ashok Krishnamurthy & Julie Mullen  
Invited Talk: Making AI Real: Insights from the Lab to Operations
Maj. Michael Kanaan (USAF)
Benchmarking Resource Usage for Efficient Distributed Deep Learning [Outstanding Paper Award]
Nathan C Frey (MIT LLSC); Baolin Li (Northeastern Univ.); Joseph P McDonald; Dan Zhao; Michael S Jones; David Bestor (MIT LLSC); Devesh Tiwari (Northeastern Univ.); Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Ultra Low-Power Deep Learning Applications at the Edge with Jetson Orin AGX Hardware
Mark Barnell (AFRL); Courtney Raymond (AFRL); Steven Smiley; Darrek Isereau; Daniel Brown (SRC, Inc.)
Performance Estimation for Efficient Image Segmentation Training of Weather Radar Algorithms
Joseph P McDonald (MIT LLSC); James Kurdzo; Phillip Stepanian; Mark Veillette (MIT Lincoln Laboratory); David Bestor; Michael Jones; Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Deep Gaussian Process with Multitask and Transfer Learning for Performance Optimization
Wissam Sid-Lakhdar (Univ. of Tennessee Knoxville); Mohsen Aznaveh (Texas A&M Univ.); Piotr Luszczek (Univ. of Tennessee Knoxville); Jack Dongarra (Univ. of Tennessee Knoxville, ORNL)
 

Tutorial Session: 3-T (12:15-15:45): SPIRAL Tutorial (http://www.spiral.net/tutorial-spiral.html)

Organizer(s): Franz Franchetti & Mike Franusich  
 

3-2: AI / Machine Learning 2 Session (12:30-13:45)

Co-Chairs: Janice Gordon & Sanmukh Rao Kuppannagari  
Invited Talk: Challenges in Geospatial Computing
Prof. Taylor Perron (MIT Geology)
A High-performance Deployment Framework for Pipelined CNN Accelerators with Flexible DSE Strategy
Conghui Luo; Wenjin Huang; Dehao Xiang; Yihua Huang (Sun Yat-sen Univ.)
Enabling Transformers to Understand Low-Level Programs
Zifan Guo; William S Moses (MIT)
Online Detection and Classification of State Transitions of Multivariate Shock and Vibration Data
Nicklaus Przybylski; William M Jones (Coastal Carolina Univ.); Nathan DeBardeleben (Los Alamos National Laboratory)
Surrogate ML/AI Model Benchmarking for FAIR Principles’ Conformance
Piotr Luszczek; Cade E Brown (Univ. of Tennessee Knoxville)
 

3-3: AI / Machine Learning 3 Session (14:15-15:30)

Co-Chairs: Janice Gordon & Sanmukh Rao Kuppannagari  
Invited Talk: Trends in Energy Estimates for Computing in AI/Machine Learning Accelerators, Supercomputers, and Compute-Intensive Applications
Albert Reuther (MIT LLSC); Sadasivan Shankar (Stanford Univ.)
Walker Activity Tracking Using Machine Learning
Maxwell A. Huang; Edward A. Clancy (Worcester Polytechnic Institute)
An Evaluation of Low Overhead Time Series Preprocessing Techniques for Downstream Machine Learning
Matthew L. Weiss; Joseph McDonald; David Bestor; Charles Yee (MIT LLSC); Daniel Edelman (MIT); Michael S Jones (MIT LLSC); Andrew Prout (MIT LLSC); Andrew Bowne; Lindsey McEvoy (US Air Force); Vijay Gadepally; Siddharth Samsi (MIT LLSC)
Deep Learning For Tissue Classification
Kimberly Robasky (RENCI/UNC Chapel Hill)
 

3-4: General Purpose GPU Computing 1 Session (15:45-17:00)

Co-Chairs: Sadas Shankar & Hameed Badawy  
Invited Talk: New Frontiers in Performance at Wafer Scale
Dr. Rob Schreiber (Cerebras)
AI and ML Accelerator Survey and Trends
Albert Reuther; Peter Michaleas; Michael S Jones; Vijay Gadepally; Siddharth Samsi; Jeremy Kepner (MIT LLSC)
A Multi-GPU Parallel Genetic Algorithm For Large-Scale Vehicle Routing Problems
Marwan Abdelatti; Manbir S Sodhi; Resit Sendag (Univ. of Rhode Island)
DASH: Scheduling Deep Learning Workloads on Multi-Generational GPU-Accelerated Clusters
Baolin Li; Tirthak Patel (Northeastern Univ.); Vijay Gadepally (MIT LLSC); Karen Gettings (MIT Lincoln Laboratory); Siddharth Samsi (MIT LLSC); Devesh Tiwari (Northeastern Univ.)
Evaluation of a Novel Scratchpad Memory Through Compiler Supported Simulation
Essa Imhmed (Eastern New Mexico Univ.); Jonathan Cook; Hameed Badawy (New Mexico State Univ.)
 

3-S1: AI Challenges Special (17:30-19:30)

Organizers: Vijay Gadepally  
AIA Challenges – Status Update
Vijay Gadepally (MIT LLSC); Andy Bowne (US Air Force)
SEVIR Challenge
Mark Veillette; Esther Wolff (MIT Lincoln Laboratory)
Datacenter Challenge
Siddharth Samsi; Matthew Weiss (MIT LLSC)
Rainforest Challenge
Miriam Cha (MIT Lincoln Laboratory)
ManeuverID Challenge
Kaira Samuel (MIT)
MagNav Challenge
Jonathan Taylor (MIT Lincoln Laboratory)
CogPilot Challenge
Sophia Yuditskaya; Laura Brattain (MIT Lincoln Laboratory)
 

3-S2: Emerging Technologies Special (17:30-19:30)

Organizers: Kurt Keville, Donato Kava, Po Hao Chen  
An HPC Watershed – Next Generation Arithmetic
John Gustafson (Arizona State Univ.)
GCN Split: Bound the State of Streaming Graph Partitioning
Sonia Horchidan (KTH)
Isango: A Composable, Affordable, and Portable Platform for RCD Workforce Development
Elizabeth Leake (Boise State University/STEM-Trek Nonprofit)
   

Thursday, September 22

 

4-1: ASIC and FPGA Advances 1 Session (11:00-12:15)

Co-Chairs: Sadas Shankar & Ken Cain  
Invited Talk: GNN Acceleration on FPGAs
Prof. Viktor Prasanna (USC)
Flexible Hardware Accelerator Design Generation with SPIRAL
Guanglin Xu; James Hoe; Franz Franchetti (Carnegie Mellon Univ.)
On the Characterization of the Performance-Productivity Gap for FPGA
Atharva Gondhalekar; Thomas Twomey; Wu-chun Feng (Virginia Tech)
Optimizing Designs Using Several Types of Memories on Modern FPGAs
Mehmet Gungor; Kai Huang; Stratis Ioannidis; Miriam Leeser (Northeastern Univ.)
Performance Modeling Sparse MTTKRP Using Optical Static Random Access Memory on FPGA
Sasindu Wijeratne (Univ. of Southern California); Akhilesh Jaiswal; Ajey Jacob (USC-ISI); Bingyi Zhang; Viktor K Prasanna (Univ. of Southern California)
 

Tutorial Session: 4-T (12:15-15:45): Exploring Graph Analysis for HPC with Near-Memory Accelerators

Organizer(s): Jeffrey Young (Georgia Tech), Patrick Lavin (Georgia Tech), Jason Riedy (Lucata), Srinivas Eswar (Georgia Tech)  
 

4-2: ASIC and FPGA Advances 2 Session (12:30-13:45)

Co-Chairs: Darrell Ricke & Siddharth Samsi
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look
Larry Tang; Siyuan Chen; Keshav Harisrikanth; Guanglin Xu; Ken Mai; Franz Franchetti (Carnegie Mellon Univ.)
Modeling the Energy Efficiency of GEMM using Optical Random Access Memory
Bingyi Zhang; Akhilesh Jaiswal; Clynn Mathew; Ravi Teja Lakkireddy; Ajey Jacob; Sasindu Wijeratne; Viktor K. Prasanna (Univ. of Southern California)
Edge-Connected Jaccard Similarity for Graph Link Prediction on FPGA
Paul Sathre; Atharva Gondhalekar; Wu-chun Feng (Virginia Tech)
Design and Implementation of a Real-Time Parallel FFT for a Wideband Radar System on an FPGA
Lakshmi Pradeep Bheema; Rishu Anand; Pavan Vadakattu; Syed Azemuddin (IIIT Hyderabad); Aquibuddin Ahmed (RCI-DRDO)
Challenges Designing for FPGAs Using High-Level Synthesis
Clayton J Faber; Steven D Harris; Zhili Xiao; Roger Chamberlain (Washington Univ. in St. Louis); Anthony M Cabrera (Oak Ridge National Laboratory)
 

4-3: ASIC and FPGA Advances 3 Session (14:15-15:30)

Co-Chairs: Sanmukh Rao Kuppannagari & Plamen Krastev  
FPGA Acceleration of Fully Homomorphic Encryption over the Torus
Tian Ye (Univ. of Southern California); Rajgopal Kannan (Army Research Lab-West); Viktor K. Prasanna (Univ. of Southern California)
Optimizing Open-source FPGA CAD Tools
Shachi Vaman Khadilkar; Martin Margala (UMass Lowell)
LIMA: Hardware for FFT based large integer multiplication
James Nguyen; Michael Cai; Ziyi Zuo; Larry Tang; Ken Mai; Franz Franchetti (Carnegie Mellon Univ.)
Towards a Generic UVM
Kholoud Mahmoud; Randa Ahmed; Karim Ayman; Mostafa Ayman; Waleed Taie; Yasser Ibrahim; Hassan Mostafa (Cairo University); Khaled Salah (Siemens)
How to Prevent a Sick ASIC
William F. Ellersick (Analog Circuit Works)
 

4-4: General Purpose GPU Computing 2 Session (15:45-17:00)

Co-Chairs: Hameed Badawy & Mark Barnell  
Optimal GPU Frequency Selection using Multi-Objective Approaches for HPC Systems
Ghazanfar Ali (Texas Tech Univ.); Sridutt Bhalachandra; Nicholas Wright (Lawrence Berkeley National Laboratory); Mert Side; Yong Chen (Texas Tech Univ.)
GPU-Accelerated High-Bandwidth Radar Centroiding
David J Brigada; Maximilian Merfeld; Kara Warner (MIT Lincoln Laboratory)
A Hierarchical Jacobi Iteration for Structured Matrices on GPUs using Shared Memory
Mohammad Shafaet Islam; Qiqi Wang (MIT)
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis
Hamdy Abdelkhalik (New Mexico State Univ.); Yehia Arafa (Qualcomm Technologies); Nandakishore Santhi (Los Alamos National Laboratory); Hameed Badawy (New Mexico State Univ.)
Apple Silicon Performance in Scientific Computing
Connor Kenyon; Collin Capano (UMass Dartmouth)
 

4-S1: Graph Challenge Special (17:30-19:30)

Organizers: Jeremy Kepner  
Best Paper Award and Best Student Paper Award Announcement
Manoj Kumar (IBM) and Sanmukh Rao Kuppannagari (Case Western Reserve Univ.)
Sparse Deep Neural Network Inference Using Different Programming Models
Hyungro Lee; Milan Jain; Sayan Ghosh (PNNL)
Kalman Filter Driven Estimation of Community Structure in Time Varying Graphs
Lisa JK Durbeck; Peter Athanas (Virginia Tech)
Improved Distributed-Memory Triangle Counting by Exploiting the Graph Structure
Sayan Ghosh (PNNL, Washington State Univ.)
HTC: Hybrid Vertex-parallel and Edge-parallel Triangle Counting
Li Zeng; Kang Yang; Haoran Cai; Jinhua Zhou; Rongqian Zhao; Xin Chen (Huawei Technologies)
FAST: A Scalable Subgraph Matching Framework over Large Graphs
Jiezhong He; Zhouyang Liu; Yixin Chen; Hengyue Pan; Zhen Huang; Dongsheng Li (NUDT)
Towards Fast GPU-based Sparse DNN Inference: A Hybrid Compute Model
Shaoxian Xu; Minkang Wu; Long Zheng; Zhiyuan Shao; Xiangyu Ye; Xiaofei Liao; Hai Jin (Huazhong Univ. of Science and Technology)
Accelerating Sparse Deep Neural Network Inference Using GPU Tensor Cores
Yufei Sun; Long Zheng; Qinggang Wang; Xiangyu Ye; Yu Huang; Pengcheng Yao; Xiaofei Liao; Hai Jin (Huazhong Univ. of Science and Technology)
   

Friday, September 23

 

5-V: Sponsor Showcase – Dell Session (10:30-11:00)

Co-Chairs: Albert Reuther  
How to Wrestle with Global Scale ML and Win
Dr. Ben Fauber (Dell Technologies)
 

5-1: High Performance Data Analysis 1 Session (11:00-12:15)

Co-Chairs: Darrell Ricke & Ken Cain  
Optimizing Performance and Storage of Memory-Mapped Persistent Data Structures [Outstanding Student Paper Award]
Karim Youssef (Virginia Tech); Abdullah Al Raqibul Islam (Univ. of North Carolina at Charlotte); Keita Iwabuchi (Lawrence Livermore National Laboratory); Wu-chun Feng (Virginia Tech); Roger Pearce (Lawrence Livermore National Laboratory)
Processing Particle Data Flows with SmartNICs [Outstanding Student Paper Award]
Jianshen Liu; Carlos Maltzahn (UC Santa Cruz); Matthew Curry; Craig Ulmer (Sandia National Laboratories)
AUTOPAGER: Auto-tuning Memory-Mapped I/O Parameters in Userspace [Outstanding Student Paper Award]
Karim Youssef; Niteya Shah (Virginia Tech); Maya B Gokhale; Roger Pearce (Lawrence Livermore National Laboratory); Wu-chun Feng (Virginia Tech)
An SSD-Based Accelerator for Singular Value Decomposition Recommendation Algorithm on Edge
Wei Wu; Letian Zhao; Qizhe Wu; Xiaotian Wang; Teng Tian; Xi Jin (Univ. of Science and Technology of China)
Hardware Software Codesign of Applications on the Edge: Accelerating Digital PreDistortion for Wireless Communications
Zhaoyang Han; Yiyue Jiang (Northeastern Univ.); Rahul Mushini; John Dooley (Maynooth Univ.); Miriam Leeser (Northeastern Univ.)
 

5-2: High Performance Data Analysis 2 Session (12:30-13:45)

Co-Chairs: Darrell Ricke & David Cousins  
Im2win: Memory Efficient Convolution On SIMD Architectures
Shuai Lu; Jun Chu (Nanchang Hangkong Univ.); Xu T. Liu (Univ. of Washington)
Python Implementation of the Dynamic Distributed Dimensional Data Model
Hayden R Jananthan (MIT LLSC); Lauren Milechin (MIT); Michael Jones; William Arcand; William Bergeron; David Bestor; Chansup Byun; Michael Houle; Matthew Hubbell; Vijay Gadepally; Anna Klein; Peter Michaleas; Guillermo Morales; Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi; Charles Yee; Jeremy Kepner (MIT LLSC)
Powering Practical Performance: Accelerated Numerical Computing in Pure Python
Matthew Penn; Christopher Milroy (NVIDIA)
Parallel Computing with DNA Forensics Data
Adam Michaleas; Philip Fremont-Smith; Chelsea Lennartz; Darrell O. Ricke (MIT Lincoln Laboratory)
Towards Full-Stack Acceleration for Fully Homomorphic Encryption
Naifeng Zhang (Carnegie Mellon Univ.); Homer Gamil (New York Univ.); Patrick Brinich (Drexel Univ.); Benedict Reynwar (USC ISI); Ahmad Al Badawi (Duality Technologies); Negar Neda; Deepraj Soni (New York Univ.); Yuriy Polyakov (Duality Technologies); Patrick Broderick (SpiralGen, Inc.); Michail Maniatakos (New York Univ.); Andrew Schmidt (USC ISI); Mike Franusich (SpiralGen, Inc.); Jeremy Johnson (Drexel Univ.); Brandon Reagen (New York Univ.); David Bruce Cousins (Duality Technologies); Franz Franchetti (Carnegie Mellon Univ.)
 

5-3: Big Data and Distributed Computing 1 Session (14:15-15:30)

Co-Chairs: Sadas Shankar & Chansup Byun  
Invited Talk: Data-Driven Precision Neuroscience
Dr. John Reynders (Neumora)
Distributed Out-of-Memory SVD on CPU/GPU Architectures [Outstanding Paper Award]
Ismael Boureima; Manish Bhattarai; Maksim E Eren; Nick Solovyev; Hristo Djidjev; Boian Alexandrov (Los Alamos National Laboratory)
HuGraph: Acceleration of GCN Training on Heterogeneous FPGA Clusters with Quantization [Outstanding Student Paper Award]
Letian Zhao; Qizhe Wu; Xiaotian Wang; Teng Tian; Wei Wu; Xi Jin (Univ. of Science and Technology of China)
A Scalable Inference Pipeline for 3D Axon Tracing Algorithms
Benjamin M Fenelon; Lars Gjesteby (MIT Lincoln Laboratory); Webster Guan; Juhyuk Park; Kwanghun Chung (MIT); Laura Brattain (MIT Lincoln Laboratory)
Exploring the Impacts of Software Cache Configuration for In-line Data Compression
Sansriti Ranjan; Dakota Fulp; Jon C Calhoun (Clemson Univ.)
 

5-4: Big Data and Distributed Computing 2 Session (15:45-17:00)

Co-Chairs: Rich Vuduc & Nikos Pitsianis  
Invited Talk: HPC Graphs in the AWS Cloud
Roger Pearce (LLNL)
pPython for Parallel Python Programming
Chansup Byun; William Arcand; David Bestor; Bill Bergeron; Vijay Gadepally; Michael Houle; Matthew Hubbell; Hayden Jananthan; Michael Jones (MIT LLSC); Kurt Keville (MIT); Anna Klein; Peter Michaleas (MIT LLSC); Lauren Milechin (MIT); Guillermo Morales; Julie Mullen; Andrew Prout; Albert Reuther; Antonio Rosa; Siddharth Samsi; Charles Yee; Jeremy Kepner (MIT LLSC)
Arachne: An Arkouda Package for Large-Scale Graph Analytics
Oliver A Alvarado Rodriguez; Zhihui Du; Joseph T Patchett; Fuhuan Li; David Bader (New Jersey Inst. of Tech.)
The Viability of Using Online Prediction to Perform Extra Work while Executing BSP Applications
Po Hao Chen; Pouya Haghi; Jae Yoon Chung (Boston Univ.); Tong Geng (Univ. of Rochester); Richard West (Boston Univ.); Anthony Skjellum (UTC); Martin Herbordt (Boston Univ.)
Real-Time Software Architecture for EM-Based Radar Signal Processing and Tracking
Alan W Nussbaum (Georgia Tech, GTRI); Byron Keel (GTRI); William Dale Blair (GTRI, Georgia Tech); Umakishore Ramachandran (Georgia Tech)