2019 IEEE High Performance Extreme Computing Conference (HPEC ‘19) Twenty-third Annual HPEC Conference 24 - 26 September 2019 Westin Hotel, Waltham, MA USA
GPU Applications and Technology 10:20-12:00 in Eden Vale A1/A2 Chair: David Cousins Session Abstracts Progressive Optimization of Batched LU Factorization on GPUs Ahmad Abdelfattah, Stanimire Tomov, and Jack Dongarra (ICL UTK) [Best Student Paper Finalist] Low Overhead Instruction Latencies Characterization for NVIDIA GPGPUs Yehia Arafa (NMSU), Abdel-Hameed A. Badawy (NMSU, LANL), Gopinath Chennupati (LANL), Nandakishore Santhi (LANL), Stephan Eidenbenz (LANL) Distributed Deep Learning for Precipitation Nowcasting Siddharth Samsi, Christopher J. Mattioli, Mark S. Veillette (MIT-LL) Embedded GPU Cluster Computing Framework for Inference of Convolutional Neural Networks Evan Kain (SHREC @ Pitt), Diego Wildenstein (Arizona State Univ.), Andrew C. Pineda (AFRL) Heterogeneous Cache Hierarchy Management for Integrated CPU-GPU Architecture Hao Wen (Virginia Commonwealth University); Wei Zhang (Virginia Commonwealth University)* Graphs, Networks & Sparse Data 1 10:20-12:00 in Eden Vale C1 Chair: Michael Wolf / Sandia Session Abstracts Invited Talk: Brilliant Technologists Building Cool Stuff Mr. Stan Reiss (Matrix Partners) [Best Paper Finalist] Efficient Implementation of Sparse Matrix- Sparse Vector Multiplication for Large Scale Graph Analytics Mauricio Serrano (IBM) Combinatorial Multigrid: Advanced Preconditioners For Ill-Conditioned Linear Systems M. Harper Langston, Pierre-David Letourneau, Richard Lethin, James Ezick, Mitchell S Harris (Reservoir Labs) Concurrent Katz Centrality for Streaming Graphs Chunxing Yin, Jason Riedy (Georgia Institute of Technology) Graph Algorithms in PGAS: Chapel and UPC++ Jesun S Firoz, Louis Jenkins, Marcin Zalewski, Cliff Joslyn, Mark Raugas (PNNL) HPC 10:20-12:00 in Eden Vale C3 Chair: Seung Woo Son / UMass Lowell Session Abstracts Scalable Solvers for Cone Complementarity Problems in Frictional Multibody Dynamics Saibal De (Univ. Michigan), Eduardo Corona (NYIT), Paramsothy Jayakumar (US Army), Shravan Veerapaneni (Univ. Michigan) Large Scale Parallelization Using File-Based Communications Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther (MIT-LL) Fast Large-Scale Algorithm for Electromagnetic Wave Propagation in 3D Media Mitchell Harris (Reservoir Labs), M. Harper Langston (Reservoir Labs), Pierre-David Létourneau (Reservoir Labs), George Papanicolaou (Stanford), James Ezick (Reservoir Labs), Richard Lethin(Reservoir Labs) Towards Improving Rate-Distortion Performance of Transform Based Lossy Compression for HPC Datasets Jialing Zhang, Aekyeung Moon, Xiaoyan Zhuo, Seung Woo Son (UMass Lowell) A Parallel Simulation Approach to ACAS X Development Adam Gjersvik and Robert J. Moss (MIT-LL) Lunch 12:00 - 1:00 Emerson View Posters & Demos 1 12:00-1:00 in Foyer Session Abstracts Embedded Processor-In-Memory Architecture for Accelerating Arithmetic Operations Richard Muri, Paul Fortier (UMass Dartmouth) FFTX for Micromechanical Stress-Strain Analysis Anuva Kulkarni, Daniele G. Spampinato, Franz Franchetti (CMU) ECG Feature Processing Performance Acceleration on SLURM Compute Systems Michael Nolan, Mark Hernandez, Philip Fremont-Smith, Albert Swiston, Kajal Claypool (MIT-LL) Challenges in Securing High-Performance Computing Systems Pruthvy Yellu, Zhiming Zhang, Mohammad Mezanur Rahman Monjur, Ranuli Abeysinghe, Qiaoyan Yu (UNH) Large Scale Organization and Inference of an Imagery Dataset for Public Safety Jeffrey Liu, David Strohschein, Siddharth Samsi, Andrew Weinert (MIT-LL) Deep-Learning Inferencing with High-Performance Hardware Accelerators Luke Kljucaric, Alan D. George (SHREC @ Pitt) Projecting Quantum Computational Advantage versus Classical State-of-the-Art Jason Larkin, Daniel Justice (CMU SEI) Resilience-Aware Decomposition and Monitoring of Large-Scale Embedded Systems Miguel Mark (Boston Univ.), David Whelihan (MIT-LL), Michael Vai (MIT-LL), Haley Whitman (MIT-LL), Michel Kinsy (Boston Univ.) Road Traffic Anomaly Detection using Functional Data Analysis George Tsitsopoulos (Northeastern), Eric Truslow (MIT-LL), Dimitris Manolakis (MIT-LL) Overcoming Limitations of GPGPU-Computing in Scientific Applications Connor Kenyon, Glenn Volkema, Gaurav Khanna (UMass Dartmouth) Optimizing the Visualization Pipeline of a 3D Monitoring and Management System Rebecca Wild, Matthew Hubbell, Jeremy Kepner (MIT-LL) Skip the Intersection: Quickly Counting Common Neighbors on Shared-Memory Systems Xiaojing An, Kasimir Gabert, James Fox, Oded Green, David A. Bader (Georgia Tech) [Graph Challenge Finalist] Fast Parallel Subgraph Matching on GPUs Leyuan Wang, John D. Owens (UC Davis) [Graph Challenge Finalist] Performance of Training Sparse Deep Neural Networks on GPUs Jianzong Wang, Zhangcheng Huang, Lingwei Kong, Jing Xiao (Ping An Technology (Shenzhen)), Pengyu Wang, Lu Zhang, Chao Li (Shanghai Jiao Tong University) [Graph Challenge Honorable Mention] Fast Triangle Counting on GPU Chuangyi Gui, Long Zheng, Pengcheng Yao, Xiaofei Liao, Hai Jin (Huazhong Univ. of Science and Technology) Wednesday, September 25 Afternoon Scaling HPC Education 1:00-2:40 in Eden Vale C2 Chair: Julie Mullen / MIT-LL and Lauren Milechin / MIT-EAPS Invited Talk: Humans in Scaling Research Computing Facilitation and Education Robert Freeman (Harvard Business School) Invited Talk: A Workforce Development Strategy for Research Computing Julie Ma (Northeast Cyberteam Initiative) Invited Talk: Developing HPC Skills across the University Community Brian Gregor (Boston University) Invited Talk: SPOC to MOOC, Extending Local Training to the HPC Community Julie Mullen (MIT-Lincoln Laboratory); Lauren Milechin ( MIT Earth, Atmospheric and Planetary Sciences) Al 1 1:00-2:40 in Eden Vale A1/A2 Chair: Paul Monticciolo / MIT LL Session Abstracts Invited Talk: Deploying High-Performance Deep Learning Applications Jaya Shankar (MathWorks) Survey and Benchmarking of Machine Learning Accelerators Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, and Jeremy Kepner (MIT-LL) Application of Approximate Matrix Multiplication to Neural Networks and Distributed SLAM Brian Plancher (Harvard), Camelia D. Brumar (WPI), Iulian Brumar (Harvard), Lillian Pentecost (Harvard), Saketh Rama (Harvard), David Brooks (Harvard) Low Power Computing and Simultaneous Electro-Optical/Radar Data Processing using IBM’s NS16e 16-chip Neuromorphic Hardware Mark Barnell (AFRL); Courtney Raymond (AFRL), Daniel Brown (SRC), Matthew Wilson (SRC), Eric R Cote (SRC) TapirXLA: Embedding Fork-Join Parallelism into the XLA Compiler in TensorFlow Using Tapir Tao B. Schardl (MIT CSAIL), Siddharth Samsi (MIT-LL) Graphs, Networks & Sparse Data 2 1:00-2:40 in Eden Vale C1 Chair: Michael Wolf / Sandia Session Abstracts Fast and Scalable Distributed Tensor Decompositions Muthu Baskaran, Thomas Henretty, James Ezick (Reservoir Labs) On Computing with Diagonally Structured Matrices Shahadat Hossain, Mohammad Sakib Mahmud (Univ. Lethbridge) Spaceland Embedding of Sparse Stochastic Graphs Nikos Pitsianis (Aristotle Univ., Duke), Alexandros-Stavros Iliopoulos (Aristotle Univ.), Dimitris Floros (Aristotle Univ.), Xiaobai Sun (Duke) Partitioning Graphs for the Cloud using Reinforcement Learning Mohammad Hasanzadeh Mofrad (CMU), Rami Melhem (CMU), Mohammad Hammoud (CMU Qatar) Improving Parallelism of Breadth First Search (BFS) Algorithm for Accelerated Performance on GPUs Hao Wen, Wei Zhang (VCU) Manycore 1:00-2:40 in Eden Vale C3 Chair: David Cousins / TBD Session Abstracts Improving Scheduling for Irregular Applications with Logarithmic Radix Binning James Fox (Georgia Tech), Alok Tripathy (Georgia Tech), and Oded Green (Georgia Tech, Nvidia) An Efficient and Composable Parallel Task Programming Library Chun-Xun Lin, Tsung-Wei Huang, Guannan Guo, Martin D. F. Wong (UIUC) MeXT: A Flow for Multiprocessor Exploration Christophe Bobda (Univ. Florida), Harold Ishebabi (Univ. Potsdam), Philipp Mahr (Univ. Potsdam), Joel Mandebi Mbongue (Univ. Florida) Optimizing Xeon Phi for Interactive Data Analysis Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, William Bergeron, Matthew Hubbell, Vijay Gadepally, Michael Houle, Michael Jones, Anne Klein, Lauren Milechin, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther (MIT) Automatic Parallelization to Asynchronous Task-Based Runtimes Through a Generic Tile-based API Charles Jin, Muthu Baskaran, Benoit Meister, Jonathan Springer (Reservoir Labs) Break 2:40-3:00 IEEE/MIT/Amazon Graph Challenge Awards 3:00-5:00 in Eden Vale B Chair: Jeremy Kepner/ MIT Lincoln Laboratory Session Abstracts [Champion] A GPU Implementation of the Sparse Deep Neural Network Graph Challenge Mauro Bisson, Massimiliano Fatica (NVIDIA) [Champion] H-INDEX: Hash-Indexing for Parallel Triangle Counting on GPUs Santosh Pandey (UMass Lowell), Xiaoye S. Li, Aydin Buluc (LBNL), Jiejun Xu (HRL), Hang Liu (UMass Lowell) [Champion] Write Quick, Run Fast: Sparse Deep Neural Network in 20 Minutes of Development Time via SuiteSparse:GraphBLAS Timothy A. Davis, Mohsen Aznaveh, and Scott Kolodziej (Texas A&M Univ.) [Champion] Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU Mark Blanco, Tze Meng Low (CMU), Kyungjoo Kim (Sandia) [Champion] One Quadrillion Triangles Queried on One Million Processors Roger Pearce, Trevor Steil, Benjamin Priest, Geoffrey Sanders (LLNL) [Innovation Award] Linear Algebra-Based Triangle Counting via Fine-Grained Tasking on Heterogeneous Environments Abdurrahman Yasşar (GA Tech), Sivasankaran Rajamanickam, Jonathan Berry, Michael Wolf (Sandia), Jeffrey S. Young, Ümit V. Çatalyürek (GA Tech) [Innovation Award] Scalable Triangle Counting on Distributed-Memory Systems Seher Acer (Sandia), Abdurrahman Yasşar (GA Tech), Sivasankaran Rajamanickam, Michael Wolf (Sandia), Ümit V. Çatalyürek† (GA Tech) [Innovation Award] Scalable Inference for Sparse Deep Neural Networks using Kokkos Kernels J. Austin Ellis and Sivasankaran Rajamanickam (Sandia) [Innovation Award] Scaling and Quality of Modularity Optimization Methods for Graph Clustering Sayan Ghosh, Mahantesh Halappanavar, Antonino Tumeo (PNNL), Ananth Kalyanaraman (Washington State Univ.) [Innovation Award] Direction-Optimizing Label Propagation Algorithm Xu Liu (Washington State Univ., PNNL), Jesun Sahariar Firoz, Marcin Zalewski, Mahantesh Halappanavar, Kevin J. Baker (PNNL), Andrew Lumsdaine (PNNL, Univ. of Washington), Assefaw H. Gebremedhin∗ (Washington State Univ.) [Student Innovation Award] Fast Stochastic Block Partitioning via Sampling Frank Wanye (Virginia Tech), Vitaliy Gleyzer (MIT LL), Wu-chun Feng (Virginia Tech) [Student Innovation Award] Accelerating DNN Inference with GraphBLAS and the GPU Xiaoyun Wang, Zhongyi Lin, Carl Yang, John D. Owens (UC Davis) [Student Innovation Award] Update on k-truss Decomposition on GPU Mohammad Almasri, Omer Anjum, Carl Pearson, Zaid Qureshi, Vikram S. Mailthody, Rakesh Nagi (UIUC), Jinjun Xiong (IBM TJ Watson), and Wen-mei Hwu (UIUC) [Student Innovation Award] DistTC: High Performance Distributed Triangle Counting Loc Hoang, Vishwesh Jatala, Xuhao Chen, Udit Agarwal, Roshan Dathathri, Gurbinder Gill, Keshav Pingali (UT Austin) Awards Session 5:00-5:30 in Eden Vale B Chair: Jeremy Kepner/ MIT Lincoln Laboratory Best Student Paper Award Presentation 5:00 in Eden Vale B Chair: Brian Sroka / MITRE Best Paper Award Presentation 5:10 in Eden Vale B Chair: Jeremy Kepner/ MIT Lincoln Laboratory IEEE Innovation in Societal Infrastructure Award 5:15 in Eden Vale B Chair: Jeremy Kepner/ MIT Lincoln Laboratory Invited Talk: Large-Scale Graph Processing Prof. Julian Shun (MIT CSAIL) IEEE Innovation in a Societal Infrastructure John Walz / IEEE Reception; View Posters & Demos; Attend BoFs 5:30-8:00 in Emerson & Foyer & Eden Vale Chair: Kurt Keville / MIT GraphBLAS BoF 6:00-7:00 in Eden Vale C1/C2 Co-Chairs: Timothy Mattson / Intel, Marcin Zalewski / PNNL, Scott McMillan / CMU Vendor Demo SpiralGen
Wednesday, September 25 Morning
Plenary Session 9:00am-10:00am in Eden Vale B Chair: TBD / MIT Keynote Speaker: Future Computing Systems Mr. Michael Rosenfield (IBM Vice President, Data Centric Solutions) Break 10:00 - 10:20