Home > News > Webinar on the A64FX processor: Understanding streaming kernels and sparse matrix-vector multiplication

Webinar on the A64FX processor: Understanding streaming kernels and sparse matrix-vector multiplication

2020/11/19Beatrice CalossoNews

The A64FX CPU powers the current #1 supercomputer on the Top500 list. Although it is a traditional cache-based multicore processor, its peak performance and memory bandwidth rival accelerator devices. Generating efficient code for such a new architecture requires a good understanding of its performance features. Using these features, the Erlangen Regional Computing Center (RRZE) team detailed how they construct the Execution-Cache-Memory (ECM) performance model for the A64FX processor in the FX700 supercomputer and validate it using streaming loops. They described how the machine model points to peculiarities in the microarchitecture to keep in mind when optimizing applications, and how, applying the ECM model to sparse matrix-vector multiplication (SpMV), they motivate why the CRS matrix storage format is inappropriate and how the SELL-C-sigma format can achieve bandwidth saturation for SpMV. In this context, they also looked into some code optimization strategies that are relevant for A64FX and compare SpMV performance with AMD Rome, Intel Cascade Lake and NVIDIA V100. This webinar, organized the 18th of November 2020 by the European Energy-Oriented Center of Excellence (EoCoE), had been hosted by Christie L. Alappat, PhD student at the RRZE, and Dr. Georg Hager, senior researcher in the HPC division at RRZE.

Go to the video

Bridging Energy Research and High Performance Computing: a Joint EERA-CASTIEL2 Exchange Event

2025/07/04 13:45

02 Jul 2025 The European Energy Research Alliance (EERA) and CASTIEL2, representing the network of EuroHPC National Competence Centres and the... Read more →

Podcast about EoCoe for New Supercomputing in Europe

2025/07/03 11:16

🟢 Spotify: https://lnkd.in/dfuqXXY6 🟣 Apple podcasts: https://lnkd.in/djy_nr3a RSS feed to add to your favourite podcast app: https://lnkd.in/dADpkW_c In this episode... Read more →

GYSELA-X Webinar Code of the month

2025/05/22 16:09

GYSELA-X, a code used to model turbulent transport in tokamak plasmas, will be presented during a webinar on 25th June... Read more →

In-situ data processing with DASK: Public Webinar

2025/03/21 09:06

27th March 2025, 10 am In-situ data processing with DASK Webinar organized by CASTIEL2 Abstract: In situ data processing refers to... Read more →

EoCoE-Parflow: Public Webinar

2025/03/21 09:00

26th March 2025, 11 am Code of the Month vol.15: EoCoE-Parflow Organized by CASTIEL2 and EoCoE ParFlow is a parallel,... Read more →

Bridging Energy Research and High Performance Computing: a Joint EERA-CASTIEL2 Exchange Event

2025/07/04 13:45

02 Jul 2025 The European Energy Research Alliance (EERA) and CASTIEL2, representing the network of EuroHPC National Competence Centres and the... Read more →

Podcast about EoCoe for New Supercomputing in Europe

2025/07/03 11:16

🟢 Spotify: https://lnkd.in/dfuqXXY6 🟣 Apple podcasts: https://lnkd.in/djy_nr3a RSS feed to add to your favourite podcast app: https://lnkd.in/dADpkW_c In this episode... Read more →

GYSELA-X Webinar Code of the month

2025/05/22 16:09

GYSELA-X, a code used to model turbulent transport in tokamak plasmas, will be presented during a webinar on 25th June... Read more →

In-situ data processing with DASK: Public Webinar

2025/03/21 09:06

27th March 2025, 10 am In-situ data processing with DASK Webinar organized by CASTIEL2 Abstract: In situ data processing refers to... Read more →

EoCoE-Parflow: Public Webinar

2025/03/21 09:00

26th March 2025, 11 am Code of the Month vol.15: EoCoE-Parflow Organized by CASTIEL2 and EoCoE ParFlow is a parallel,... Read more →