Repository logo
 

General hardware multicasting for fine-grained message-passing architectures

Accepted version
Peer-reviewed

Loading...
Thumbnail Image

Change log

Abstract

Manycore architectures are increasingly favouring message-passing or partitioned global address spaces (PGAS) over cache coherency for reasons of power efficiency and scalability. However, in the absence of cache coherency, there can be a lack of hardware support for one-to-many communication patterns, which are prevalent in some application domains. To address this, we present new hardware primitives for multicast communication in rack-scale manycore systems. These primitives guarantee delivery to both colocated and distributed destinations, and can capture large unstructured communication patterns precisely. As a result, reliable multicast transfers among any number of software tasks, connected in any topology, can be fully offloaded to hardware. We implement the new primitives in a research platform consisting of 50K RISC-V threads distributed over 48 FPGAs, and demonstrate significant performance benefits on a range of applications expressed using a high-level vertex-centric programming model.

Description

Journal Title

2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)

Conference Name

2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)

Journal ISSN

1066-6192
2377-5750

Volume Title

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Rights and licensing

Except where otherwised noted, this item's license is described as All rights reserved
Sponsorship
Engineering and Physical Sciences Research Council (EP/N031768/1)