Optimal Client Sampling for Federated Learning

Chen, Wenlin; Horváth, Samuel; Richtárik, Peter

doi:10.17863/CAM.89518

Optimal Client Sampling for Federated Learning

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/342118

Repository DOI

https://doi.org/10.17863/CAM.89518

Files

Accepted version (1.66 MB)

Type

Article

Authors

Chen, Wenlin

https://orcid.org/0000-0002-3759-1858

Horváth, Samuel

Richtárik, Peter

Abstract

It is well understood that client-master communication can be a primary bottleneck in federated learning (FL). In this work, we address this issue with a novel client subsampling scheme, where we restrict the number of clients allowed to communicate their updates back to the master node. In each communication round, all participating clients compute their updates, but only the ones with "important" updates communicate back to the master. We show that importance can be measured using only the norm of the update and give a formula for optimal client participation. This formula minimizes the distance between the full update, where all clients participate, and our limited update, where the number of participating clients is restricted. In addition, we provide a simple algorithm that approximates the optimal formula for client participation, which allows for secure aggregation and stateless clients, and thus does not compromise client privacy. We show both theoretically and empirically that for Distributed SGD (DSGD) and Federated Averaging (FedAvg), the performance of our approach can be close to full participation and superior to the baseline where participating clients are sampled uniformly. Moreover, our approach is orthogonal to and compatible with existing methods for reducing communication overhead, such as local methods and communication compression methods.

Journal Title

Transactions on Machine Learning Research

Publisher URL

https://openreview.net/forum?id=8GvRCWKHIL

Rights

Attribution 4.0 International

Collections

University of Cambridge Research Outputs (Articles and Conferences)

Optimal Client Sampling for Federated Learning

Accepted version

Peer-reviewed

Repository URI

Repository DOI

Files

Type

Change log

Authors

Abstract

Description

Keywords

Journal Title

Conference Name

Journal ISSN

Volume Title

Publisher

Publisher DOI

Publisher URL

Rights

Collections