Poster
Problem-Parameter-Free Federated Learning
Wenjing Yan · Kai Zhang · Xiaolu Wang · Xuanyu Cao
Hall 3 + Hall 2B #597
[
Abstract
]
Oral
presentation:
Oral Session 4E
Fri 25 Apr 12:30 a.m. PDT — 2 a.m. PDT
Thu 24 Apr 7 p.m. PDT
— 9:30 p.m. PDT
Fri 25 Apr 12:30 a.m. PDT — 2 a.m. PDT
Abstract:
Federated learning (FL) has garnered significant attention from academia and industry in recent years due to its advantages in data privacy, scalability, and communication efficiency. However, current FL algorithms face a critical limitation: their performance heavily depends on meticulously tuned hyperparameters, particularly the learning rate or stepsize. This manual tuning process is challenging in federated settings due to data heterogeneity and limited accessibility of local datasets. Consequently, the reliance on problem-specific parameters hinders the widespread adoption of FL and potentially compromises its performance in dynamic or diverse environments. To address this issue, we introduce PAdaMFed, a novel algorithm for nonconvex FL that carefully combines adaptive stepsize and momentum techniques. PAdaMFed offers two key advantages: 1) it operates autonomously without relying on problem-specific parameters; and 2) it manages data heterogeneity and partial participation without requiring heterogeneity bounds. Despite these benefits, PAdaMFed provides several strong theoretical guarantees: 1) It achieves state-of-the-art convergence rates with a sample complexity of O(ϵ−4) and communication complexity of O(ϵ−3) to obtain an accuracy of ||∇f(θ)||≤ϵ, even using constant learning rates; 2) these complexities can be improved to the best-known O(ϵ−3) for sampling and O(ϵ−2) for communication when incorporating variance reduction; 3) it exhibits linear speedup with respect to the number of local update steps and participating clients at each global round. These attributes make PAdaMFed highly scalable and adaptable for various real-world FL applications. Extensive empirical evidence on both image classification and sentiment analysis tasks validates the efficacy of our approaches.
Live content is unavailable. Log in and register to view live content