

Oral

Improving the Efficiency of Distributed Training using Sparse Parameter Averaging

Matt Beton · Seth Howes · Alex Cheema · Mohamed Baioumy

Keywords: [ generative models ] [ style transfer ] [ controllable text generation ] [ conditional generative models ]

2025 Oral

Abstract:
