SC23 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Workshops Archive

Addressing Stale Gradients in Scalable Federated Deep Reinforcement Learning


Workshop: Workshop on Machine Learning with Graphs in High Performance Computing Environments

Authors: Justin Stanley and Ali Jannesari (Iowa State University)


Abstract: Advancements in reinforcement learning (RL) via deep neural networks have enabled their application to a variety of real-world problems. However, these applications often suffer from long training times. While attempts to distribute training have been successful in controlled scenarios, they face challenges in heterogeneous-capacity, unstable, and privacy critical environments. This work applies concepts from federated learning (FL) to distributed RL, specifically addressing the stale gradient problem. A deterministic framework for asynchronous federated RL is utilized to explore dynamic methods for handling stale gradient updates in the Arcade Learning Environment. Experimental results from applying these methods to two Atari-2600 games demonstrate a relative speedup of up to 95% compared to plain A3C in large and unstable federations.





Back to Workshop on Machine Learning with Graphs in High Performance Computing Environments Archive Listing



Back to Full Workshop Archive Listing