Concept
Push and Pull Operations in Distributed Training
Using a key-value store abstraction for distributed training allows for the management of many sets of gradients by indexing them with a key . This abstraction defines two main operations:
- push(key, value): Sends a particular gradient (the value) from a worker to a common storage where it is aggregated (e.g., by summation).
- pull(key, value): Retrieves the final aggregate gradient value from the common storage after combining the inputs from all workers.
This architecture shares characteristics with distributed key-value stores like Dynamo, facilitating efficient parameter distribution across multiple servers.
0
1
Updated 2026-05-18
Tags
D2L
Dive into Deep Learning @ D2L