Concept

Asynchronous Training Trade-offs

Asynchronous training can be employed to manage heterogeneity in computational resources among nodes, mitigating synchronization delays. However, this approach has significant trade-offs, as it may lead to the use of outdated 'stale' gradients for model updates, which in turn can result in non-guaranteed convergence of the training process.

0

1

Updated 2026-04-21

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences