Learn Before
Types of Parallelism in LLM Training
Pipeline Parallelism
Pipeline parallelism is a technique that mitigates the main inefficiency of basic model parallelism, in which only one device is active at any given moment while the others wait. It does so by overlapping computation across devices: the devices are arranged as a processing pipeline, so that multiple computational steps execute concurrently on the distributed hardware.
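The overlap can be illustrated with a small scheduling sketch. It is a toy model, not a training implementation, and it rests on assumptions that are not stated above: the input is split into independent chunks of work, each device (stage) takes one time step per chunk, and only the forward pass is modeled. The function names `pipeline_schedule` and `sequential_steps` are hypothetical.

```python
def pipeline_schedule(num_stages, num_chunks):
    """Return, per time step, the (stage, chunk) pairs that are active.

    Assumption: stage s can start chunk c as soon as stage s - 1 has
    finished it, so chunk c reaches stage s at time step c + s.
    """
    total_steps = num_stages + num_chunks - 1
    schedule = []
    for t in range(total_steps):
        active = [
            (stage, t - stage)
            for stage in range(num_stages)
            if 0 <= t - stage < num_chunks
        ]
        schedule.append(active)
    return schedule


def sequential_steps(num_stages, num_chunks):
    """Time steps needed when only one device is active at a time."""
    return num_stages * num_chunks


if __name__ == "__main__":
    stages, chunks = 4, 4
    for t, active in enumerate(pipeline_schedule(stages, chunks)):
        print(f"step {t}: {active}")
    print("pipelined steps: ", len(pipeline_schedule(stages, chunks)))
    print("sequential steps:", sequential_steps(stages, chunks))
```

Under these assumptions, 4 stages and 4 chunks finish in 4 + 4 - 1 = 7 time steps instead of 4 × 4 = 16, and in the middle step all four devices are busy at once. The first and last steps still leave some devices idle while the pipeline fills and drains.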
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Data Parallelism
Model Parallelism
Pipeline Parallelism
Learn After
Micro-batching in Pipeline Parallelism
Illustration of Pipeline Parallelism with Micro-batches