Concept

Model Parallelism

Model parallelism is a technique used when a model is too large to be loaded and executed on a single device, making data parallelism impractical. Unlike data parallelism, which requires each worker to have a full copy of the model for both forward and backward passes, model parallelism involves partitioning the model itself into smaller components. These components are then distributed and run on different devices.

0

1

Updated 2026-04-21

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences