Example
Example: Gradient of with Respect to
Consider the scalar-valued function . When differentiated with respect to the column vector , the resulting gradient is . In deep learning frameworks, after allocating gradient memory and computing , calling the backward pass function calculates this gradient and stores it in the vector's gradient attribute.
0
1
Updated 2026-05-02
Tags
D2L
Dive into Deep Learning @ D2L