Learn Before
Theory

Recurrent Models

Recurrent models are a class of neural networks designed to process sequential data. They operate by maintaining a hidden state, denoted as 'h', which captures information from previous elements in the sequence. At each time step 'i', the model updates its state to 'h_i' by applying a function 'f' to the previous state 'h_{i-1}' and the current input 'input_i'. This relationship is formally expressed by the recurrence relation: hi=f(hi1,inputi)h_i = f(h_{i-1}, \text{input}_i) This mechanism allows the model to have a 'memory' of past events, making it suitable for tasks involving sequences like natural language processing.

Image 0

0

1

Updated 2026-04-22

Contributors are:

Who are from:

Tags

Ch.2 Generative Models - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Related