Learn Before
Formula for Neural Network Memory Update
When a memory component operates as a neural network, its update mechanism can be formulated as a general mathematical function: . Here, and denote the memory outputs at the current and prior steps, while is a set of key-value pairs representing the recent states of the model. This flexible formulation enables the creation of diverse memory models by altering the function and the contents of .

0
1
Tags
Ch.2 Generative Models - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Related
Formula for Neural Network Memory Update
A computational model is designed to process a sequence of items one by one. To keep a running summary of the sequence, it uses a specific neural network as a memory component. At each step, this network updates its internal state. Suppose at step
t=5, the memory network has just produced an output representing its state, which we'll callMem_prior. The main model has also processed the fifth item in the sequence, resulting in a current state representation calledS_current. To generate the new memory state for the next step, what inputs should be fed into the memory network?An attention mechanism uses a neural network to maintain a memory of the information it has processed. Arrange the following events in the correct chronological order for a single update step of this memory component.
An engineer is developing a text summarization model that processes a document sentence by sentence. The model uses a special neural network as a memory component to keep track of the document's overall context. The engineer observes that the model generates excellent summaries for short articles but produces incoherent summaries for long articles, often forgetting information from the initial paragraphs. The main model components responsible for processing individual sentences are confirmed to be working correctly. Based on this observation, which of the following is the most likely malfunction within the memory component's update process?
Learn After
A system processes information in discrete steps and updates its internal memory at each step. The update is performed by a neural network that follows the general rule:
New_Memory = Update_Function(Current_State, Previous_Memory). If the system is currently processing Step 10, what specific inputs are required by theUpdate_Functionto calculate the memory for this step?Analysis of Memory Update Inputs
Diagnosing a Faulty Memory Update Mechanism
RNN-like Configuration for Neural Memory