Short Answer

Distinguishing Sequence-Level vs. Element-Level Functions

Consider two different functional approaches for processing a sequence of vectors yˉ1,yˉ2,,yˉk\bar{\mathbf{y}}_1, \bar{\mathbf{y}}_2, \dots, \bar{\mathbf{y}}_{k}.

Approach A uses a function s(yˉ1,yˉ2,,yˉk)s(\bar{\mathbf{y}}_1, \bar{\mathbf{y}}_2, \dots, \bar{\mathbf{y}}_{k}) which takes the entire sequence as input and produces a single output.

Approach B applies a function f()f(\cdot) to each vector individually, producing a sequence of outputs: f(yˉ1),f(yˉ2),,f(yˉk)f(\bar{\mathbf{y}}_1), f(\bar{\mathbf{y}}_2), \dots, f(\bar{\mathbf{y}}_{k}).

Explain the fundamental difference in the type of information these two approaches can capture from the input sequence.

0

1

Updated 2025-10-03

Contributors are:

Who are from:

Tags

Ch.4 Alignment - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science