Learn Before
Short Answer

Analyzing Search Algorithm Behavior

Imagine a text generation model is producing a sequence. At the first step, the most probable next word is 'apple' with a log-probability of -0.8. The second most probable word is 'apricot' with a log-probability of -0.9. A simple greedy approach would select 'apple'. However, the best complete sequence actually starts with 'apricot'. Explain, in detail, the mechanism by which a search process that keeps track of multiple hypotheses at each step could arrive at the better overall sequence, even though it did not start with the most probable first word.

0

1

Updated 2025-10-02

Contributors are:

Who are from:

Tags

Data Science

Foundations of Large Language Models Course

Computing Sciences

Ch.5 Inference - Foundations of Large Language Models

Foundations of Large Language Models

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science

Related