Multiple Choice

A vector of raw, unnormalized scores [1000, 1002, 999] is passed as input to a computational function that converts these scores into a probability distribution. A common technique to prevent numerical errors is to first subtract the maximum value of the vector from every element before applying the main transformation (exponentiation). Why is this subtraction step crucial for handling large input values?
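The setup described in the question can be sketched in a few lines of Python. This is only an illustrative sketch, assuming the "computational function" is the standard softmax; the function names `softmax_naive` and `softmax_stable` are hypothetical and not taken from the course material. It uses only the standard library, where `math.exp` raises `OverflowError` when the result cannot be represented as a float.

```python
import math


def softmax_naive(scores):
    # Exponentiate the raw scores directly (no shift applied).
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_stable(scores):
    # Subtract the maximum first, so the largest exponent is exp(0) = 1.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


scores = [1000, 1002, 999]

# The naive version fails: exp(1000) exceeds the double-precision range.
try:
    softmax_naive(scores)
    naive_overflowed = False
except OverflowError:
    naive_overflowed = True

# The shifted version exponentiates [-2, 0, -3] instead.
probs = softmax_stable(scores)
print(naive_overflowed, probs)
```

Because every exponential is multiplied by the same factor exp(-max), the factor cancels in the ratio, so the shifted version returns the same distribution the naive one would produce if it could be computed.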

Updated 2025-10-07

Tags

Data Science

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science