1Cademy - A researcher is debugging a language model where the input representation for each token is created by summing three distinct vectors: one for the tokens identity, one for its position in the sequence, and one for the sentence segment it belongs to. The researcher observes that the model treats the sentences The scientist observed the star and The star observed the scientist as having identical meanings. Which of the three component vectors is most likely being calculated incorrectly or omi

Learn Before

Input Embedding Formula in BERT-like Models

Multiple Choice

A researcher is debugging a language model where the input representation for each token is created by summing three distinct vectors: one for the token's identity, one for its position in the sequence, and one for the sentence segment it belongs to. The researcher observes that the model treats the sentences 'The scientist observed the star' and 'The star observed the scientist' as having identical meanings. Which of the three component vectors is most likely being calculated incorrectly or omi

Updated 2025-09-28

Contributors are:

Who are from:

Learn Before

Related