Learn Before
Handling Unfamiliar Words in LLM Inputs
A large language model is given the input sentence: 'The scientist discovered a new subatomic particle, the 'quarkino'.' The word 'quarkino' is a neologism and does not exist in the model's original training vocabulary. Based on your understanding of how text is prepared for processing, explain the most likely method the system uses to create a numerical representation for the unfamiliar word 'quarkino' and contrast this with how it would represent a common word like 'scientist'.
0
1
Tags
Ch.3 Prompting - Foundations of Large Language Models
Foundations of Large Language Models
Foundations of Large Language Models Course
Computing Sciences
Analysis in Bloom's Taxonomy
Cognitive Psychology
Psychology
Social Science
Empirical Science
Science
Related
Mechanism of Prompt Tuning at the Embedding Layer
A large language model is processing the following two sentences: 'The cat sat on the mat' and 'The feline rested on the rug'. Assuming the model has been well-trained, which of the following statements best analyzes how the initial numerical representations for the tokens 'cat' and 'feline' would relate to each other before being processed by the main model layers?
A user provides a sentence as input to a large language model. Arrange the following initial processing steps in the correct chronological order before the model begins its main computational tasks.
Handling Unfamiliar Words in LLM Inputs