Short Answer

Input Sequence Formatting Analysis

A junior data scientist has prepared the following input sequence for a model that needs to understand the relationship between two sentences: The sun is bright. [SEP] The sky is blue. [CLS]. Identify two distinct formatting errors in this sequence and explain the correct placement and purpose of the special tokens involved.

0

1

Updated 2025-10-04

Contributors are:

Who are from:

Tags

Ch.1 Pre-training - Foundations of Large Language Models

Foundations of Large Language Models

Foundations of Large Language Models Course

Computing Sciences

Analysis in Bloom's Taxonomy

Cognitive Psychology

Psychology

Social Science

Empirical Science

Science