Example

Example of Character-Level n-Grams for Obfuscated Hate Speech

Token-based models often fail to recognize intentionally obfuscated hate speech, such as the phrase "$ ki11 yrslef a$$hole$". Because standard word-level tokenizers cannot process these variations, character-level n-grams are necessary features to detect disguised offensive language.

0

1

Updated 2026-06-13

Tags

Data Science