Concept

Bag-of-Words Assumption

The bag-of-words assumption is used to simplify the computation of the Naive Bayes classifier. It assumes that the positions of words in a document do not affect its classification. Based on this assumption, the features of a document can be represented merely by word occurrences, completely discarding their relative ordering. Hence, the likelihood function P(dc)P(d|c) can be written as P(f1,,fnc)P(f_1, \dots, f_n|c), treating the document as an unordered collection of word features.

0

1

Updated 2026-06-14

Tags

Data Science