Concept
Per-document Binarization
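Per-document binarization is used to prepare the training data for a binary Naive Bayes classifier. Within each document, every word's count is clipped to 1; that is, repeated occurrences of a word in the same document are dropped, so each distinct word is counted once per document. The remaining words from all documents are then grouped by the label of their document (positive or negative), and their frequencies are counted within each group. As a result, a word's count in a class equals the number of documents of that class that contain it, so it can still exceed 1 across documents.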
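The procedure can be illustrated with a minimal Python sketch. The function name `binarized_counts`, the `(tokens, label)` input format, and the toy documents are assumptions made for illustration only; they are not part of the original description.

```python
from collections import Counter


def binarized_counts(docs):
    """Count words per class after clipping each word to 1 per document.

    `docs` is an iterable of (tokens, label) pairs (hypothetical format).
    Returns a dict mapping each label to a Counter of word frequencies.
    """
    counts = {}
    for tokens, label in docs:
        # set(tokens) drops repeated words within this one document,
        # so each distinct word contributes at most 1 per document.
        counts.setdefault(label, Counter()).update(set(tokens))
    return counts


# Toy labelled documents (made up for this example).
docs = [
    (["great", "great", "fun", "film"], "positive"),
    (["fun", "plot", "fun"], "positive"),
    (["boring", "boring", "film"], "negative"),
]

counts = binarized_counts(docs)
print(counts["positive"]["great"])  # 1: repeated within a single document
print(counts["positive"]["fun"])    # 2: appears in two positive documents
print(counts["negative"]["boring"])  # 1
```

Here "great" occurs twice in one positive document but is counted once, while "fun" is counted twice because it appears in two different positive documents.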
Updated 2022-06-17
Tags
Data Science