Application of the tf-idf vector models
The tf-idf model can be used to compute word similarity by taking the cosine of two word vectors.
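For reference, the cosine of two word vectors is conventionally defined as their dot product divided by the product of their lengths. The symbols $v$, $w$ and the dimension $N$ are notation assumed here for illustration, not taken from the text above:

```latex
\cos(v, w) = \frac{v \cdot w}{\lVert v \rVert \, \lVert w \rVert}
           = \frac{\sum_{i=1}^{N} v_i w_i}{\sqrt{\sum_{i=1}^{N} v_i^2} \, \sqrt{\sum_{i=1}^{N} w_i^2}}
```

Because tf-idf weights are non-negative, this value falls between 0 (no shared dimensions) and 1 (same direction).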
The tf-idf vector models can also be used to decide whether two documents are similar. For each document we want to compare, we compute the centroid of the vectors of all the words in that document. Given $k$ word vectors $w_1, w_2, \ldots, w_k$, the centroid document vector $d$ is:
$$d = \frac{w_1 + w_2 + \cdots + w_k}{k}$$
Then we can compute the cosine between the two centroid document vectors to estimate the similarity of the two documents: the higher the cosine, the more similar the documents.
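The document comparison described here can be sketched in a few lines of Python with NumPy. This is a minimal illustration, not a definitive implementation: the helper names `cosine` and `centroid` are hypothetical, and the word vectors are made-up toy numbers standing in for real tf-idf weights.

```python
import numpy as np

def cosine(v, w):
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    v = np.asarray(v, dtype=float)
    w = np.asarray(w, dtype=float)
    denom = np.linalg.norm(v) * np.linalg.norm(w)
    return float(v @ w / denom) if denom else 0.0

def centroid(word_vectors):
    """Mean of a document's word vectors (one row per word)."""
    return np.mean(np.asarray(word_vectors, dtype=float), axis=0)

# Toy tf-idf word vectors (assumed values, 3 dimensions each).
doc_a = [[1.0, 0.0, 2.0], [3.0, 0.0, 0.0]]   # centroid: [2, 0, 1]
doc_b = [[2.0, 1.0, 0.0], [2.0, 1.0, 2.0]]   # centroid: [2, 1, 1]
doc_c = [[0.0, 5.0, 0.0]]                    # centroid: [0, 5, 0]

sim_ab = cosine(centroid(doc_a), centroid(doc_b))
sim_ac = cosine(centroid(doc_a), centroid(doc_c))

# doc_a should score as more similar to doc_b than to doc_c.
print(sim_ab > sim_ac)
```

In this toy data, documents A and B share weight on the first and third dimensions, so their centroids point in nearly the same direction, while document C has weight only on the second dimension and is orthogonal to A.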
Updated 2021-10-15
Tags
Data Science