Document similarities with cosine similarity
similarities = cosineSimilarity(documents) returns the pairwise cosine similarities for the specified documents using the tf-idf matrix derived from their word counts. The score in similarities(i,j) represents the similarity between documents(i) and documents(j).
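As a minimal sketch of this syntax, the snippet below tokenizes a few made-up sentences and computes their pairwise similarities. The example strings and the variable name str are assumptions for illustration; tokenizedDocument and cosineSimilarity require Text Analytics Toolbox.

```matlab
% Illustrative input: four short made-up documents, one per string element
str = [
    "the quick brown fox jumped over the lazy dog"
    "the fast brown fox jumped over the lazy dog"
    "the lazy dog sat there and did nothing"
    "the other animals sat there watching"];
documents = tokenizedDocument(str);

% Pairwise cosine similarities based on tf-idf weighted word counts
similarities = cosineSimilarity(documents);

% similarities(i,j) compares documents(i) with documents(j);
% full() converts to a dense matrix for display in case the result is sparse
disp(full(similarities))
```

Documents that share many terms, such as the first two sentences here, should receive a higher score than documents with little vocabulary in common.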
similarities = cosineSimilarity(documents,queries) returns similarities between documents and queries using tf-idf matrices derived from the word counts in documents. The score in similarities(i,j) represents the similarity between documents(i) and queries(j).
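The two-input form might be used as follows. This is a sketch under assumptions: the document and query strings are made up, and the queries are passed as a tokenizedDocument array.

```matlab
% Illustrative reference documents (made-up text)
documents = tokenizedDocument([
    "the quick brown fox jumped over the lazy dog"
    "the lazy dog sat there and did nothing"
    "the other animals sat there watching"]);

% Illustrative query documents (made-up text)
queries = tokenizedDocument([
    "a brown fox leaped over the lazy dog"
    "another fox leaped over the dog"]);

% Row i corresponds to documents(i), column j to queries(j)
similarities = cosineSimilarity(documents, queries);

% For each query, find the index of the most similar document
[~, bestMatch] = max(full(similarities), [], 1);
disp(bestMatch)
```

Because the tf-idf weights are derived from the word counts in documents, query words that never occur in documents are not expected to contribute to the scores.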
similarities = cosineSimilarity(bag) returns pairwise similarities for the documents encoded by the specified bag-of-words or bag-of-n-grams model using the tf-idf matrix derived from the word counts in bag. The score in similarities(i,j) represents the similarity between the ith and jth documents encoded by bag.
similarities = cosineSimilarity(bag,queries) returns similarities between the documents encoded by the bag-of-words or bag-of-n-grams model bag and queries using tf-idf matrices derived from the word counts in bag. The score in similarities(i,j) represents the similarity between the ith document encoded by bag and queries(j).
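The bag-based syntaxes can be sketched in the same way. Here the model is built with bagOfWords from made-up text, and the query is a single made-up tokenizedDocument; both are assumptions for illustration.

```matlab
% Build a bag-of-words model from illustrative documents
documents = tokenizedDocument([
    "the quick brown fox jumped over the lazy dog"
    "the lazy dog sat there and did nothing"
    "the other animals sat there watching"]);
bag = bagOfWords(documents);

% Pairwise similarities between the documents encoded by bag
pairwise = cosineSimilarity(bag);

% Similarities between each document in bag and one query document
queries = tokenizedDocument("a brown fox leaped over the lazy dog");
scores = cosineSimilarity(bag, queries);

disp(full(pairwise))
disp(full(scores))
```

Working from a bag model can be convenient when the word counts have already been computed, since the tokenized text does not need to be passed again.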
similarities = cosineSimilarity(M) returns similarities for the data encoded in the row vectors of the matrix M. The score in similarities(i,j) represents the similarity between M(i,:) and M(j,:).
similarities = cosineSimilarity(M1,M2) returns similarities between the documents encoded in the matrices M1 and M2. The score in similarities(i,j) corresponds to the similarity between M1(i,:) and M2(j,:).
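For the matrix syntaxes, a small numeric sketch may help. The matrices below are arbitrary illustrative values with matching column counts. The manual check assumes that, for matrix input, each score follows the standard cosine definition (dot product divided by the product of the Euclidean norms) with no additional reweighting.

```matlab
% Arbitrary example data: each row encodes one document as a numeric vector
M1 = [1 0 2; 0 3 1; 4 1 0];
M2 = [1 1 1; 0 2 2];

% Pairwise similarities between the rows of M1
S = cosineSimilarity(M1);

% Similarities between rows of M1 and rows of M2
S12 = cosineSimilarity(M1, M2);

% Manual computation for one pair of rows, under the assumption stated above
i = 1;
j = 2;
manual = dot(M1(i,:), M2(j,:)) / (norm(M1(i,:)) * norm(M2(j,:)));

% Compare the function result with the manual value
disp([full(S12(i,j)), manual])
```

This form is useful when the documents have already been converted to numeric features, for example tf-idf vectors or embeddings computed elsewhere.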
See also: tokenizedDocument | bleuEvaluationScore | rougeEvaluationScore | bm25Similarity | textrankScores | lexrankScores | mmrScores | extractSummary