removeStopWords
Remove stop words from documents
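Syntax
newDocuments = removeStopWords(documents)
newDocuments = removeStopWords(documents,'IgnoreCase',false)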
Description
Words like "a", "and", "to", and "the" (known as stop words) can add noise to data. Use this function to remove stop words before analysis.
The function supports English, Japanese, German, and Korean text. To learn how to use removeStopWords for other languages, see Language Considerations.
newDocuments = removeStopWords(documents) removes the stop words from the tokenizedDocument array documents. By default, the function uses the stop word list given by the stopWords function according to the language details of documents, and is case insensitive.
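As a minimal sketch of the default syntax, the snippet below tokenizes a made-up sentence and removes its stop words. The sentence and the variable name remainingWords are illustrative assumptions, and the 'Language','en' option is passed only to avoid relying on automatic language detection for such a short text.

```matlab
% Requires Text Analytics Toolbox. The sentence is a made-up example.
documents = tokenizedDocument("the cat and the dog went to a park", 'Language', 'en');

% Remove stop words using the default list (case insensitive).
newDocuments = removeStopWords(documents);

% For a scalar document, string returns the remaining words as a string vector.
remainingWords = string(newDocuments);
disp(remainingWords)
```

Words such as "the", "and", "to", and "a" should no longer appear in remainingWords, while content words such as "cat" and "dog" are kept.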
To remove a custom list of words, use the removeWords function.
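For comparison, a custom list can be removed with removeWords. This is a sketch only: the sentence and the customWords list are hypothetical and chosen for illustration.

```matlab
% Requires Text Analytics Toolbox. The sentence and word list are made up.
documents = tokenizedDocument("an example of a short sentence", 'Language', 'en');

% Hypothetical custom list of words to remove.
customWords = ["short" "example"];

% removeWords removes exactly the listed words; other words are left in place.
withoutCustom = removeWords(documents, customWords);
disp(string(withoutCustom))
```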
newDocuments = removeStopWords(documents,'IgnoreCase',false) removes only the stop words whose case matches the stop word list given by the stopWords function.
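The effect of the 'IgnoreCase' option can be sketched by running both syntaxes on the same document. The sentence and variable names are illustrative assumptions. The comments assume the English stop word list stores its entries in lowercase, so capitalized variants are expected to survive case-sensitive removal; check the output of stopWords to confirm this for your release.

```matlab
% Requires Text Analytics Toolbox. The sentence is a made-up example.
documents = tokenizedDocument("The cat and THE dog", 'Language', 'en');

% Default behavior: case-insensitive matching against the stop word list.
insensitive = removeStopWords(documents);

% Case-sensitive matching: only words whose case matches the list are removed.
% Assumption: list entries are lowercase, so "The" and "THE" would be kept here.
sensitive = removeStopWords(documents, 'IgnoreCase', false);

disp(string(insensitive))
disp(string(sensitive))
```

Case-sensitive removal can never remove more words than the default, so sensitive contains at least as many tokens as insensitive.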
Tip
Use removeStopWords before using the normalizeWords function, because removeStopWords uses information that normalizeWords removes.
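The recommended ordering might look like the following sketch. The sentence and the variable names are assumptions made for illustration; normalizeWords is called with its default settings.

```matlab
% Requires Text Analytics Toolbox. The sentence is a made-up example.
documents = tokenizedDocument("the dogs are running to the park", 'Language', 'en');

% Step 1: remove stop words while the original token information is intact.
withoutStopWords = removeStopWords(documents);

% Step 2: normalize the remaining words (default settings).
cleaned = normalizeWords(withoutStopWords);

disp(string(cleaned))
```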