Language Support

Information on language support in Text Analytics Toolbox™

Text Analytics Toolbox supports the languages English, Japanese, German, and Korean. Most Text Analytics Toolbox functions also work with text in other languages. For more information, see Language Considerations.

Functions

tokenizedDocument Array of tokenized documents for text analysis
removeStopWords Remove stop words from documents
normalizeWords Stem or lemmatize words
stopWords List of stop words
mecabOptions Options for MeCab tokenization
tokenDetails Details of tokens in tokenized document array
addSentenceDetails Add sentence numbers to documents
addPartOfSpeechDetails Add part-of-speech tags to documents
addEntityDetails Add entity tags to documents
addLemmaDetails Add lemma forms of tokens to documents
addLanguageDetails Add language identifiers to documents
corpusLanguage Detect language of text
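
As a rough illustration of how several of the functions listed above might fit together, the following minimal sketch tokenizes one English sentence and applies a few language-dependent steps. The sample sentence and the variable names are made up for illustration, and the order of the steps is one plausible choice rather than a required workflow.

```matlab
% Sample English text (invented for this sketch).
str = "The quick brown foxes are jumping over the lazy dogs.";

% Detect the language of the raw text.
lang = corpusLanguage(str);

% Tokenize. With no 'Language' option, the language is detected automatically;
% to force one, pass it explicitly, e.g. tokenizedDocument(str,'Language','en').
documents = tokenizedDocument(str);

% Add part-of-speech tags, remove stop words, then lemmatize.
documents = addPartOfSpeechDetails(documents);
cleaned = removeStopWords(documents);
cleaned = normalizeWords(cleaned,'Style','lemma');

% Inspect per-token details; the table includes a Language column.
tdetails = tokenDetails(documents);
disp(head(tdetails))
```

The language details stored with each token determine which stop word list and which normalization rules are applied, so documents in a supported language generally need no language-specific function calls.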

Topics

English Language

Japanese Language

German Language

Korean Language

Other Languages