Table of Contents

Public classStatic StopWords

Namespace
Rowles.LeanCorpus.Analysis
Assembly
Rowles.LeanCorpus.dll

Stop word lists for common languages.

public static class StopWords
StopWords

Public field Arabic

Gets the built-in Arabic stop word list.

Public field Chinese

Gets the built-in Chinese stop word list.

Public field Dutch

Gets the built-in Dutch stop word list.

Public field English

Gets the classic 33-word English stop word list. This is the default used by StandardAnalyser.

Public field EnglishExtended

Gets the extended English stop word list (~95 words) which covers prepositions, pronouns, modals, adverbs, and negation fragments in addition to the classic set. Pass this to IndexWriterConfig.StopWords to opt in to more aggressive stop word removal.

Public field French

Gets the built-in French stop word list.

Public field German

Gets the built-in German stop word list.

Public field Italian

Gets the built-in Italian stop word list.

Public field Japanese

Gets the built-in Japanese stop word list.

Public field Korean

Gets the built-in Korean stop word list.

Public field Portuguese

Gets the built-in Portuguese stop word list.

Public field Russian

Gets the built-in Russian stop word list.

Public field Spanish

Gets the built-in Spanish stop word list.

Public property SupportedLanguages

Returns all supported BCP 47 language codes.

Public method ForLanguage(string)

Gets the stop word list for a BCP 47 language code, or null if the language is not supported.