Table of Contents

Public classSealed JapaneseStemmer

Namespace
Rowles.LeanLucene.Analysis.Stemmers
Assembly
Rowles.LeanLucene.dll

Japanese stemmer — identity implementation.

public sealed class JapaneseStemmer : IStemmer
JapaneseStemmer
Implements

Remarks

Japanese morphology is handled by conjugation paradigms (活用), not detachable suffixes of the kind that a Snowball-style stemmer can reliably strip. Inflected verb and adjective forms are interleaved with auxiliary morphemes (e.g. 食べられる, 走っている) that require a full morphological analyser to decompose correctly.

Recommended pre-processing for Japanese search:

  • Morphological analysis with MeCab, Kuromoji (Lucene's JapaneseAnalyzer), or SudachiPy
  • Lemmatisation using the analyser's dictionary base-form output
  • Kana normalisation (hiragana ↔ katakana, full-width → half-width)

This class is provided so the IStemmer pipeline compiles uniformly across all supported languages.

Methods

Public method Stem(string)

Returns the stemmed form of the word.