Table of Contents

Internal classStaticInternal TermDictionaryFstBuilder

Namespace
Rowles.LeanLucene.Codecs.Fst
Assembly
Rowles.LeanLucene.dll

Serialises a sorted term list into a compact byte-keyed v2 dictionary format. Format: [termCount:int32][postingsOffsets: N*int64][keyStarts: (N+1)*int32][keyData: UTF-8 bytes]. This format enables O(log N) binary search on raw UTF-8 bytes without string materialisation. Terms are re-sorted in UTF-8 byte order to ensure binary search correctness (string ordinal sort can differ from UTF-8 byte sort for supplementary characters).

internal static class TermDictionaryFstBuilder
TermDictionaryFstBuilder

Methods

Public methodStatic Write(IndexOutput, List<string>, Dictionary<string, long>)

Writes sorted terms and their postings offsets in v2 format to the given output. The codec header (magic + version) must already have been written by the caller.