Table of Contents

Public classSealed HtmlStripCharFilter

Namespace
Rowles.LeanCorpus.Analysis.Filters
Assembly
Rowles.LeanCorpus.dll

Strips HTML/XML tags from input text, leaving only the text content.

public sealed class HtmlStripCharFilter : ICharFilter
HtmlStripCharFilter
Implements

Public method EntityPattern()

Public method Filter(ReadOnlySpan<char>)

Transforms the input text, returning filtered text ready for tokenisation.

Public method TagPattern()