8.7. Tokenizers

TODO: Write me.

Here are the list of built-in tokenizers:

  • TokenBigram
  • TokenBigramSplitSymbol
  • TokenBigramSplitSymbolAlpha
  • TokenBigramSplitSymbolAlphaDigit
  • TokenBigramIgnoreBlank
  • TokenBigramIgnoreBlankSplitSymbol
  • TokenBigramIgnoreBlankSplitAlpha
  • TokenBigramIgnoreBlankSplitAlphaDigit
  • TokenDelimit
  • TokenDelimitNull
  • TokenTrigram
  • TokenUnigram