tokenizer_list command lists tokenizers in a database.
Here is a simple example.
Execution example:
tokenizer_list
# [
# [
# 0,
# 0.0,
# 0.0
# ],
# [
# {
# "name": "TokenMecab"
# },
# {
# "name": "TokenDelimit"
# },
# {
# "name": "TokenUnigram"
# },
# {
# "name": "TokenBigram"
# },
# {
# "name": "TokenTrigram"
# },
# {
# "name": "TokenBigramSplitSymbol"
# },
# {
# "name": "TokenBigramSplitSymbolAlpha"
# },
# {
# "name": "TokenBigramSplitSymbolAlphaDigit"
# },
# {
# "name": "TokenBigramIgnoreBlank"
# },
# {
# "name": "TokenBigramIgnoreBlankSplitSymbol"
# },
# {
# "name": "TokenBigramIgnoreBlankSplitSymbolAlpha"
# },
# {
# "name": "TokenBigramIgnoreBlankSplitSymbolAlphaDigit"
# },
# {
# "name": "TokenDelimitNull"
# }
# ]
# ]
It returns tokenizers in a database.
tokenizer_list command returns tokenizers. Each tokenizer has an attribute that contains its name. More attributes may be added in the future:
[HEADER, tokenizers]
HEADER
See Output format for details about HEADER.
tokenizers
tokenizers is an array of tokenizers. Each tokenizer is an object that has the following attributes.
Name    Description
name    Tokenizer name.
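The return value described above can be consumed by a client roughly as follows. This is a minimal sketch, not part of Groonga itself: it assumes the response has already been obtained as JSON text, that the first HEADER element is the return code with 0 meaning success, and it uses a shortened sample response. The helper name tokenizer_names is hypothetical.

```python
import json

# Shortened sample response in the [HEADER, tokenizers] shape shown above.
# A real response lists every tokenizer registered in the database.
SAMPLE_RESPONSE = """
[
  [0, 0.0, 0.0],
  [
    {"name": "TokenMecab"},
    {"name": "TokenDelimit"},
    {"name": "TokenBigram"}
  ]
]
"""


def tokenizer_names(response_text):
    """Return the tokenizer names from a tokenizer_list JSON response.

    Hypothetical helper for illustration only.
    """
    header, tokenizers = json.loads(response_text)
    # Assumption: the first HEADER element is the return code, 0 on success.
    return_code = header[0]
    if return_code != 0:
        raise RuntimeError(f"tokenizer_list failed with return code {return_code}")
    # Read only "name" so that attributes added in the future are ignored.
    return [tokenizer["name"] for tokenizer in tokenizers]


names = tokenizer_names(SAMPLE_RESPONSE)
print(names)
```

Reading only the name attribute keeps the client working if tokenizer objects gain more attributes later.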