Sha256: 7ceb4cd086b5d0eafa25aae3287e3d04cea42356988eb2f4f87702647e68b00d
Contents?: true
Size: 385 Bytes
Versions: 5
Compression:
Stored size: 385 Bytes
Contents
require 'spec_helper' describe PragmaticTokenizer do context 'Language: French (fr)' do it 'tokenizes a string #001' do text = "L'art de l'univers, c'est un art" pt = PragmaticTokenizer::Tokenizer.new( text, language: 'fr' ) expect(pt.tokenize).to eq(["l'", "art", "de", "l'", "univers", ",", "c'est", "un", "art"]) end end end
Version data entries
5 entries across 5 versions & 1 rubygems