Sha256: 507f533a2c58dddb42e55206524a37c4a7d22082627c21077d335b525540c47b
Contents?: true
Size: 379 Bytes
Versions: 3
Compression:
Stored size: 379 Bytes
Contents
require 'spec_helper' describe PragmaticTokenizer do context 'Language: French (fr)' do it 'tokenizes a string #001' do text = "D'art de l'univers, c'est un art" pt = PragmaticTokenizer::Tokenizer.new( language: 'fr' ) expect(pt.tokenize(text)).to eq(["d'", "art", "de", "l'", "univers", ",", "c'" ,"est", "un", "art"]) end end end
Version data entries
3 entries across 3 versions & 1 rubygems
Version | Path |
---|---|
pragmatic_tokenizer-3.2.1 | spec/languages/french_spec.rb |
pragmatic_tokenizer-3.2.0 | spec/languages/french_spec.rb |
pragmatic_tokenizer-3.1.0 | spec/languages/french_spec.rb |