Sha256: 507f533a2c58dddb42e55206524a37c4a7d22082627c21077d335b525540c47b

Contents?: true

Size: 379 Bytes

Versions: 3

Compression:

Stored size: 379 Bytes

Contents

require 'spec_helper'

describe PragmaticTokenizer do
  context 'Language: French (fr)' do
    it 'tokenizes a string #001' do
      text = "D'art de l'univers, c'est un art"
      pt = PragmaticTokenizer::Tokenizer.new(
        language: 'fr'
      )
      expect(pt.tokenize(text)).to eq(["d'", "art", "de", "l'", "univers", ",", "c'", "est", "un", "art"])
    end
  end
end

Version data entries

3 entries across 3 versions & 1 rubygem

Version Path
pragmatic_tokenizer-3.2.1 spec/languages/french_spec.rb
pragmatic_tokenizer-3.2.0 spec/languages/french_spec.rb
pragmatic_tokenizer-3.1.0 spec/languages/french_spec.rb