Sha256: faa31a79f64bdfe9332cef2aae36d2d473a561617f4f4abc5ca7d195a148fe6b

Contents?: true

Size: 648 Bytes

Versions: 7

Compression:

Stored size: 648 Bytes

Contents

# frozen_string_literal: true

module PragmaticSegmenter
  module Languages
    module English
      include Languages::Common

      class Cleaner < Cleaner
        def clean
          super
          clean_quotations
        end

        private

        def clean_quotations
          @text.gsub(/`/, "'")
        end

        def abbreviations
          [].freeze
        end
      end

      class AbbreviationReplacer < AbbreviationReplacer
        SENTENCE_STARTERS = %w(
          A Being Did For He How However I In It Millions More She That The
          There They We What When Where Who Why
        ).freeze
      end
    end
  end
end

Version data entries

7 entries across 7 versions & 1 rubygems

Version Path
pragmatic_segmenter-0.3.24 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.23 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.22 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.21 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.20 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.19 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.18 lib/pragmatic_segmenter/languages/english.rb