Sha256: 251cffe2073fde0ff1e2348947b5e4142d1af4bed25ea5ffa247f17c2a462b2f

Contents?: true

Size: 610 Bytes

Versions: 2

Compression:

Stored size: 610 Bytes

Contents

module PragmaticSegmenter
  module Languages
    module English
      include Languages::Common

      class Cleaner < Cleaner
        def clean
          super
          clean_quotations
        end

        private

        def clean_quotations
          @text.gsub(/`/, "'")
        end

        def abbreviations
          []
        end
      end

      class AbbreviationReplacer < AbbreviationReplacer
        SENTENCE_STARTERS = %w(
          A Being Did For He How However I In It Millions More She That The
          There They We What When Where Who Why
        ).freeze
      end
    end
  end
end

Version data entries

2 entries across 2 versions & 1 rubygems

Version Path
pragmatic_segmenter-0.3.7 lib/pragmatic_segmenter/languages/english.rb
pragmatic_segmenter-0.3.6 lib/pragmatic_segmenter/languages/english.rb