Sha256: 83006048d1ca33532c27defae29113cbb22188de1194e245158fa624eb131dc9

Contents?: true

Size: 702 Bytes

Versions: 4

Compression:

Stored size: 702 Bytes

Contents

##
# Asian J. (2007) "Effective Techniques for Indonesian Text Retrieval". page 26
# http://researchbank.rmit.edu.au/eserv/rmit:6312/Asian.pdf

module Sastrawi
  module Morphology
    class InvalidAffixPairSpecification
      def satisfied_by?(word)
        return false if /^me(.*)kan$/.match(word)

        return false if word == 'ketahui'

        invalid_affixes = [
          /^ber(.*)i$/, /^di(.*)an$/, /^ke(.*)i$/, /^ke(.*)an$/,
          /^me(.*)an$/, /^me(.*)an$/, /^ter(.*)an$/, /^per(.*)an$/
        ]

        matches = false

        invalid_affixes.each do |invalid_affix|
          matches = matches || !!(word =~ invalid_affix)
        end

        matches
      end
    end
  end
end

Version data entries

4 entries across 4 versions & 1 rubygems

Version Path
sastrawi-0.1.4 lib/sastrawi/morphology/invalid_affix_pair_specification.rb
sastrawi-0.1.3 lib/sastrawi/morphology/invalid_affix_pair_specification.rb
sastrawi-0.1.2 lib/sastrawi/morphology/invalid_affix_pair_specification.rb
sastrawi-0.1.1 lib/sastrawi/morphology/invalid_affix_pair_specification.rb