Sha256: 18cf8c1a1a3fed5ea9ef43d30421ee059905b47254975b452f363182d7e192ab

Contents?: true

Size: 1.4 KB

Versions: 4

Compression:

Stored size: 1.4 KB

Contents

module Picky

  # encoding: utf-8
  #
  module Generators

    module Similarity

      # It's actually a combination of double metaphone
      # and Levenshtein.
      #
      # It uses the double metaphone to get similar words
      # and ranks them using the levenshtein.
      #
      class Phonetic < Strategy

        attr_reader :amount

        #
        #
        def initialize amount = 10
          raise "In Picky 2.0+, the Similarity::Phonetic has been renamed to Similarity::DoubleMetaphone. Please use that one. Thanks!" if self.class == Phonetic
          @amount = amount
        end

        # # Generates an index for the given index (in exact index style).
        # #
        # # In the following form:
        # # [:meier, :mueller, :peter, :pater] => { MR: [:meier], MLR: [:mueller], PTR: [:peter, :pater] }
        # #
        # def generate_from inverted
        #   hash = hashify inverted.keys
        #   sort hash
        # end

        protected

          # Sorts the index values in place.
          #
          # TODO Include this again. Sort at the end.
          #      Or sort when inserting in realtime.
          #
          def sort hash
            hash.each_pair.each do |code, ary|
              ary.sort_by_levenshtein! code
              ary.slice! amount, ary.size # size is not perfectly correct, but anyway
            end
            hash
          end

      end

    end

  end

end

Version data entries

4 entries across 4 versions & 1 rubygems

Version Path
picky-3.5.4 lib/picky/generators/similarity/phonetic.rb
picky-3.5.3 lib/picky/generators/similarity/phonetic.rb
picky-3.5.2 lib/picky/generators/similarity/phonetic.rb
picky-3.5.1 lib/picky/generators/similarity/phonetic.rb