Sha256: 95ded05f3b97bab37edaa77f5bfe5960b0eaee27e9c59d8fd7c9eadc4b6be29b

Contents?: true

Size: 582 Bytes

Versions: 1

Compression:

Stored size: 582 Bytes

Contents

require 'markov/generator'

#markov = Markov::Generator.new
#markov.parse_source_file "./test_seed.txt"

#puts "#{markov.generate_sentence}"

#@split_words = /([',.?!\n-])|[\s]+/
#@split_sentence = /(?<=[.!?\n])\s+/

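# Split on whitespace that directly follows a sentence terminator (., ? or !).
split_sentence = /(?<=[.?!])\s+/
# Split on whitespace; punctuation and line breaks are captured, so they are kept as tokens.
split_words = /([,.?!\n\r])|[\s]/
# Characters stripped from each sentence before it is split into words.
replace_chars = /[„':;_"()\n\r]/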

source = "./file_parser_test.txt"

sentences = File.read(source, encoding: "UTF-8").split(split_sentence)

sentences.each do |sentence|
  puts sentence
  # gsub! returns nil when nothing is replaced, so use the non-bang form.
  cleaned = sentence.gsub(replace_chars, "")
  puts cleaned
  p cleaned.split(split_words)
end
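
The splitting steps above can be tried without a seed file. The following is a minimal sketch that reuses the three patterns from the script on an in-memory string. The `tokenize` helper, the constant names, and the sample text are assumptions made for illustration and are not part of the markov-generator gem. It also drops the empty strings that `String#split` leaves when two delimiters are adjacent, which the original script does not do.

```ruby
# Same patterns as in the test script, held in constants so the helper can see them.
SPLIT_SENTENCE = /(?<=[.?!])\s+/
SPLIT_WORDS    = /([,.?!\n\r])|[\s]/
REPLACE_CHARS  = /[„':;_"()\n\r]/

# Hypothetical helper (not part of the gem): split text into sentences,
# strip the unwanted characters, then tokenize each sentence.
def tokenize(text)
  text.split(SPLIT_SENTENCE).map do |sentence|
    # Non-bang gsub always returns a String, even when nothing matched.
    cleaned = sentence.gsub(REPLACE_CHARS, "")
    # The capture group in SPLIT_WORDS keeps punctuation as separate tokens;
    # reject(&:empty?) removes the "" produced between "," and a following space.
    cleaned.split(SPLIT_WORDS).reject(&:empty?)
  end
end

tokens = tokenize(%q{Hello, world! It's "fine".})
p tokens
# => [["Hello", ",", "world", "!"], ["Its", "fine", "."]]
```

Because `replace_chars` removes apostrophes, contractions such as "It's" collapse to "Its". Whether that is desirable depends on how the generator is meant to treat such words.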

Version data entries

1 entry across 1 version & 1 rubygem

Version Path
markov-generator-0.9.1 test/file_parser_test.rb