Sha256: cc504a255628748561684420bccdfae1430a414af552a4e57ca7786c71dee508

Contents?: true

Size: 407 Bytes

Versions: 2

Stored size: 407 Bytes

Contents

require 'rubygems'
require 'open-uri'
require 'hpricot'

f = File.open(File.expand_path("../../data/non_twss.txt", __FILE__), "w")

domain = "http://www.fmylife.com"

200.times do |i|
  url = domain + "/intimacy?page=#{i}"
  puts url
  body = open(url).read
  doc = Hpricot(body)
  doc.search('div.post p a.fmllink') do |story|
    f.puts story.to_plain_text
  end
  f.flush
  sleep rand * 3.0
end

f.close
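
The script above stops at the first failed request and leaves the file handle open if an exception is raised. A minimal sketch of the same loop with those two points addressed is shown below; it is not part of the gem. The names `page_urls`, `collect_stories`, and `DEFAULT_FETCH` are hypothetical, `URI.open` assumes Ruby 2.5 or later, and the fetch and extract steps are passed in as lambdas so the loop can be exercised without network access.

```ruby
require 'open-uri'

# Hypothetical default fetcher; URI.open is assumed to be available (Ruby 2.5+).
DEFAULT_FETCH = ->(url) { URI.open(url).read }

# Build the listing URLs the original loop visits (page=0 .. page=pages-1).
def page_urls(domain, pages)
  (0...pages).map { |i| "#{domain}/intimacy?page=#{i}" }
end

# Fetch each URL, hand the body to `extract` (which must return an array of
# story strings), and write one story per line to `out`.
# Pages that fail to download are reported on stderr and skipped.
# Returns the number of stories written.
def collect_stories(urls, out, fetch: DEFAULT_FETCH, extract:, max_delay: 3.0)
  written = 0
  urls.each do |url|
    begin
      body = fetch.call(url)
    rescue StandardError => e
      warn "skipping #{url}: #{e.message}"
      next
    end
    extract.call(body).each do |story|
      out.puts story
      written += 1
    end
    out.flush
    # Random pause between requests, as in the original script.
    sleep rand * max_delay if max_delay > 0
  end
  written
end

# Possible usage, assuming Hpricot is installed and the original selector
# still matches the page markup:
#
#   require 'hpricot'
#   extract = ->(body) { Hpricot(body).search('div.post p a.fmllink').map(&:to_plain_text) }
#   File.open("non_twss.txt", "w") do |f|
#     collect_stories(page_urls("http://www.fmylife.com", 200), f, extract: extract)
#   end
```

The block form of `File.open` closes the file even when an exception escapes the loop, which is why the explicit `f.close` is not needed in this variant.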

Version data entries

2 entries across 2 versions & 1 rubygems

Version Path
twss-0.0.5 script/collect_non_twss.rb
twss-0.0.4 script/collect_non_twss.rb