Sha256: cc504a255628748561684420bccdfae1430a414af552a4e57ca7786c71dee508
Contents?: true
Size: 407 Bytes
Versions: 2
Compression:
Stored size: 407 Bytes
Contents
require 'rubygems' require 'open-uri' require 'hpricot' f = File.open(File.expand_path("../../data/non_twss.txt", __FILE__), "w") domain = "http://www.fmylife.com" 200.times do |i| url = domain + "/intimacy?page=#{i}" puts url body = open(url).read doc = Hpricot(body) doc.search('div.post p a.fmllink') do |story| f.puts story.to_plain_text end f.flush sleep rand * 3.0 end f.close
Version data entries
2 entries across 2 versions & 1 rubygems
Version | Path |
---|---|
twss-0.0.5 | script/collect_non_twss.rb |
twss-0.0.4 | script/collect_non_twss.rb |