Sha256: e0b19f37b241cdb14abf3f8f93ff37ba7a74f4d1477f6c7b381e2ebaf8b5e72b
Contents?: true
Size: 1.83 KB
Versions: 1
Compression:
Stored size: 1.83 KB
Contents
require File.dirname(__FILE__) + '/../spec_helper.rb' describe Rawler::Crawler do it "should parse all links" do url = 'http://example.com/' register(url, site) Rawler::Crawler.new(url).links.should == ['http://example.com/foo', 'http://external.com/bar'] end it "should return an empty array when raising Errno::ECONNREFUSED" do url = 'http://example.com' register(url, site) Net::HTTP.should_receive(:get).and_raise Errno::ECONNREFUSED crawler = Rawler::Crawler.new(url).links.should == [] end it "should parse relative links" do url = 'http://example.com/path' register(url, '<a href="/foo">foo</a>') Rawler::Crawler.new(url).links.should == ['http://example.com/foo'] end # it "should print a message when raising Errno::ECONNREFUSED" do # pending "refactor output. Don't use a global variable" # url = 'http://example.com' # register(url, site) # # Net::HTTP.should_receive(:get).and_raise Errno::ECONNREFUSED # # $stdout.should_receive(:puts).with("Couldn't connect to #{url}") # # Rawler::Crawler.new(url).links # end private def site <<-site <!DOCTYPE html> <html> <body> <p>Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.</p> <p><a href="http://example.com/foo">foo</a></p> <p><a href="http://external.com/bar">bar</a></p> </body> </html> site end end
Version data entries
1 entries across 1 versions & 1 rubygems
Version | Path |
---|---|
rawler-0.0.2 | spec/unit/crawler_spec.rb |