RubygemsResearch

Sha256: 3b5deb19cdd709578b3d494b25aba2fd9a6cb91462db8dcfa1e8553bfb8c9d03

Contents?: true

Size: 1.34 KB

Versions: 1

Compression:

Stored size: 1.34 KB

=== 0.1.5 / 2009-03-22

* Catch malformed URIs in Page#to_absolute and return +nil+.
* Filter out +nil+ URIs in Page#urls.

=== 0.1.4 / 2009-01-15

* Use Nokogiri for HTML and XML parsing.

=== 0.1.3 / 2009-01-10

* Added the :host options to Spidr::Agent#initialize.
* Added the Web Spider Obstacle Course files to the Manifest.
* Aliased Spidr::Agent#visited_urls to Spidr::Agent#history.

=== 0.1.2 / 2008-11-06

* Fixed a bug in Page#to_absolute where URLs with no path were not
  receiving a default path of <tt>/</tt>.
* Fixed a bug in Page#to_absolute where URL paths were not being
  expanded, in order to remove <tt>..</tt> and <tt>.</tt> directories.
* Fixed a bug where absolute URLs could have a blank path, thus causing
  Agent#get_page to crash when it performed the HTTP request.
* Added RSpec spec tests.
* Created a Web-Spider Obstacle Course
  (http://spidr.rubyforge.org/course/start.html) which is used in the spec
  tests.

=== 0.1.1 / 2008-10-04

* Added a reader method for the response instance variable in Page.
* Fixed a bug in Page#method_missing.

=== 0.1.0 / 2008-05-23

* Initial release.
  * Black-list or white-list URLs based upon:
    * Host name
    * Port number
    * Full link
    * URL extension
  * Provides call-backs for:
    * Every visited Page.
    * Every visited URL.
    * Every visited URL that matches a specified pattern.

Version data entries

1 entries across 1 versions & 1 rubygems

Version	Path
spidr-0.1.5	History.txt