Sha256: 0f609e636555b6b072a42d649384757701b6ec107fd3b700995f93b9627cfc6d

Size: 604 Bytes

Versions: 1

Stored size: 604 Bytes

Contents

= Anemone

== DESCRIPTION
Anemone is a web spider framework that can crawl a domain and collect useful
information about the pages it visits. It is versatile, allowing you to
write your own specialized spider tasks quickly and easily.

== FEATURES
* Multi-threaded design for high performance
* Tracks HTTP 301 redirects to understand a page's aliases
* Built-in BFS (breadth-first search) algorithm for determining page depth
* Allows exclusion of URLs based on regular expressions
* Can obey robots.txt while crawling
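
To illustrate how breadth-first search yields page depth and how regular
expressions can exclude URLs, here is a minimal standalone sketch. It does not
use Anemone's API and is not Anemone's implementation: the +LINKS+ graph, the
+page_depths+ method, and the <tt>/admin</tt> skip pattern are all hypothetical
names chosen for illustration, and an in-memory hash stands in for fetched pages.

```ruby
# Hypothetical in-memory link graph standing in for fetched pages.
# Keys are URLs; values are the links found on that page.
LINKS = {
  "/"            => ["/about", "/blog", "/admin/login"],
  "/about"       => ["/", "/team"],
  "/blog"        => ["/blog/post-1", "/about"],
  "/blog/post-1" => ["/"],
  "/team"        => [],
  "/admin/login" => ["/admin/secret"],
}.freeze

# Breadth-first traversal from +start+. The first time BFS reaches a URL it
# has followed a shortest path, so the depth recorded at that moment is final.
# URLs matching any pattern in +skip_patterns+ are never queued.
def page_depths(links, start, skip_patterns = [])
  depths = { start => 0 }
  queue = [start]

  until queue.empty?
    url = queue.shift
    (links[url] || []).each do |link|
      next if depths.key?(link)                                # already visited
      next if skip_patterns.any? { |pattern| pattern.match?(link) }

      depths[link] = depths[url] + 1
      queue << link
    end
  end

  depths
end

depths = page_depths(LINKS, "/", [%r{\A/admin}])
depths.each { |url, depth| puts "#{depth} #{url}" }
```

Because <tt>/admin/login</tt> is excluded before it is queued, the pages that are
only reachable through it are never visited either.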

== REQUIREMENTS
* nokogiri

== EXAMPLES
See the +bin+ directory for several examples of useful Anemone tasks.

Version data entries

1 entry across 1 version & 1 rubygem

Version Path
parolkar-anemone-0.1.2 README.rdoc