Sha256: 0f609e636555b6b072a42d649384757701b6ec107fd3b700995f93b9627cfc6d
Contents?: true
Size: 604 Bytes
Versions: 1
Compression:
Stored size: 604 Bytes
Contents
= Anemone == DESCRIPTION Anemone is a web spider framework that can spider a domain and collect useful information about the pages it visits. It is versatile, allowing you to write your own specialized spider tasks quickly and easily. == FEATURES * Multi-threaded design for high performance * Tracks 301 HTTP redirects to understand a page's aliases * Built-in BFS algorithm for determining page depth * Allows exclusion of URLs based on regular expressions * Can crawl obeying robots.txt == REQUIREMENTS * nokogiri == EXAMPLES See the +bin+ directory for several examples of useful Anemone tasks.
Version data entries
1 entries across 1 versions & 1 rubygems
Version | Path |
---|---|
parolkar-anemone-0.1.2 | README.rdoc |