# Ronin Web * [Source](http://github.com/ronin-ruby/ronin-web) * [Issues](http://github.com/ronin-ruby/ronin-web/issues) * [Documentation](http://rubydoc.info/github/ronin-ruby/ronin-web/frames) * [Mailing List](http://groups.google.com/group/ronin-ruby) * [irc.freenode.net #ronin](http://webchat.freenode.net/?channels=ronin&uio=Mj10cnVldd) ## Description Ronin Web is a Ruby library for Ronin that provides support for web scraping and spidering functionality. Ronin is a Ruby platform for exploit development and security research. Ronin allows for the rapid development and distribution of code, exploits or payloads over many common Source-Code-Management (SCM) systems. ### Ruby Ronin's Ruby environment allows security researchers to leverage Ruby with ease. The Ruby environment contains a multitude of convenience methods for working with data in Ruby, a Ruby Object Database, a customized Ruby Console and an extendable command-line interface. ### Extend Ronin's more specialized features are provided by additional Ronin libraries, which users can choose to install. These libraries can allow one to write and run Exploits and Payloads, scan for PHP vulnerabilities, perform Google Dorks or run 3rd party scanners. ### Publish Ronin allows users to publish and share code, exploits, payloads or other data via Overlays. Overlays are directories of code and data that can be hosted on any SVN, Hg, Git or Rsync server. Ronin makes it easy to create, install or update Overlays. ## Features * Provides popular [User Agent strings](https://github.com/ronin-ruby/ronin-web/blob/master/data/ronin/web/user_agents.yml). * Web access (utilizing [Mechanize](http://rubydoc.info/gems/mechanize/frames) and [Nokogiri](http://rubydoc.info/gems/nokogiri/frames). * Integrates [Spidr](http://github.com/postmodern/spidr#readme) into {Ronin::Web::Spider}. * Provides a collection of useful [Rack](http://rack.rubyforge.org/) Middleware: * {Ronin::Web::Middleware::Files} * {Ronin::Web::Middleware::Directories} * {Ronin::Web::Middleware::Proxy} * {Ronin::Web::Middleware::Router} * Provides {Ronin::Web::Server}, a [Sinatra](http://sinatrarb.com/) based Web Server. * Provides {Ronin::Web::Proxy}, a [Sinatra](http://sinatrarb.com/) based Web Proxy. ## Synopsis Start the Ronin console with Ronin Web preloaded: $ ronin-web ## Examples Get a web-page: Web.get('http://www.rubyinside.com/') Get only the body of the web-page: Web.get_body('http://www.rubyinside.com/') Get a [Mechanize agent](http://rubydoc.info/gems/mechanize/1.0.0/Mechanize): agent = Web.agent Parse HTML: Web.html(open('some_file.html')) # => # # # # # Build a HTML document: doc = Web.build_html do html { head { script(:type => 'text/javascript', :src => 'redirect.js') } } end puts doc.to_html # # Parse XML: Web.xml(some_text) # => # # # admin # 0mni # # Build a XML document: doc = Web.build_xml do playlist { mp3 { file { text('02 THE WAIT.mp3') } artist { text('Evil Nine') } track { text('The Wait feat David Autokratz') } duration { text('1000000000') } } } end puts doc.to_xml # # # # 02 THE WAIT.mp3 # Evil Nine # The Wait feat David Autokratz # 1000000000 # # Spider a web site: Web::Spider.host('www.example.com') do |spider| spider.every_url do |url| # ... end spider.every_page do |page| # ... end end Serve files via a Web Server: require 'ronin/web/server' Web.server do file '/opensearch.xml', '/tmp/test.xml' directory '/download/', '/tmp/download/' end Web.server.get '/test' do 'Test 1 2 1 2' end ## Requirements * [nokogiri](http://github.com/tenderlove/nokogiri) ~> 1.4 * [libxml2](http://xmlsoft.org/) * [libxslt1](http://xmlsoft.org/XSLT/) * [mechanize](http://github.com/tenderlove/mechanize) ~> 1.0 * [spidr](http://github.com/postmodern/spidr) ~> 0.2 * [rack](http://github.com/rack/rack) ~> 1.3 * [sinatra](http://github.com/sinatra/sinatra) ~> 1.2 * [data_paths](http://github.com/postmodern/data_paths) ~> 0.3 * [ronin-support](http://github.com/ronin-ruby/ronin-support) ~> 0.2 * [ronin](http://github.com/ronin-ruby/ronin) ~> 1.1 ## Install $ gem install ronin-web ## License Ronin Web - A Ruby library for Ronin that provides support for web scraping and spidering functionality. Copyright (c) 2006-2011 Hal Brodigan (postmodern.mod3 at gmail.com) This file is part of Ronin Web. Ronin is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version. Ronin is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with Ronin. If not, see .