Sha256: 77c6ff7c527ace67874bbb813420ba2bebac2e1207c3a7d94d3e0c12fff477d0

Contents?: true

Size: 1.28 KB

Versions: 3

Compression:

Stored size: 1.28 KB

Contents

=begin
    Copyright 2010-2015 Tasos Laskos <tasos.laskos@arachni-scanner.com>

    This file is part of the Arachni Framework project and is subject to
    redistribution and commercial restrictions. Please see the Arachni Framework
    web site for more information on licensing and terms of use.
=end

# Extracts paths from `script` HTML elements.
# Both from `src` and the text inside the scripts.
#
# @author Tasos "Zapotek" Laskos <tasos.laskos@arachni-scanner.com>
# @version 0.2
class Arachni::Parser::Extractors::Scripts < Arachni::Parser::Extractors::Base

    def run
        return [] if !includes?( 'script' )

        document.search( '//script[@src]' ).map { |a| a['src'] } |
            document.xpath( '//script' ).map(&:text).join.
                scan( /[\/a-zA-Z0-9%._-]+/ ).
                select do |s|
                    # String looks like a path, but don't get fooled by comments.
                    s.include?( '.' ) && s.include?( '/' )  &&
                    !s.include?( '*' ) && !s.start_with?( '//' ) &&

                    # Require absolute paths, otherwise we may get caught in
                    # a loop, this context isn't the most reliable for extracting
                    # real paths.
                    s.start_with?( '/' )
                end
    end

end

Version data entries

3 entries across 3 versions & 1 rubygems

Version Path
arachni-1.3.2 components/path_extractors/scripts.rb
arachni-1.3.1 components/path_extractors/scripts.rb
arachni-1.3 components/path_extractors/scripts.rb