Sha256: 95aa75be17285b1ed6df8711a3bca9bd9f76e7bb2663ff67e12bdad6a19df186

Contents?: true

Size: 409 Bytes

Versions: 2

Compression:

Stored size: 409 Bytes

Contents

 # Marks all blocks as content.

module Boilerpipe::Extractors
  class KeepEverythingExtractor
    def self.text(contents)
      doc = ::Boilerpipe::SAX::BoilerpipeHTMLParser.parse(contents)
      ::Boilerpipe::Extractors::KeepEverythingExtractor.process doc
      doc.content
    end

    def self.process(doc)
      ::Boilerpipe::Filters::MarkEverythingContentFilter.process doc
      doc
    end
  end
end

Version data entries

2 entries across 2 versions & 1 rubygems

Version Path
boilerpipe-ruby-0.4.0 lib/boilerpipe/extractors/keep_everything_extractor.rb
boilerpipe-ruby-0.3.0 lib/boilerpipe/extractors/keep_everything_extractor.rb