Sha256: 26156d580251093e473cc8140e053ba920f629b81cf2facf9af79315b2853387

Contents?: true

Size: 351 Bytes

Versions: 2

Compression:

Stored size: 351 Bytes

Contents

module Boilerpipe::Extractors
  class CanolaExtractor

    def self.text(contents)
      doc = ::Boilerpipe::SAX::BoilerpipeHTMLParser.parse(contents)
      ::Boilerpipe::Extractors::CanolaExtractor.process doc
      doc.content
    end

    def self.process(doc)
      ::Boilerpipe::Filters::CanolaClassifier.process doc

      doc
    end
  end
end

Version data entries

2 entries across 2 versions & 1 rubygems

Version Path
boilerpipe-ruby-0.4.0 lib/boilerpipe/extractors/canola_extractor.rb
boilerpipe-ruby-0.3.0 lib/boilerpipe/extractors/canola_extractor.rb