Sha256: aade157adc841b5a0603af5685b32283932ac6e6fe35b104f18f34e271807e3a

Contents?: true

Size: 566 Bytes

Versions: 1

Compression:

Stored size: 566 Bytes

Contents

module Ddr
  module Extraction
    module Adapters
      class TikaAdapter

        # Extract text from file
        #
        # @param file [String] path to file from which to extract text
        # @return [IO] the output
        def extract_text(file)
          IO.popen(["java", "-jar", self.class.path, "--text", file])
        end

        class << self
          # Path to tika-app.jar
          attr_accessor :path

          # Tika server port (optional, required for server)
          attr_accessor :port
        end          

      end
    end
  end
end

Version data entries

1 entries across 1 versions & 1 rubygems

Version Path
ddr-extraction-0.2.1 lib/ddr/extraction/adapters/tika_adapter.rb