Sha256: 19ecf6b1906d09e91d740981072eadd37ab2385e285d2008cb2f56030609a8fc

Contents?: true

Size: 991 Bytes

Versions: 1

Compression:

Stored size: 991 Bytes

Contents

# Rika

A JRuby wrapper for Apache Tika to extract text and metadata from various file formats.

More information about Apache Tika can be found here: http://tika.apache.org/

## Installation

Add this line to your application's Gemfile:

    gem 'rika'

Remember that this gem only works on JRuby.

And then execute:

    $ bundle

Or install it yourself as:

    $ gem install rika

## Usage

Something like this:

require 'rika'

parser = Rika::Parser.new('document.pdf')

parser.content # Returns the content of the document as text

parser.metadata["title"] if parser.metadata_exists?("title") # Returns the metadata field title if it exists 

parser.available_metadata # Returns all the available metadata keys that can be read from the document

## Contributing

1. Fork it
2. Create your feature branch (`git checkout -b my-new-feature`)
3. Commit your changes (`git commit -am 'Add some feature'`)
4. Push to the branch (`git push origin my-new-feature`)
5. Create new Pull Request

Version data entries

1 entries across 1 versions & 1 rubygems

Version Path
rika-0.9.0-java README.md