Sha256: 19ecf6b1906d09e91d740981072eadd37ab2385e285d2008cb2f56030609a8fc
Contents?: true
Size: 991 Bytes
Versions: 1
Compression:
Stored size: 991 Bytes
Contents
# Rika A JRuby wrapper for Apache Tika to extract text and metadata from various file formats. More information about Apache Tika can be found here: http://tika.apache.org/ ## Installation Add this line to your application's Gemfile: gem 'rika' Remember that this gem only works on JRuby. And then execute: $ bundle Or install it yourself as: $ gem install rika ## Usage Something like this: require 'rika' parser = Rika::Parser.new('document.pdf') parser.content # Returns the content of the document as text parser.metadata["title"] if parser.metadata_exists?("title") # Returns the metadata field title if it exists parser.available_metadata # Returns all the available metadata keys that can be read from the document ## Contributing 1. Fork it 2. Create your feature branch (`git checkout -b my-new-feature`) 3. Commit your changes (`git commit -am 'Add some feature'`) 4. Push to the branch (`git push origin my-new-feature`) 5. Create new Pull Request
Version data entries
1 entries across 1 versions & 1 rubygems
Version | Path |
---|---|
rika-0.9.0-java | README.md |