Sha256: 0a2791e1f795bf392c2e8207bc2422153ac8f100ca932ce17c5666e5723628b7
Contents?: true
Size: 476 Bytes
Versions: 1
Compression:
Stored size: 476 Bytes
Contents
# Open Document to text conversion require 'zip/zip' PlainTextExtractor.new { every :odt as 'application/vnd.oasis.opendocument.text' aka "Open Document Format for text" extract_content_with {|source| Zip::ZipFile.open(source){|zipfile| zipfile.read("content.xml").split(/</).grep(/^text:(p|span)/).collect{|l| l.sub(/^[^>]+>/,'') }.join("\n") } } which_should_for_example_extract 'written with OpenOffice.org', :from => 'basic.odt' }
Version data entries
1 entries across 1 versions & 1 rubygems
Version | Path |
---|---|
picolena-0.2.2 | lib/picolena/templates/lib/plain_text_extractors/opendocument.text.rb |