Sha256: e1a20cd48459ee683afeba96c1eeae071c628ed96275de642082b9c84a351c8b

Contents?: true

Size: 314 Bytes

Versions: 11

Compression:

Stored size: 314 Bytes

Contents

# frozen_string_literal: true

module SimpleTextExtract
  module FormatExtractor
    class DocX < Base
      def extract
        return nil if missing_dependency?("unzip")

        `unzip -p #{Shellwords.escape(file.path)} | grep '<w:t' | sed 's/<[^<]*>//g' | grep -v '^[[:space:]]*$'`
      end
    end
  end
end
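
The shell pipeline above does three things after unzipping: it keeps lines containing `<w:t`, strips every XML tag, and drops lines that are left blank. A minimal pure-Ruby sketch of those three stages, run on an in-memory string, may make the behavior easier to follow. The helper name `strip_docx_markup` and the sample XML are hypothetical and are not part of simple_text_extract. The sketch skips the `unzip -p` step, which with no member name writes every archive member to stdout.

```ruby
# Hypothetical helper, not part of simple_text_extract: mirrors the
# grep | sed | grep stages of the shell pipeline on an in-memory string.
def strip_docx_markup(xml)
  xml.each_line
     .select { |line| line.include?("<w:t") }            # grep '<w:t'
     .map    { |line| line.gsub(/<[^<]*>/, "") }          # sed 's/<[^<]*>//g'
     .reject { |line| line.match?(/\A[[:space:]]*\z/) }   # grep -v '^[[:space:]]*$'
     .join
end

# Made-up sample resembling word/document.xml, split across lines for clarity.
sample = <<~XML
  <?xml version="1.0"?>
  <w:document><w:body><w:p><w:r><w:t>Hello</w:t></w:r></w:p>
  <w:p><w:r><w:t xml:space="preserve">world </w:t></w:r></w:p>
  <w:sectPr><w:tab/></w:sectPr></w:body></w:document>
XML

puts strip_docx_markup(sample)
```

The last sample line matches `<w:t` only because of `<w:tab/>`, so it survives the first filter, becomes empty once its tags are removed, and is then discarded by the blank-line filter. The match is line-based, as with `grep`, so any line that contains `<w:t` contributes all of its non-tag text.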

Version data entries

11 entries across 11 versions and 1 rubygem

Version Path
simple_text_extract-2.0.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-1.3.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-1.2.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-1.1.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-1.0.2 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-1.0.1 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-0.3.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-0.2.1 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-0.2.0 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-0.1.3 lib/simple_text_extract/format_extractor/doc_x.rb
simple_text_extract-0.1.2 lib/simple_text_extract/format_extractor/doc_x.rb