Sha256: d7621c68fe8b12e6ecd4ddf4585506854ec9b8eac7a364c36eba8db53d7aae7d

Contents?: true

Size: 961 Bytes

Versions: 7

Stored size: 961 Bytes

Contents

module OcrFile
  module ImageEngines
    module Pdftoppm
      extend self

      # TODO: other options
      # https://www.xpdfreader.com/pdftoppm-man.html
      # password
      # -mono Generate a monochrome PBM file (instead of an RGB PPM file).
      # -gray Generate a grayscale PGM file (instead of an RGB PPM file).
      # -cmyk Generate a CMYK PAM file (instead of an RGB PPM file).
      def images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true)
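        print 'Generating screenshots of each PDF page ... ' if verbose

        output_prefix = "#{save_path}/#{filename}"

        # Pass the arguments as a list so that paths containing spaces or
        # shell metacharacters are not interpreted by a shell.
        if filetype == 'jpg'
          system('pdftoppm', '-jpeg', '-jpegopt', "quality=#{quality}", '-r', dpi.to_s, pdf_path.to_s, output_prefix)
        else
          system('pdftoppm', "-#{filetype}", '-r', dpi.to_s, pdf_path.to_s, output_prefix)
        end

        puts 'Complete!' if verbose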

        OcrFile::FileHelpers.fetch_temp_image_paths(save_path, filename, filetype)
      end
    end
  end
end
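
The command that images_from_pdf runs can also be assembled as a plain argument list, which makes the flag selection easy to inspect without invoking pdftoppm. The sketch below is illustrative only: the helper name pdftoppm_args is hypothetical and not part of the ocr-file gem, and the flags (-jpeg, -jpegopt quality=N, -r, and -<filetype> for other formats) are the ones already used in the source above.

```ruby
# Hypothetical helper (not part of ocr-file): builds the pdftoppm argument
# list for the same options that images_from_pdf accepts.
def pdftoppm_args(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300)
  # 'jpg' maps to pdftoppm's -jpeg flag plus a quality option;
  # any other filetype is passed through as "-<filetype>".
  format_flags =
    if filetype == 'jpg'
      ['-jpeg', '-jpegopt', "quality=#{quality}"]
    else
      ["-#{filetype}"]
    end

  # The last argument is the output prefix; pdftoppm appends the page
  # number and file extension itself.
  ['pdftoppm', *format_flags, '-r', dpi.to_s, pdf_path.to_s, "#{save_path}/#{filename}"]
end

# Inspect the commands without running them.
p pdftoppm_args('doc.pdf', '/tmp/out')
p pdftoppm_args('my doc.pdf', '/tmp/out', filetype: 'jpg', quality: 80, dpi: 150)
```

An array built this way could be passed to system(*args), which skips the shell, so a path such as 'my doc.pdf' needs no manual quoting.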

Version data entries

7 entries across 7 versions & 1 rubygems

Version Path
ocr-file-0.0.10 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.8 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.7 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.6 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.4 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.3 lib/ocr-file/image_engines/pdftoppm.rb
ocr-file-0.0.2 lib/ocr-file/image_engines/pdftoppm.rb