... to PDFImages, it generates ppm/pbm and optional jpg pictures on the fly . - I experienced good OCR results, perhaps due to this fact: 'pdfimages extracts the raw image data from the PDF file, without performing any additional transforms. Any rotation, clipping, color inversion, etc. done by the ...