Using the Developer Version of the Amyuni OCR Module

The OCR module can be used for one of two purposes:

  1. Open an existing PDF and convert all text into searchable text. A number of PDF files contain only images or text that is not searchable. Applying OCR on these PDF files will make the text searchable. After opening the PDF file and applying OCR, the PDF can be resaved as another PDF or the text contents exported into a plain text file.

  1. Create a searchable PDF file from various image files such as TIFF, JPEG or PNG. The PDF file is creating by loading each image individually, applying OCR and resaving the PDF.

 

By default, the OCR module will search the tessaract library in the same folder where the DLL is registered.

tessdata : folder tessaract library

Win32 : folder with the x86 version of tesseract41.dll

X64 : folder with the x64 version of tesseract41.dll