pdfOCR
Produced by iText
We are proud to introduce a new iText 7 add-on, iText pdfOCR, which provides Optical Character Recognition (OCR) functionality to convert printed text in scanned documents and images into a fully searchable PDF/A-3u compliant format (PDF version 1.7) and make accessing those texts easier and faster. The iText pdfOCR add-on offers OCR capabilities using open-source Tesseract 4 OCR engine and ML-based ONNX technologies, depending on your requirements. Pre-trained ONNX models can be used for specific languages while Tesseract provides comprehensive (over 100) language support.
Still have questions?
If you are interested in learning more or have additional questions, contact us
If you are interested in learning more about pdfOCR, click here


