PDF Association logo

Discover pdfa.org

Key resources

Get involved

How do you find the right PDF technology vendor?
Use the Solution Agent to ask the entire PDF communuity!
The PDF Association celebrates its members’ public statements
of support
for ISO-standardized PDF technology.

Member Area

Does OCR make sense for digitally generated PDFs?

Digital born documents make use of transparency and spot colors. It isn’t uncommon to use geometric lines & curves to represent text, while the machine-readable information is missing.
Nadine Schuppisser


October 3, 2018

Member News



Scanned PDF files usually consist of one raster image for each page. The OCR engine can recognize the text in this image and make the document searchable. But what about digitally generated documents?

Read more in our PDF expert blog


Pdftools counts more than 5,000 companies and organizations in 70 countries among its customers, making it one of the world’s leading producers of software solutions and developer components for PDF and PDF/A products. The product range support the entire document flow, from raw materials to scanning processes through to signing…

Read more
WordPress Cookie Notice by Real Cookie Banner