Peter Wyatt |
This new corpus – nearly 8 million PDFs totaling about 8 TB – was gathered from across the web in July and August of 2021.
PDF Day in Washington DC consists of 18 fast-paced educational sessions packed with non-commercial information for IT executives on a range of PDF technology-related topics.
The PDF Association today announced the appointment of the organization’s Vice Chairman to the role of Executive Director.
Mail-Gard, a division of IWCO Direct, is one of the nation’s leading providers of print-to-mail continuity and recovery services. With locations in Pennsylvania and Minnesota, Mail-Gard maintains fully secured and dedicated recovery facilities that support cut sheet, continuous form, duplex, MICR …
j2 Global® is a leading cloud services company that improves business performance and efficiency for millions of customers worldwide. j2 provides essential business tools such as online faxing, unified communications, hosted email services, email marketing, virtual phone systems and online …
A PDF/A document requires that all resources, such as fonts and color profiles, be embedded in the file. The archiving of transactional documents can be a nightmare because such documents are usually short by nature and contain a huge number of …
A five-minute survey providing insight into the way people think about electronic documents vs. documents on paper. The questions invite the user to reflect on how they use (or don’t use) electronic documents to get things done.
The PDF Day events in Washington DC and New York City offer executives and managers high-level, non-commercial information to improve their understanding of how PDF fits into their technology infrastructure, and what they could do to further leverage it.
Yvonne Friese, ZBW, shares her thoughts about the PDF Hackathon titled “Preserving PDF – identify, validate, repair”, which took place in Hamburg on 1–2 September 2014. PDF Association Chairman Olaf Drümmer attended the hackathon as a speaker and technical expert.
When you extract images from a PDF file, you sometimes get a bunch of slices of the original image instead of the whole image, most of them consisting of a few image rows or, in extreme cases, just one row. …
In document processing, nothing works without meaningful metadata, at least not if you want to automate it. The XMP format is key.
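As a rough illustration of what automating on XMP metadata can look like, the sketch below reads two Dublin Core fields from an XMP packet using only the Python standard library. It assumes the packet has already been extracted from the PDF as text; the sample packet, its values, and the helper name `read_dublin_core` are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET

# Namespace URIs for RDF and Dublin Core, as used inside XMP packets.
NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical sample packet, invented for illustration only.
SAMPLE_XMP = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
  <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
    <rdf:Description rdf:about="" xmlns:dc="http://purl.org/dc/elements/1.1/">
      <dc:title>
        <rdf:Alt>
          <rdf:li xml:lang="x-default">Invoice 2014-001</rdf:li>
        </rdf:Alt>
      </dc:title>
      <dc:creator>
        <rdf:Seq>
          <rdf:li>Billing System</rdf:li>
        </rdf:Seq>
      </dc:creator>
    </rdf:Description>
  </rdf:RDF>
</x:xmpmeta>"""


def read_dublin_core(xmp_text):
    """Return the title and creators found in an XMP packet (hypothetical helper)."""
    root = ET.fromstring(xmp_text)
    # dc:title is a language alternative (rdf:Alt); take the first entry.
    title = root.find(".//dc:title/rdf:Alt/rdf:li", NS)
    # dc:creator is an ordered list (rdf:Seq) of names.
    creators = root.findall(".//dc:creator/rdf:Seq/rdf:li", NS)
    return {
        "title": title.text if title is not None else None,
        "creators": [c.text for c in creators],
    }


metadata = read_dublin_core(SAMPLE_XMP)
print(metadata)
```

A downstream workflow could route or index documents on fields like these, which only works if the metadata was written consistently in the first place.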