Peter Wyatt |
This new corpus – nearly 8 million PDFs totaling about 8 TB – was gathered from across the web in July/August of 2021. CONTINUE READING |
Today we’re excited to officially announce the next phase of our PDF processing evolution: Datalogics Cloud, a suite of cloud-based PDF processing products, which includes a free app on Zapier and a robust API on Amazon Web Services.
EA-PDF establishes high-level requirements for using PDF technology to package email for long-term preservation.
Google Chrome now supports saving tagged PDFs, plus PDF form filling in the browser. Released on 25th August, version 85 of the Chrome web browser included a couple of significant updates to its PDF output capabilities. Let’s take a quick …
Solimar Systems, Inc., provider of leading workflow solutions for print production and digital communications, today launches Solimar ReadyPDF® Prepress Server (TM). This intuitive, pioneering solution enables users to overcome numerous production inefficiencies associated with PDF files, and benefit from increased …
Dual Lab is proud to announce the new release of the web application ngPDF that demonstrates the use of Tagged PDF documents in the Open Web world. The application demonstrates the implementation of the derivation algorithm developed by the PDF Association. The new …
Users printing to PDF are throwing away information that could be reused by downstream applications. Find out how “Deriving HTML from PDF” addresses this issue by leveraging tagged PDF.
Following the official PDF Association publication of the PDF to HTML Derivation algorithm Dual Lab is glad to release a web application ngPDF.com for experimenting with the conversion from PDF to HTML in a few clicks without leaving your browser.
For the two most common web formats – HTML and PDF – the relationship hasn’t been easy. Whenever PDF is used on a website it’s usually in the form of a download link. Rarely, the end user sees some sort …
In a recent article I discussed using PDF as a container to organize, transport and archive collections of content. Since then I’ve had numerous discussions about this idea with members of the PDF technology and related communities. This article is an …
Introduction PDF technology profiles can be leveraged to provide trusted, predictable containers for record types that often present workflow and preservation challenges, with email and case files (associated files in arbitrary formats) as primary use-cases. Community-developed profiles can solve utilization, preservation …