This new corpus – nearly 8 million PDFs totaling about 8 TB – was gathered from across the web in July/August of 2021.
Peter Wyatt // May 13, 2023
Adobe releases the source code for an early version of PostScript, the forerunner of PDF.
DARPA will provide seed funding of $1.3M for research and prototyping centered on reinventing the PDF specification and delivering on the benefits of defining file formats using unambiguous machine-readable assets.
In a new podcast episode, SE Radio’s Gavin Henry interviews PDF Association CTO Wyatt and CEO Johnson about a wide range of PDF topics.
Technical resources on pdfa.org are now more understandable and navigable by less technically experienced readers as well as knowledgeable developers.
Version 1.18 of veraPDF, the industry-supported PDF/A validator, is now available for download. It adds support for PDF/UA-1 (machine checks) along with other enhancements.
The PDF community was saddened to learn of the passing of Dr. Chuck Geschke, one of the visionaries who helped invent PostScript and PDF technology.
The Technical University of Dresden has created compact, clear, and easy-to-follow instructions for creating accessible PDF documents.
PDF 2.0 Application Note 003 identifies precise locations for PDF objects’ XMP metadata to help drive interoperability across all PDF creators and consumers.
A recent article on Medium reveals a nugget regarding PDF usage: “…Adobe reports that in its 2020 fiscal year alone, about 303 billion PDFs were opened using its Document Cloud service, a 17% annual increase….” As the article rightly points …
The recently announced pdf-issues GitHub repo already has 25 issues, with 12 proposed resolutions for review in the next PDF TWG meeting.