Examining PDFs as an aggregate
What can thousands of PDFs tell us?
About the presenter(s)
Patrick Gallot has been working with software developers and PDFs since 2000. At Datalogics, he is the lead technical support engineer for the Adobe PDF Library and Datalogics PDF Java …
Description
PDF is not a single application's file format; it is a file format shared by thousands of applications, and each PDF producer seemingly has its own quirks for generating PDFs. What can we learn by examining a couple of largish collections of PDFs as an aggregate sample?
Within these file sets: Which are the most common PDF producers? What PDF version is most common? How common are errors? What are the most common errors? How common is PDF tagging? How common is PDF/A? What do these answers mean?
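As a rough illustration of the simplest of these questions, which PDF version is most common, the sketch below tallies the version declared in each file's `%PDF-M.N` header. It is not the presenter's method or tooling: the function names `header_version` and `tally_versions` are hypothetical, and the folder passed in is assumed to hold the collection. It also assumes the header appears within the first 1024 bytes, and it ignores the document catalog's `/Version` entry, which can override the header in PDF 1.4 and later.

```python
import re
from collections import Counter
from pathlib import Path

# The header comment "%PDF-M.N" is expected near the start of the file.
HEADER_RE = re.compile(rb"%PDF-(\d\.\d)")


def header_version(data: bytes):
    """Return the version string from a PDF header, or None if none is found."""
    match = HEADER_RE.search(data[:1024])
    return match.group(1).decode("ascii") if match else None


def tally_versions(folder):
    """Count header versions across every *.pdf file under folder (hypothetical helper)."""
    counts = Counter()
    for path in Path(folder).rglob("*.pdf"):
        with path.open("rb") as handle:
            # Files without a recognizable header are grouped under one label.
            counts[header_version(handle.read(1024)) or "no header"] += 1
    return counts
```

Calling `tally_versions("some/folder").most_common()` would list versions from most to least frequent. Questions about producers, tagging, PDF/A, or errors need a real PDF parser or validator rather than a header scan.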