Peter Wyatt |
This new corpus – nearly 8 million PDFs totaling about 8 TB – was gathered from across the web in July/August of 2021. CONTINUE READING |
Podcast part 1 talks about PDF standards in general. Part 2 discusses PDF/A for long term archiving and part 3 is about Accessibility and PDF/UA.
A domain-specific subset of PDF, PDF/R is simple to generate and interpret, allowing it to replace the TIFF and JPEG file formats for capture from imaging systems.