PDF Association logo

Discover pdfa.org

Key resources

Get involved

How do you find the right PDF technology vendor?
Use the Solution Agent to ask the entire PDF communuity!
The PDF Association celebrates its members’ public statements
of support
for ISO-standardized PDF technology.

Member Area


A case study in PDF forensics: The Epstein PDFs

This article details a PDF forensics case study on a small, random selection of the Epstein PDF files released by the US Department of Justice (DoJ). The tranche contains 4,085 PDF files, with an estimated 5,879 remaining unreleased. Key findings include:

  • A difference in PDF version reporting between forensic tools.
  • The presence of two incremental updates.
  • The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
  • The DoJ avoided JPEG images to prevent metadata leakage.
  • Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.
Peter Wyatt

By Peter Wyatt
December 2025

A case study in PDF forensics: The Epstein PDFs

This article details a PDF forensics case study on a small, random selection of the Epstein PDF files released by the US Department of Justice (DoJ). The tranche contains 4,085 PDF files, with an estimated 5,879 remaining unreleased. Key findings include:

  • A difference in PDF version reporting between forensic tools.
  • The presence of two incremental updates.
  • The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
  • The DoJ avoided JPEG images to prevent metadata leakage.
  • Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.
Peter Wyatt

By Peter Wyatt
December 2025

OCR for PDFs – old news?

December 2019 by Thomas Zellmann
Article


Thomas Zellmann discusses the benefits of OCR and of making PDFs fully searchable, especially as input for AI systems.

Picture of Thomas Zellmann
Visit Thomas Zellmann's profile.

November 2019 by Klaas Posselt
Visit Klaas Posselt‘s profile.

The PDF/UA Technical Working Group released Tagged PDF Best Practice Guide: Syntax – to provide developers and expert users with … Read more

Article

November 2019 by Thomas Zellmann
Picture of Thomas Zellmann
Visit Thomas Zellmann‘s profile.

AI applications for business intelligence processing can extract information from unstructured “born digital” documents, but many archives include years (or … Read more

Article

November 2019 by Duff Johnson
Duff Johnson
Visit Duff Johnson‘s profile.

Apple’s desktop suite, Pages, Keynote and Numbers, now supports creation of tagged (and thus, accessible and reusable) PDF.

Article

October 2019 by PDF Association staff
PDF Association logo
Visit PDF Association staff‘s profile.

In the late 1990s Karl’s ‘Planet PDF’ created a worldwide PDF community and went on to found successful companies leveraging … Read more

News

October 2019 by Duff Johnson
Duff Johnson
Visit Duff Johnson‘s profile.

As of October 7, 2019, websites and mobile applications in the U.S. will be assessed as “public accommodations”, and the … Read more

Article

October 2019 by PDF Association staff
PDF Association logo
Visit PDF Association staff‘s profile.

These vulnerabilities require an attacker to modify an encrypted PDF file, potentially leading to the exfiltration of the document’s contents … Read more

News

October 2019 by Thomas Zellmann
Picture of Thomas Zellmann
Visit Thomas Zellmann‘s profile.

To survive a great flood Noah brought pairs from every species aboard his ark. But is this approach – gathering … Read more

Article

September 2019 by Duff Johnson
Duff Johnson
Visit Duff Johnson‘s profile.

Earlier this year we covered the release of the Mueller Report covering the Special Counsel’s investigation into Russian interference in … Read more

Article

September 2019 by PDF Association staff
PDF Association logo
Visit PDF Association staff‘s profile.

Today the PDF Association publishes a new feature for PDF: a standardized mechanism allowing authors to define and share their … Read more

PDF Association news

September 2019 by PDF Association staff (TWAIN Working Group)
PDF Association logo
Visit PDF Association staff‘s profile.

The TWAIN Working Group, a liaison member of the PDF Association, has just announced the release of TWAIN Direct, their … Read more

News

Member News

WordPress Cookie Notice by Real Cookie Banner