A case study in PDF forensics: The Epstein PDFs

This article details a PDF forensics case study on a small, random selection of the Epstein PDF files released by the US Department of Justice (DoJ). The tranche contains 4,085 PDF files, with an estimated 5,879 remaining unreleased. Key findings include:

  • A difference in PDF version reporting between forensic tools.
  • The presence of two incremental updates.
  • The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
  • The DoJ avoided JPEG images to prevent metadata leakage.
  • Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.

By Peter Wyatt
December 2025

Watch the Electronic Document Conference presentations

August 2019 by PDF Association staff
PDF Association news


The Electronic Document Conference took place in June 2019. All 44 presentations were video-recorded, and the recordings are available to anyone who could not attend in person.

August 2019 by Dietrich von Seggern

This article was recently updated. The new version was posted on February 16, 2021. PDF is one of the most … Read more

Article

August 2019 by Elizabeth Thede

Instead of retrieving and searching each file in its associated application, a search engine needs to review all files together … Read more

Article

July 2019 by Roman Toda (Normex)

Users printing to PDF are throwing away information that could be reused by downstream applications. Find out how “Deriving HTML … Read more

Article, For members only

July 2019 by Duff Johnson

After an impressive effort the Mueller Report is now available as a free EPUB file. What might have been in … Read more

News

July 2019 by PDF Association staff

The Open Preservation Foundation, a PDF Association liaison member, has announced the 1.14 release of the open source, industry-supported PDF/A validator, part … Read more

News

July 2019 by Frode Hegland

The Visual-Meta approach takes the metadata out of the document internals and presents it as an appendix at the … Read more

Article

July 2019 by Duff Johnson

What are the essential characteristics and optimal functional requirements of email messages and necessary related information in a PDF technology-based … Read more

Article, PDF Association news

June 2019 by Dietrich von Seggern (callas software GmbH)

Dietrich von Seggern from callas software is just back from the Electronic Document Conference (EDC) in Seattle. His short summary: It … Read more

Article

June 2019 by Roman Toda

For the two most common web formats – HTML and PDF – the relationship hasn’t been easy. Whenever PDF is … Read more

PDF Association news

June 2019 by Duff Johnson

The SafeDocs research is intended to result in novel parser methodologies to ensure security in digital content.

PDF Association news
