A case study in PDF forensics: The Epstein PDFs

This article details a PDF forensics case study on a small, random selection of the Epstein PDF files released by the US Department of Justice (DoJ). The tranche contains 4,085 PDF files, with an estimated 5,879 remaining unreleased. Key findings include:

  • A difference in PDF version reporting between forensic tools.
  • The presence of two incremental updates.
  • The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
  • The DoJ avoided JPEG images to prevent metadata leakage.
  • Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.

By Peter Wyatt
December 2025

OctoberPDFest recordings now available

November 2020 by PDF Association staff
News


OctoberPDFest recordings are now available! 31 videos offer a wide variety of perspectives on our favorite format.

November 2020 by Peter Wyatt

The original PDF Issue Tracker corpus generated a lot of interest from the PDF technical community; now version 2 of … Read more

Article

October 2020 by Duff Johnson

Introduction Akin to our earlier series on the Mueller Report PDF, this article provides cultural framing and technical background for … Read more

Article

September 2020 by PDF Association staff

The PDF Association’s new PDF Forms Technical Working Group (TWG) is advancing PDF Forms technologies through the introduction of new … Read more

PDF Association news

September 2020 by PDF Association staff

Starting October 5, the PDF Association’s free webinar series celebrates the diversity of PDF technology capabilities and solutions with a … Read more

PDF Association news

September 2020 by Bernd Wild (intarsys GmbH)

The history of integration of digital signatures in PDF together with the underlying public standards like PAdES, CAdES and XAdES … Read more

Article

September 2020 by Paul Rayius (CommonLook)

While testing and verification is never a bad idea, the advantages of automation include speeding up processes and removing human … Read more

Article

September 2020 by PDF Association staff

The PDF Association’s PDF/UA Technical Working Group (PDF/UA TWG) has published a new PDF/UA Reference Suite, an update to the … Read more

PDF Association news

September 2020 by Peter Wyatt

Interoperability is the core value proposition of the PDF file format. Although interoperability comes from sharing a clear and precise … Read more

Article

September 2020 by Matthew Hardy (Adobe)

The PDF Reuse TWG is facilitating the reuse of document content and semantics on a diverse set of devices and … Read more

PDF Association news

September 2020 by Dietrich von Seggern (callas software GmbH)

Dietrich von Seggern talks about an example in which the internet community came up with a smart solution for a … Read more

Article
