A case study in PDF forensics: The Epstein PDFs

This article details a PDF forensics case study on a small, random selection of the Epstein PDF files released by the US Department of Justice (DoJ). The tranche contains 4,085 PDF files, with an estimated 5,879 remaining unreleased. Key findings include:

A difference in PDF version reporting between forensic tools.
The presence of two incremental updates.
The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
The DoJ avoided JPEG images to prevent metadata leakage.
Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.

By Peter Wyatt
December 2025

A case study in PDF forensics: The Epstein PDFs

A difference in PDF version reporting between forensic tools.
The presence of two incremental updates.
The discovery of a hidden (orphaned) document information dictionary revealing the software used in processing.
The DoJ avoided JPEG images to prevent metadata leakage.
Overall, the DoJ’s sanitization workflow could be improved to reduce file size and information leakage.

By Peter Wyatt
December 2025

Member News

JPedal: Mastering PDF Assembly and Page Geometry

January 2026, by Nadir Shah, IDRsolutions

The December 2025 JPedal release introduces advanced document assembly tools, including page normalisation, an automated Table of Contents, and an … Read more

Crawford Technologies Marks 30 Years of Innovation and Customer-centric Excellence

January 2026, Crawford Technologies Inc.

A pioneer in the customer communications management industry, Crawford Technologies celebrates 30 years as a leader in document accessibility technologies, … Read more

Want to make your PDFs 20% smaller for free?

January 2026, by Guust Ysebie, iText

After 30 years of Deflate, PDFs are finally upgrading. Brotli will soon enter the PDF spec, delivering 15–25% smaller files … Read more

pdfAssistant Expands AI Capabilities for Professional Document Finalization

January 2026, by Eric Shore, Datalogics

pdfAssistant.ai expands PDF optimization with AI-powered flattening of transparencies, layers, and annotations to ensure print-ready, secure, and consistent documents.

PDF Annotator 10 – Major Update Brings Modern Design and Powerful New Features

January 2026, by Oliver Grahl, GRAHL software design

PDF Annotator Version 10 is a major update that introduces a modern user interface, significantly improved performance, and many new … Read more

UPDF Achieves G2 Leader Status, Joins Top 4 Global PDF Editors in Winter 2026

January 2026, Superace Software Technologies Co., Ltd.

UPDF, Superace’s all-in-one AI-powered PDF solution, has been named a G2 Leader and ranked among the Top 4 PDF Editors … Read more

Featured articles

Discover pdfa.org

Key resources

Get involved

A case study in PDF forensics: The Epstein PDFs

A case study in PDF forensics: The Epstein PDFs

Member News