PDF Association logo

Discover pdfa.org

Key resources

Get involved

How do you find the right PDF technology vendor?
Use the Solution Agent to ask the entire PDF communuity!
The PDF Association celebrates its members’ public statements
of support
for ISO-standardized PDF technology.

Member Area

Duff’s playground

Extensions to the formal Arlington PDF Data Model

The

Arlington PDF Data Model is an open-source data model of all PDF objects in the PDF Document Object Model. It is defined as a set of text-based TSV files that use a predicate syntax to express data integrity requirements as they are expressed in ISO 32000.

Domains:

  • Computing science
  • Document engineering
  • Formal methods
Formalized grammar for PDF’s graphic operator and operands

Extend the Arlington PDF Data Model with a formal definition of
the PDF content stream operators and operands. See iText pdfcop GitHub repository.
pdfcop currently uses an ANTLR4 grammar but this is not a requirement.

Domains:

  • Computing science
  • Document engineering
  • Formal methods
Optimization of Brotli compression for PDF by using pre-defined dictionaries

Can custom Brotli dictionaries significantly benefit different kinds of PDF streams (e.g., font programs, ICC profiles, PDF content streams, etc.) beyond the
default Brotli dictionary used for web content?

Domains:

  • Computing science
  • Engineering
  • Image processing
WordPress Cookie Notice by Real Cookie Banner