Deriving HTML from PDF TWG

Background

Reliable, deterministic creation of valid HTML from PDF leveraging Tagged PDF is a key objective for many reuse applications.

Read the introductory article explaining the purpose of the algorithm at the core of the Deriving HTML from PDF specification. Roman Toda’s 2019 Electronic Document Conference presentation provides further explanation and visualizations.

Objective

Having published the first edition of “Deriving HTML from PDF”, this group continues to explore opportunities and challenges in advanced reuse of PDF content, with a focus on pathways to expressing PDF content as HTML.

How to participate in the Working Group

It’s possible to participate in the Working Group without attending meetings!

  • The Working Group’s activities are centered on a private GitHub repository.
  • Working Group members may attend the group’s online meetings, currently held on the last Wednesday of every month (20:05–21:00 CET / 14:05–15:00 ET / 11:05–12:00 PT), or review meeting notes and recordings of previous calls.

The Deriving HTML from PDF TWG is open to all PDF Association Individual, Division, Full, Partner and Institutional members.

Join this working group to:

  • Access all of the working group’s private GitHub repositories
  • Access the working group’s private Google Drive area with working documents
  • Participate via the mailing list and view mailing list archives
  • Participate in the approval and publication process
  • Access meeting recordings and notes
  • Access and provide comments on related ISO documents

PDF Association members can join the Deriving HTML from PDF TWG by visiting the Member Area and clicking “Manage Working Groups”.

WORKING GROUP CHAIR


Roman Toda
Foxit Corporation

