Deriving HTML from PDF TWG
Having published the first edition of “Deriving HTML from PDF” this group continues to explore opportunities and challenges in advanced reuse of PDF content with a focus on pathways to HTML expression of PDF content.
Read the introductory article explaining the purpose of the algorithim. Roman’s 2019 Electronic Document Conference presentation provides some further explanation and visualizations.
Participation
PDF Association members can join the Deriving HTML from PDF TWG by logging into pdfa.org, visiting the Member Area and clicking “Manage Communities”.