Extract Text is a REST API tool that extracts text from PDF documents to facilitate search and retrieval, enable reuse and repurposing and streamline workflows.
Although PDF is commonly understood as a single holistic file format, in reality, PDF comprises a number of distinct dialects, each designed to support a specific aspect of the format.
5 years ago the PDF Accessibility LWG began a project to develop a set of accessibility techniques for PDF files. This project is now beginning to come to fruition.