Article
This new corpus – nearly 8 million PDFs totaling about 8 TB – was gathered from across the web in July/August of 2021.
Announcement