OCRd text from Trove books and ephemera
Harvested: August 2021
A harvest of 26,762 files of OCRd text from digitised books and ephemera in Trove. There's about 3.6gb in total. You can either download the complete collection, or use Datasette to find titles of interest that you can download individually.
Download (1.2gb zip) Browse in Datasette
Related resources¶
- Harvesting the text of digitised books (and ephemera)
- CSV formatted list of books available in digital form
Getting help¶
Cite as¶
Sherratt, Tim. (2019). GLAM-Workbench/trove-books (version v0.1.0). Zenodo. https://doi.org/10.5281/zenodo.3549481