OCRd text from Trove books and ephemera

Harvested: August 2021

A harvest of 26,762 files of OCRd text from digitised books and ephemera in Trove, about 3.6 GB in total. You can either download the complete collection or use Datasette to find titles of interest and download them individually.

Download (1.2 GB zip) | Browse in Datasette
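
If you download the complete collection, a quick sanity check is to count the text files in the zip and total their uncompressed size, then compare the result with the figures quoted above. The following is a minimal sketch rather than part of the harvest itself. It assumes the zip has been saved locally under the hypothetical name `trove-books.zip` and that the OCRd text files use a `.txt` extension; the function name `summarise_text_files` is also made up for illustration.

```python
import zipfile
from pathlib import Path


def summarise_text_files(zip_path):
    """Return (count, total_bytes) for the .txt members of a zip archive."""
    count = 0
    total_bytes = 0
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            # Skip directories and anything that is not a plain-text file.
            # Assumption: the OCRd text files end in ".txt".
            if info.is_dir() or not info.filename.lower().endswith(".txt"):
                continue
            count += 1
            total_bytes += info.file_size  # uncompressed size in bytes
    return count, total_bytes


if __name__ == "__main__":
    # Hypothetical local filename; use whatever name the downloaded zip has.
    zip_path = Path("trove-books.zip")
    if zip_path.exists():
        count, total_bytes = summarise_text_files(zip_path)
        print(f"{count} text files, {total_bytes / 1e9:.1f} GB uncompressed")
```

Reading the sizes from the zip's index avoids extracting the files, so the check stays fast even for an archive of this size.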

Getting help

Cite as

Sherratt, Tim. (2019). GLAM-Workbench/trove-books (version v0.1.0). Zenodo. https://doi.org/10.5281/zenodo.3549481