Skip to content

CSV formatted list of digitised books in Trove

This file provides metadata of 21,218 digitised works with the format Book. Unlike previous harvests, this dataset attempts to exclude Parliamentary Papers, which have been harvested separately.

date harvested 2024-02-14
file size 20.3 MB
format text/csv
created by Harvesting the text of digitised books (and ephemera)
number of rows 21,218

Download from GitHub Explore in Datasette

Columns

name type description
title string title of the work
sub_title string additional title or publication information, eg: 'Volume 1'
contributor string contributors including authors, editors, translators; multiple values separated by
publisher string multiple values separated by
date string publication date; multiple values separated by
type string eg: 'text'; multiple values separated by
format string eg: 'Book', 'volume'; multiple values separated by
extent string size or physical dimensions, can include number of pages or number of words; multiple values separated by
language string publication language; multiple values separated by
subject string associated subject headings; multiple values separated by
spatial string associated places (mostly using Library of Congress geographic area codes); multiple values separated by
is_part_of string collections or series this publication is part of; multiple values separated by
identifier string library identifiers; multiple values separated by
rights string copyright and licensing information; multiple values separated by
pages integer number of digitised pages
fulltext_url string link to digitised book viewer
fulltext_url_text string text of link to digitised book viewer
text_download_url string link to download OCRd text of book
catalogue_url string link to NLA catalogue; multiple values separated by
work_url string link to work record in Trove; multiple values separated by
work_type string Trove work format, eg: 'Book'; multiple values separated by
parent string parent work identifiers; multiple values separated by
parent_url string parent work links; multiple values separated by
children any child work identifiers; multiple values separated by
text_file string file name of downloaded text

Examples of use

Getting help

Cite as