Skip to content

Newspaper titles harvested from web archives

Harvested 21 January 2022

The number of digitised newspapers available through Trove has increased dramatically since 2009. Understanding when newspapers were added is important for historiographical purposes, but there's no data about this available directly from Trove. These datasets were created by harvesting information about newspaper titles in Trove from web archives.

Files

trove_newspaper_titles_2009_2021.csv

CSV formatted data file containing details of newspaper titles extracted from web archive captures.

The data file contains the following columns:

Column Contents
title_id title identifier
full_title full title (including location and dates)
title newspaper title
place place of publication
dates date range in Trove
capture_date date of web archive capture
capture_timestamp timestamp of web archive capture

Download from GitHub Explore in Datasette

trove_newspaper_titles_first_appearance_2009_2021.csv

CSV formatted data file containing details of the first appearance of newspaper titles in web archive captures, indicating when the titles were (approximately) added to Trove. The complete list of captures has been filtered to include only the first appearance of each title / place / date range combination.

The dataset contains the following columns:

Column Contents
title_id title identifier
full_title full title (including location and dates)
title newspaper title
place place of publication
dates date range in Trove
capture_date date of web archive capture
capture_timestamp timestamp of web archive capture

Download from GitHub Explore in Datasette

Examples of use

Generated by

Getting help

Cite as

Sherratt, Tim. (2022). GLAM-Workbench/trove-newspaper-titles-web-archives (version v1.2). Zenodo. https://doi.org/10.5281/zenodo.13761732