OCR corrections in Trove newspapers
Updated: 13 September 2024
OCR errors in Trove's digitised newspapers can be corrected by users. To help understand patterns in newspaper correction, this dataset has been created to record information about the number of articles with corrections.
Files¶
There are three files in the dataset:
corrections_by_year.csv
– number of articles corrected in each publication yearcorrections_by_category.csv
– number of articles corrected in each Trove categorycorrections_by_title.csv
– number of articles corrected in each newspaper
The files are in CSV format and contain the following fields.
corrections_by_year.csv
¶
term
– the publication yeartotal_results
– the number of articles with correctionstotal_articles
– the total number of articlesproportion
– the proportion of articles with corrections
Download from GitHub Explore in Datasette
corrections_by_category.csv
¶
term
– the category nametotal_results
– the number of articles with correctionstotal_articles
– the total number of articlesproportion
– the proportion of articles with corrections
Download from GitHub Explore in Datasette
corrections_by_title.csv
¶
id
– the Trove identitifer of the newsspaper titletitle
– the name of the newspaperarticles_with_corrections
– the number of articles with correctionstotal_articles
– the total number of articles from the newspaper in Trovepercentage_with_corrections
– the percentage of articles with corrections
Download from GitHub Explore in Datasette
Generated by¶
Getting help¶
Cite as¶
Sherratt, Tim. (2024). GLAM-Workbench/trove-newspapers-corrections (version v2.1). Zenodo. https://doi.org/10.5281/zenodo.13761546