Harvesting collections of text from archived web pages
Works with the Australian Web Archive (AWA), New Zealand Web Archive (NZWA), Internet Archive (IA), and UK Web Archive (UKWA)
This notebook helps you assemble datasets of text extracted from all available captures of archived web pages. You can then feed these datasets to the text analysis tool of your choice to analyse changes over time.
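As a rough illustration of the general idea, and not the notebook's own code, the sketch below lists a page's captures through the Internet Archive's CDX API, requests each capture, and reduces the HTML to plain text. It covers the Internet Archive only. The function names (`cdx_query_url`, `parse_cdx_rows`, `capture_url`, `html_to_text`, `harvest_texts`) are hypothetical, and the text extraction is a deliberately simple standard-library approach that stands in for whatever extraction method you prefer.

```python
import json
from html.parser import HTMLParser
from urllib.parse import urlencode
from urllib.request import urlopen

# Internet Archive CDX endpoint; the other archives need their own endpoints.
CDX_API = "https://web.archive.org/cdx/search/cdx"


def cdx_query_url(page_url):
    """Build a CDX query for successful captures, collapsing identical content."""
    params = {
        "url": page_url,
        "output": "json",
        "filter": "statuscode:200",
        "collapse": "digest",
    }
    return f"{CDX_API}?{urlencode(params)}"


def parse_cdx_rows(rows):
    """Turn CDX JSON (a header row followed by records) into a list of dicts."""
    if not rows:
        return []
    header, *records = rows
    return [dict(zip(header, record)) for record in records]


def capture_url(capture):
    """URL of one capture; the id_ modifier asks for the page as originally archived."""
    return f"https://web.archive.org/web/{capture['timestamp']}id_/{capture['original']}"


class _TextExtractor(HTMLParser):
    """Collect visible text, ignoring the contents of script and style tags."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Naive text extraction: one line per text node."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)


def harvest_texts(page_url):
    """Yield (timestamp, text) for each capture. This makes network requests."""
    with urlopen(cdx_query_url(page_url)) as response:
        captures = parse_cdx_rows(json.load(response))
    for capture in captures:
        with urlopen(capture_url(capture)) as response:
            html = response.read().decode("utf-8", errors="replace")
        yield capture["timestamp"], html_to_text(html)
```

Each `(timestamp, text)` pair from `harvest_texts` could then be written to its own file, named by timestamp, so that a text analysis tool can compare versions in date order. For pages with many captures it would be polite to pause between requests.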
Sherratt, Tim & Jackson, Andrew. (2022). GLAM-Workbench/web-archives (version v1.1.0). Zenodo. https://doi.org/10.5281/zenodo.6450762
The Web Archives section of the GLAM Workbench is sponsored by the British Library.