Exploring text files harvested with the Trove Harvester

This notebook suggests some ways in which you can aggregate and analyse the individual OCRd text files for each article — look at word frequencies ; calculate TF-IDF values. (Under construction)

Preview

Expand

Using this notebook¶

ARDC BinderMyBinderOther options

To run this notebook using the ARDC Binder service you'll need to log in using an account from an Australian university or research organisation. If you don't have an account, try MyBinder instead.

Run live on ARDC Binder

The MyBinder service doesn't require any authentication, but it can be slow to start and will sometimes fail when busy. If you have a login at an Australian university, you'll probably get better results with ARDC Binder.

Run live on MyBinder

Binder is great for experimentation and quick tasks, but for some projects you might need a dedicated, persistent environment in which to work. There's information on other options in the run these notebooks section.

Additional documentation¶

Run these notebooks

Getting help¶

Cite as¶

Sherratt, Tim. (2024). GLAM-Workbench/trove-newspaper-harvester (version v2.1.1). Zenodo. https://doi.org/10.5281/zenodo.11295552