Welcome to the GLAM Workbench¶
I've added links and buttons to run many of the GLAM Workbench repositories in the ARDC's new Binder service. You'll need a university login to access this service. If you don't have university credentials you can continue to use the international, unauthenticated Binder service as before. Find out more on the Using ARDC Binder page.
Here you’ll find a collection of tools, tutorials, examples, and hacks to help you work with data from galleries, libraries, archives, and museums (the GLAM sector). The primary focus is Australia and New Zealand, but new collections are being added all the time. Let me know if there’s some GLAM data you’d like me to explore – suggestions are always welcome!
Quick start¶
The resources in the GLAM Workbench are created and shared as Jupyter notebooks. Jupyter lets you combine narrative text and live code in an environment that encourages you to learn and explore. Jupyter notebooks run in your browser, and you can get started without installing any software!
If you want to dive straight in, just have a look around the site and click on one of the links that says 'Run live on Binder'. This will open the notebook, ready to use, in a customised computing environment using the Binder service.
If that seems too scary, here's some first steps to get you started.
You'll find more information in the Getting Started section. If you have any questions, or strike any problems, head to the OzGLAM Help site.
Finding GLAM data¶
When we talk about GLAM data we’re usually referring to the collections held by cultural institutions – books, manuscripts, photographs, objects, and much more. We’re used to exploring these collections through online search interfaces or finding aids, but sometimes we want to do more – instead of a list of search results on a web page, we want access to the underlying collection data for analysis, enrichment, or visualisation. We want collections as data.
This GLAM Workbench shows you how to create your own research datasets from a variety of GLAM collections. In some cases cultural institutions provide direct access to collection data through APIs (Application Programming Interfaces) or data downloads. In other cases we have to find ways of extracting data from web interfaces – a process known as screen-scraping. Here you’ll find examples of all these approaches, as well as links to a number of pre-harvested datasets. For example:
Harvesting data¶
- Trove Newspaper and Gazette Harvester
- Get OCRd text from a digitised journal in Trove
- Harvest parliament press releases from Trove
- Harvest data from Papers Past
- Harvest items from a search in RecordSearch
- Harvesting collections of text from archived web pages
Data sources¶
- Sources of Australian GLAM data
- Commonwealth Hansard XML repository
- Australian Women's Weekly front covers, 1933 to 1982
- ABC Radio National Programs
Asking different questions¶
What can you do with GLAM data?
- Shift scales – from individual items to big pictures
- Find patterns – change over time, distribution through space
- Extract features – find people, places, words, images
- Make connections – link within and between collections
- Get creative – reuse collections in unexpected ways
The GLAM Workbench demonstrates a variety of tools and techniques that you can use to ask different questions of your data. For example:
- Visualise Trove newspaper searches over time
- Convert a year's worth of Historic Hansard into a dataframe for analysis
- Exploring subdomains in the whole of gov.au
- Explore places associated with NMA collection objects
- Finding faces in the SLNSW Tribune collection
- Create 'scissors and paste' messages from Trove newspaper articles
Hacking heritage¶
Digital access means more than just putting stuff online. What you can actually do with collections is constrained by the design of interfaces, the quality of documentation, the format of the data, and many other factors. The GLAM Workbench provides hacks and workarounds that help you peek behind the search box, and do what you want to do with online collections. For example:
- Save a Trove newspaper article as an image
- Finding non-English newspapers in Trove
- Random items from Trove
- Download an image from SLV using the IIIF server and a Handle url
- Select a random(ish) record from DigitalNZ
Bringing documentation alive¶
The way Jupyter notebooks combine text and code into a 'computational narrative', makes them ideal for documentating GLAM data sources. You don't just have to describe how an API works, you can show it in action. The GLAM Workbench leads you through a range of GLAM data sources, asking questions, offering examples, and demonstrating both the possibilities and limitations.
- A random item from Museums Victoria's collections!
- Exploring the Te Papa collection API
- Timegates, Timemaps, and Mementos
- Build a DigitalNZ API search query
- More fun with IIIF
Do I need to be able to code?¶
No, you can use the Jupyter notebooks within the workbench without any coding experience – just edit and click where indicated. But every time you do edit one of the notebooks, you are coding. The notebooks provide an opportunity to gain confidence and experiment. They might not turn you into a coder, but they will show you how to do useful things with code.
Go to the getting started page for more information about using Jupyter notebooks.
Cite as¶
To refer to the GLAM Workbench as a whole, use the following:
Tim Sherratt. (2021). GLAM Workbench (version v1.0.0). Zenodo. https://doi.org/10.5281/zenodo.5603060
See individual sections for suggested citations.