Tags¶
Warning
Under construction!
API¶
- Exploring Trove Facets
- Working with Trove Zones
- Your first Trove API request
- Convert a Trove list into a CSV file
- Harvesting the complete set of data from the People and Organisations zone
- Compare two versions of an archived web page
- Comparing CDX APIs
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
- Display changes in the text of an archived web page over time
- Exploring subdomains in the whole of gov.au
- Exploring the Internet Archive's CDX API
- Find and explore Powerpoint presentations from a specific domain
- Find when a piece of text appears in an archived web page
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Unique subdomains of gov.au split into components
- Unique subdomains of gov.au in SURT format
- Circular dendrograms of gov.au subdomains
- Harvest of unique urls from the gov.au domain
- Harvesting data about a domain using the IA CDX API
- Harvesting collections of text from archived web pages
- Observing change in a web page over time
- Timegates, Timemaps, and Mementos
- Timemaps vs CDX APIs
Amazon s3¶
CSV¶
CSV dataset¶
- CSV list of indexes
- Repository of harvested indexes
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- CSV formatted list of Trove books available in digital form
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- CSV formatted list of journals available from Trove in digital form
- CSV formatted list of journals with OCRd text
- OCRd text from Trove digitised journals
CloudStor¶
- Cloudstor access to a public share via WebDAV
- Creating and sharing public links to nested resources in CloudStor
CollectionBuilder¶
Datasette¶
- Add content from the Tasmanian Post Office Directories to an SQLite database
- Display the results of a harvest as a searchable database using Datasette
- Create a database to search across each line of text in a series of volumes
EAC-CPF¶
- Harvesting the complete set of data from the People and Organisations zone
- Harvesting the complete set of data from the People and Organisations zone using OAI-PMH
- Extract some aggregated data from the complete harvest
- Harvest SRU API results as JSON
IIIF¶
Internet Archive¶
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- Getting the text of Trove books from the Internet Archive
- OCRd text from the Internet Archive of 'Australian' books listed in Trove
National Archives of Australia¶
- Create a Gannt chart of Australian government departments
- Create a network graph visualisation of Australian government departments
- Visualise the connections of a single Australian government agency
OAI-PMH¶
OCR¶
OCR quality¶
PDF¶
PyMuPDF¶
SQLite¶
SRU¶
- Harvesting the complete set of data from the People and Organisations zone
- Harvest SRU API results as JSON
Tesseract¶
UpSet plot¶
VIAF¶
Voilá¶
Wikidata¶
- Create a Gannt chart of Australian government departments
- Create a network graph visualisation of Australian government departments
- Visualise the connections of a single Australian government agency
analysis¶
api¶
coordinates¶
copyright¶
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
- Finding unpublished works that might be entering the public domain on 1 January 2019
- Finding unpublished works that might be entering the public domain on 1 January 2019
crowdsourcing¶
data harvesting¶
- Getting the text of Trove books from the Internet Archive
- Harvesting the text of digitised books (and ephemera)
- Harvesting articles that mention "Anzac Day" on Anzac Day
- Trove Harvester web app
- Using TroveHarvester to get newspaper and gazette articles in bulk
- Create a list of Trove's digitised journals
- Create a database to search across each line of text in a series of volumes
- Finding editorial cartoons in the Bulletin
- Get covers (or any other pages) from a digitised journal in Trove
- Download the OCRd text for ALL the digitised journals in Trove!
- Get OCRd text from a digitised journal in Trove
- Harvest parliament press releases from Trove
- Harvest summary data from Trove lists
- Harvest public tags from Trove zones
- Exploring digitised maps in Trove
- Harvest ABC Radio National records from Trove
- Harvesting the complete set of data from the People and Organisations zone
- Harvesting the complete set of data from the People and Organisations zone using OAI-PMH
- Exploring subdomains in the whole of gov.au
- Find and explore Powerpoint presentations from a specific domain
- Harvesting collections of text from archived web pages
dataset¶
- Count of records by contributor and zone
- Trove lists metadata
- Trove public tags
- Trove tag counts
- Trove digitised maps – coordinates
- Trove digitised maps metadata
- Harvest of ABC Radio National metadata
- Aggregated data extracted from People and Organisations data
- Complete harvest of People and Organisations data
- NLA digitised finding aids: summary information
- NLA digitised finding aids: list of urls
- Unpublished works that might be entering the public domain on 1 January 2019
documentation¶
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Timegates, Timemaps, and Mementos
exhibition¶
finding aids¶
- Convert a HTML finding aid to JSON
- Find urls of digitised finding aids
- Collect information about digitised finding aids
fun¶
geospatial¶
government¶
harvest¶
- Get details of online indexes
- Harvest online indexes
- Create a flat list of organisations contributing metadata to Trove
- Get the number of records from each Trove contributor by zone and format
- Find urls of digitised finding aids
- Collect information about digitised finding aids
image dataset¶
images¶
- Download and process Tasmanian Post Office Directory PDFs
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
- Finding editorial cartoons in the Bulletin
- Get covers (or any other pages) from a digitised journal in Trove
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
language detection¶
licensing¶
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
manuscripts¶
- Finding unpublished works that might be entering the public domain on 1 January 2019
- Finding unpublished works that might be entering the public domain on 1 January 2019
- Convert a HTML finding aid to JSON
- Find urls of digitised finding aids
- NLA digitised finding aids: summary information
- NLA digitised finding aids: list of urls
- Collect information about digitised finding aids
- Unpublished works that might be entering the public domain on 1 January 2019
maps¶
- Exploring digitised maps in Trove
- Parse map coordinates from metadata
- Trove digitised maps – coordinates
- Trove digitised maps metadata
metadata¶
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- CSV formatted list of Trove books available in digital form
- Government publications from Trove in digital form
- Metadata for Trove digitised works
- Create a flat list of organisations contributing metadata to Trove
- Get the number of records from each Trove contributor by zone and format
- Count of records by contributor, zone, and format
- List of organisations contributing metadata to Trove
- Count of records by contributor and zone
- Create a list of Trove's digitised journals
- CSV formatted list of journals available from Trove in digital form
- CSV formatted list of journals with OCRd text
- Harvest parliament press releases from Trove
- List of journals with OCRd text
- OCRd text from Trove digitised journals
- Trove lists metadata
- Trove public tags
- Trove tag counts
- Exploring digitised maps in Trove
- Parse map coordinates from metadata
- Trove digitised maps – coordinates
- Trove digitised maps metadata
- Harvest of ABC Radio National metadata
- Harvest ABC Radio National records from Trove
- Aggregated data extracted from People and Organisations data
- Complete harvest of People and Organisations data
- NLA digitised finding aids: summary information
- NLA digitised finding aids: list of urls
- Collect information about digitised finding aids
- Unpublished works that might be entering the public domain on 1 January 2019
- Unique subdomains of gov.au split into components
- Unique subdomains of gov.au in SURT format
- Circular dendrograms of gov.au subdomains
- Harvest of unique urls from the gov.au domain
- Harvesting data about a domain using the IA CDX API
- Observing change in a web page over time
network graph¶
public domain¶
- Finding unpublished works that might be entering the public domain on 1 January 2019
- Finding unpublished works that might be entering the public domain on 1 January 2019
- NLA digitised finding aids: summary information
- NLA digitised finding aids: list of urls
- Unpublished works that might be entering the public domain on 1 January 2019
pyvips¶
random¶
- Get an random newspaper article from Trove
- Get a random work from Trove using queries and facets
- Get a random work from Trove by generating random ids
scraper¶
screenscraping¶
screenshots¶
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
text¶
- Download and process Tasmanian Post Office Directory PDFs
- Download the OCRd text for ALL the digitised journals in Trove!
- Get OCRd text from a digitised journal in Trove
- Harvest parliament press releases from Trove
- Harvesting collections of text from archived web pages
text analysis¶
- Counting words and phrases
- Exploring the Digitised Books Collection from Trove by Adel Rahmani
- Recipe generator
- Exploring your TroveHarvester data
- Exploring text files harvested with the Trove Harvester
- Topic Modelling of Australian Parliamentary Press Releases by Adel Rahmani
- Compare two versions of an archived web page
- Display changes in the text of an archived web page over time
- Find when a piece of text appears in an archived web page
text dataset¶
- OCRd text from the Internet Archive of 'Australian' books listed in Trove
- OCRd text from Trove books and ephemera
- List of journals with OCRd text
- OCRd text from Trove digitised journals
- Politicians talking about COVID
- Politicians talking about 'immigrants' and 'refugees'
topic modelling¶
- Exploring the Digitised Books Collection from Trove by Adel Rahmani
- Topic Modelling of Australian Parliamentary Press Releases by Adel Rahmani
visualisation¶
- GLAM CSV Explorer
- NSW State Archives Index Explorer
- Exploring your TroveHarvester data
- Analyse public tags added to Trove
- Exploring ABC Radio National metadata
- Visualise the total number of newspaper articles in Trove by year and state
- Visualising intersections and overlaps between data sources
- Compare two versions of an archived web page
- Exploring subdomains in the whole of gov.au
- Circular dendrograms of gov.au subdomains
- Observing change in a web page over time
- Create a Gannt chart of Australian government departments
- Create a network graph visualisation of Australian government departments
- Visualise the connections of a single Australian government agency
web app¶
web archives¶
- Compare two versions of an archived web page
- Comparing CDX APIs
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
- Display changes in the text of an archived web page over time
- Exploring subdomains in the whole of gov.au
- Exploring the Internet Archive's CDX API
- Find and explore Powerpoint presentations from a specific domain
- Find when a piece of text appears in an archived web page
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Harvesting data about a domain using the IA CDX API
- Harvesting collections of text from archived web pages
- Observing change in a web page over time
- Timegates, Timemaps, and Mementos
- Timemaps vs CDX APIs