Tags¶
Warning
Under construction!
API¶
- Compare two versions of an archived web page
- Comparing CDX APIs
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
- Display changes in the text of an archived web page over time
- Exploring subdomains in the whole of gov.au
- Exploring the Internet Archive's CDX API
- Find and explore Powerpoint presentations from a specific domain
- Find when a piece of text appears in an archived web page
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Unique subdomains of gov.au split into components
- Unique subdomains of gov.au in SURT format
- Circular dendrograms of gov.au subdomains
- Harvest of unique urls from the gov.au domain
- Harvesting data about a domain using the IA CDX API
- Harvesting collections of text from archived web pages
- Observing change in a web page over time
- Timegates, Timemaps, and Mementos
- Timemaps vs CDX APIs
CSV dataset¶
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- CSV formatted list of Trove books available in digital form
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- CSV formatted list of journals available from Trove in digital form
- CSV formatted list of journals with OCRd text
- OCRd text from Trove digitised journals
CloudStor¶
- Cloudstor access to a public share via WebDAV
- Creating and sharing public links to nested resources in CloudStor
Datasette¶
Internet Archive¶
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- Getting the text of Trove books from the Internet Archive
- OCRd text from the Internet Archive of 'Australian' books listed in Trove
Voilá¶
copyright¶
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
data harvesting¶
- Getting the text of Trove books from the Internet Archive
- Harvesting the text of digitised books (and ephemera)
- Trove Harvester web app
- Using TroveHarvester to get newspaper and gazette articles in bulk
- Create a list of Trove's digitised journals
- Finding editorial cartoons in the Bulletin
- Get covers (or any other pages) from a digitised journal in Trove
- Download the OCRd text for ALL the digitised journals in Trove!
- Get OCRd text from a digitised journal in Trove
- Harvest parliament press releases from Trove
- Exploring subdomains in the whole of gov.au
- Find and explore Powerpoint presentations from a specific domain
- Harvesting collections of text from archived web pages
documentation¶
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Timegates, Timemaps, and Mementos
fun¶
geospatial¶
government¶
image dataset¶
images¶
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
- Finding editorial cartoons in the Bulletin
- Get covers (or any other pages) from a digitised journal in Trove
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
licensing¶
- Rights applied to images by each Trove contributor
- Rights applied to out-of-copyright photographs by each Trove contributor
- The use of standard licences and rights statements in Trove image records
metadata¶
- CSV formatted list of 'Australian' books in Trove with full text versions in the Internet Archive
- CSV formatted list of Trove books available in digital form
- Government publications from Trove in digital form
- Metadata for Trove digitised works
- Create a list of Trove's digitised journals
- CSV formatted list of journals available from Trove in digital form
- CSV formatted list of journals with OCRd text
- Harvest parliament press releases from Trove
- OCRd text from Trove digitised journals
- Unique subdomains of gov.au split into components
- Unique subdomains of gov.au in SURT format
- Circular dendrograms of gov.au subdomains
- Harvest of unique urls from the gov.au domain
- Harvesting data about a domain using the IA CDX API
- Observing change in a web page over time
screenscraping¶
screenshots¶
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
text¶
- Download the OCRd text for ALL the digitised journals in Trove!
- Get OCRd text from a digitised journal in Trove
- Harvest parliament press releases from Trove
- Harvesting collections of text from archived web pages
text analysis¶
- Counting words and phrases
- Exploring the Digitised Books Collection from Trove by Adel Rahmani
- Recipe generator
- Exploring your TroveHarvester data
- Exploring text files harvested with the Trove Harvester
- Topic Modelling of Australian Parliamentary Press Releases by Adel Rahmani
- Compare two versions of an archived web page
- Display changes in the text of an archived web page over time
- Find when a piece of text appears in an archived web page
text dataset¶
- OCRd text from the Internet Archive of 'Australian' books listed in Trove
- OCRd text from Trove books and ephemera
- OCRd text from Trove digitised journals
- Politicians talking about COVID
- Politicians talking about 'immigrants' and 'refugees'
topic modelling¶
- Exploring the Digitised Books Collection from Trove by Adel Rahmani
- Topic Modelling of Australian Parliamentary Press Releases by Adel Rahmani
visualisation¶
- GLAM CSV Explorer
- Exploring your TroveHarvester data
- Compare two versions of an archived web page
- Exploring subdomains in the whole of gov.au
- Circular dendrograms of gov.au subdomains
- Observing change in a web page over time
web app¶
web archives¶
- Compare two versions of an archived web page
- Comparing CDX APIs
- Using screenshots to visualise change in a page over time
- Create and compare full page screenshots from archived web pages
- Display changes in the text of an archived web page over time
- Exploring subdomains in the whole of gov.au
- Exploring the Internet Archive's CDX API
- Find and explore Powerpoint presentations from a specific domain
- Find when a piece of text appears in an archived web page
- Find all the archived versions of a web page
- Get the archived version of a page closest to a particular date
- Harvesting data about a domain using the IA CDX API
- Harvesting collections of text from archived web pages
- Observing change in a web page over time
- Timegates, Timemaps, and Mementos
- Timemaps vs CDX APIs