Trove API v3

Trove API access restrictions

With the cancellation of my Trove API keys by the National Library of Australia, I've made the difficult decision to stop work on Trove and archive all related code repositories.

The Trove sections of the GLAM Workbench will remain online, but they won't be updated. Everything here is openly-licensed, so feel free to take what’s useful and develop it further yourself.

Given the fact that the NLA is willing to change the API terms of use to restrict access without any consultation, provides no transparency around acceptable use of full text content, and is willing to cancel API keys without warning, I can no longer recommend Trove as a reliable source for digital research.

Version 2 of the Trove API has now been decommissioned, so you have to use v3. The Trove documentation describes v3 as 'finalised', but there are still a number of outstanding bugs.

Timeline¶

March 2023: Trove API v3 beta (release 1)
May 2023: Trove API v3 beta (release 2)
June 2023: Trove API v3 beta (release 3)
silence for many months...
September 2024: Trove API v2 decommissioned without warning

Trove API Console¶

The Trove API Console has been updated to add examples from v3 of the API. It provides many sample queries that you can run and modify without the need for an API key.

Migration tips¶

All the breaking changes I know about are listed below, but here's a short summary of things you should look out for when migrating your code from version 2 to version 3 of the Trove API.

Update API version¶

This is an obvious one – change the /v2/ in your API urls to /v3/!

Moving from zones to categories¶

You can't just change all the the zone parameters to category as there's no one to one correspondence between the old zones and new categories. In some cases you can add the l-artType facet to try to match the old behaviour.

Zone	Category
`newspaper`	`newspaper` – add `l-articleType=newspapers` (note the plural) to keep gazettes out
`gazette`	`newspaper` – add `l-articleType=gazette`
`book`	`book` – mostly the same but seems to have gained some formats and lost theses
`article`	this zone has been split – NLA digitised articles are in the `magazine` category, other articles are in `research` and `book`, periodical records are in `book`
`picture`	`image` – this category now includes the contents of the former `map` zone, use `l-artType=Pictures and photos` to filter the maps out
`music`	`music`
`map`	`picture` – add `l-artType=Maps`
`collection`	`diary`
`list`	`list`
`people`	`people`

Be aware of changes to record structure¶

This one is easy to overlook and likely to have you scratching your head at unexpected errors. The actual structure of most JSON and XML records has changed. In most cases the top level container has been removed, so any code that drills down through the record hierarchy to find a value will break. See below for the specific changes for each endpoint.

There's also some changes in the way lists of resources are provided. For example, facets are now always returned as an array of objects no matter how many you request. Previously if you requested a single facet, you got a single object.

If something's not working after you migrate – check these things first! They've already tripped me up a couple of times.

Other odd and annoying changes¶

Some changes just seem to be designed to break your code. In v2 the text version of a newspaper title was found in record["title"]["value"], but in v3 this has changed to record["title"]["title"]. The word and illtype facets have changed to wordCount and illustrationType. Why?

See below for various odd changes to List records.

Some url values have changed¶

The value of trovePageUrl has changed. Previously it was of the form https://trove.nla.gov.au/ndp/del/page/[PAGE ID], but in v3 it is https://nla.gov.au/nla.news-page[PAGE ID]. If, like me, you've been using regular expressions to extract the numeric identifier you'll have to update your code.

Don't lose your keys!¶

API keys now expire after 12 months, even if they're in use! You'll get an email asking if you want to renew your key – don't ignore this email or your application will stop working!

Summary of breaking changes¶

Info

This is a work in progress as I work my way through the API changes. Additions are very welcome.

This summary only includes breaking changes – the things you'll need to change in your code before v2 of the API is switched off in early 2024. For a full list of changes see the introduction to the v3 API, and the v3 technical guide.

The release notes don't mention the changes to the JSON responses. In general, the changes flatten the JSON responses by removing top-level wrappers that served no function. This means the way you access data within responses has changed. For example, assuming that you've saved the JSON response as a variable called data, to access the title of a newspaper article retrieved via the /newspaper/[article id] endpoint in version 2 you would use:

data["article"]["heading"]

While in version 3 you'd use:

data["heading"]

There are similar changes across all endpoints.

All endpoints¶

API keys will now automatically expire 12 months after activation! (There will be an option to renew.)
change v2 to v3 in the endpoint url!
callback parameter for JSONP removed, use CORS instead
~~default encoding now JSON not XML (I've been told this is a bug and release 3 will make XML the default again)~~ Default encoding is once again XML (same as API v2).

`/result` endpoint¶

zone parameter has been replaced with category, possible values are: all, newspaper, magazine, image, research, book, diary, music, people, list; note that in most cases there is no one-to-one correspondence between zones and categories as the boundaries of what's included have changed, for example, the 'newspaper' category includes the contents of both the 'newspaper' and 'gazette' zones
the handling of 'blank' searches has changed – ~~in version 2 you could use a " " space as a value for q allowing you to search for everything, this doesn't work in v3beta, however, using the unicode null value, %00, does seem to work~~ release 2 of the v3 beta made the q (query) parameter optional. This means blank searches are supported without any need for workarounds.
the behaviour of the include parameter has changed – previously you could supply multiple values to include as a comma-separated string, now to specify mutiple values you have to include include multiple times (eg. instead of &include=tags,comments you need &include=tags&include=comments).
some facet names have changed: word is now wordCount, and illtype is now illustrationType
JSON response format changes:
- top-level response key removed
- zone key changed to category
- the meaning of the name parameter inside each category value has change – the old name value can be accessed via the code key, while name now returns the category's display name.
- If you requested a single facet in version 2, it was returned as a JSON object. In version 3, facets are always returned as an array of objects no matter how many you request. So if you include the facet=format parameter in your request, the facet terms within each category will be in category["facets"]["facet"][0] rather than category["facets"]["facet"]. Compare the v2 result with the v3 result.

Newspapers and gazette records (both `/result` and `/newspaper/[article id]`)¶

urls of newspaper and gazette articles and pages returned by the API have changed to use NLA identifiers. In most cases this shouldn't break anything as they still go to the same place. However, if you've been extracting the numeric page identifier from the trovePageUrl then you might need to adjust your code. Previously the trovePageUrl value was of the form https://trove.nla.gov.au/ndp/del/page/[PAGE ID], but in v3 it is https://nla.gov.au/nla.news-page[PAGE ID].
the structure of the title element has changed – in v2 the text version of a newspaper title was found in record["title"]["value"], but in v3 this has changed to record["title"]["title"]

List records (both `/result` and `/list/[list id]`)¶

in v2 the dates a list was created and updated were available at record["created"] and record["lastupdated"], but in v3 both dates are under record["date"] so record["date"]["created"] and record["date"]["lastupdated"].
the values of tagCount and commentCount have moved – in v2 they were available at record["tagCount"] and record["commentCount"], but in v3 they have moved down a level to record["tagCount"]["value"] and record["commentCount"]["value"]

`/work/[work id]` endpoint¶

JSON response format changes:
- top-level work key removed

`/newspaper/[article id]` endpoint¶

JSON response format changes:
- top-level article key removed

`/newspaper/titles` endpoint¶

JSON response format changes:
- top-level response and records keys removed

`/newspaper/titles/[title id]` endpoint¶

JSON response format changes:
- top-level newspaper key removed

`/list/[list id]` endpoint¶

JSON response format changes:
- top-level list key removed – note too that in v2 list returned an array (even though there was only one value), in v3 there is just a single value, not an array; so instead of using data["list"][0]["title"] to get the title of a list, you'll need to use data["title"].

`contributor` endpoint¶

There are major changes here that aren't mentioned in the release notes. Previously a call to this endpoint without additional parameters just returned a long list of contributors. Now, you need to supply a q parameter to filter the results.

~~q parameter added and required – to do a blank search and get all contributors (ie the v2 behaviour), include the q parameter with no value, eg: contributor?q=&encoding=json&reclevel=full~~ Release 2 of the v3 API beta made the q parameter optional.
JSON response format changes:
- top-level response key removed

`contributor/[NUC]` endpoint¶

JSON response format changes:
- top-level contributor key removed

`magazine/titles/` endpoint¶

~~This endpoint was introduced in release 2 of the Trove API v3 beta, however, it's currently returning no results. I've been told this will be fixed in release 3.~~ Fixed in release 3.

Trove API v3

Timeline¶

Trove API Console¶

Migration tips¶

Update API version¶

Moving from zones to categories¶

Be aware of changes to record structure¶

Other odd and annoying changes¶

Some url values have changed¶

Don't lose your keys!¶

Summary of breaking changes¶

All endpoints¶

/result endpoint¶

Newspapers and gazette records (both /result and /newspaper/[article id])¶

List records (both /result and /list/[list id])¶

/work/[work id] endpoint¶

/newspaper/[article id] endpoint¶

/newspaper/titles endpoint¶

/newspaper/titles/[title id] endpoint¶

/list/[list id] endpoint¶

contributor endpoint¶

contributor/[NUC] endpoint¶

magazine/titles/ endpoint¶