Trove music and sound
Trove's 'music' zone includes music, oral history interviews, and radio programs. You can access metadata from the music zone through the Trove API.
Tips, tools, and examples
Harvest ABC Radio National records from Trove
Trove harvests details of programs and segments broadcast on ABC Radio National. You can find them by searching for nuc:"ABC:RN" in the Music & Audio category. The records include basic metadata such as titles, dates, and contributors, but not full transcripts or audio. This notebook harvests, cleans, and saves all the available Radio National data from Trove.
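As a rough sketch of what that search could look like as an API request, the snippet below only builds a request URL and does not contact Trove. The endpoint and the parameter names (`key`, `zone`, `q`, `encoding`) are assumptions based on version 2 of the Trove API and may not match the version you are using; `build_search_url` and `YOUR_API_KEY` are hypothetical placeholders.

```python
from urllib.parse import urlencode

# Assumption: the version 2 search endpoint of the Trove API.
# Check the current Trove API documentation before relying on it.
API_URL = "https://api.trove.nla.gov.au/v2/result"


def build_search_url(api_key):
    """Build a search URL for ABC Radio National records in the music zone."""
    params = {
        "key": api_key,        # your own Trove API key
        "zone": "music",       # assumed name of the music zone parameter value
        "q": 'nuc:"ABC:RN"',   # the query given in the text above
        "encoding": "json",    # ask for JSON rather than XML
    }
    return f"{API_URL}?{urlencode(params)}"


# YOUR_API_KEY is a placeholder, not a working key.
search_url = build_search_url("YOUR_API_KEY")
print(search_url)
```

The harvesting notebook described above does considerably more than this, such as paging through results and cleaning the records.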
Exploring ABC Radio National metadata
This notebook shows a few ways you can start to explore the ABC Radio National metadata harvested by the notebook above.
Data
ABC Radio National programs
The full harvest of ABC Radio National program metadata, containing more than 400,000 records, is available for download from CloudStor in the following formats:
- 580 MB JSONL file (or compressed as 100 MB zip file) – JSONL saves each record as a JSON object, separated by line breaks
- 340 MB CSV file (or compressed as 80 MB zip file)
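Because each line of a JSONL file is a complete JSON object, the harvest can be read one record at a time without loading the whole file into memory. The following is a minimal sketch using only the standard library; `parse_jsonl` is a hypothetical helper, and the sample lines are made-up placeholders rather than real records.

```python
import json


def parse_jsonl(lines):
    """Yield one record (a dict) for each non-empty line of JSONL text."""
    for line in lines:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


# Made-up placeholder lines standing in for the real file.
sample_lines = [
    '{"work_id": "1", "title": "Example segment"}',
    "",
    '{"work_id": "2", "title": "Another example segment"}',
]

records = list(parse_jsonl(sample_lines))
print(len(records))
```

An open file can be passed in place of `sample_lines`, since iterating over a text file also yields one line at a time.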
You can also download CSVs for individual programs:
Data fields
Any of the fields other than work_id and version_id might be empty, though in most cases there should at least be values for title, date, creator, contributor and isPartOf.
- work_id – identifier for the containing work in Trove (you can use this to create a URL to the item)
- version_id – an identifier for the version within the work
- title – title for the program or segment
- isPartOf – name of the program this is a part of
- date – ISO formatted date
- creator – usually just the ABC
- contributor – a list of names of those involved, such as the host, reporter or guest
- publisher – usually just the ABC
- rights – copyright information
- type – list of types (not sure how this differs from format)
- format – list of formats (not sure how this differs from type)
- abstract – text providing a summary of the program or segment (may include multiple values)
- subject – list of subject tags (uncontrolled and very messy)
- description – truncated text fragment from the start of the transcript (may include multiple values)
- fulltext_url – link to the page on the ABC website where you can find more information
- thumbnail_url – link to a related thumbnail image on the ABC website
- notonline_url – not sure...
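Two of the points above lend themselves to a short sketch: turning a work_id into a link, and checking which of the usually populated fields are empty in a record. The URL pattern `https://trove.nla.gov.au/work/<work_id>` is an assumption about how Trove addresses works, the helper names are hypothetical, and the sample record is a made-up placeholder.

```python
# Fields that, according to the notes above, usually have values.
USUALLY_PRESENT = ("title", "date", "creator", "contributor", "isPartOf")


def work_url(work_id):
    """Build a link to a work in Trove (assumed URL pattern)."""
    return f"https://trove.nla.gov.au/work/{work_id}"


def missing_fields(record, expected=USUALLY_PRESENT):
    """List the expected fields that are absent or empty in a record."""
    return [field for field in expected if not record.get(field)]


# Made-up placeholder record, not real harvested data.
sample_record = {
    "work_id": "12345",
    "version_id": "67890",
    "title": "Example segment",
    "date": "2001-01-01",
    "creator": "",
    "contributor": [],
}

print(work_url(sample_record["work_id"]))
print(missing_fields(sample_record))
```

Treating empty strings and empty lists as missing is a design choice made here for illustration; the harvested files may represent empty values differently.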