Skip to content

Exploring the Internet Archive's CDX API

Works with IA

Some web archives provide indexes of the web pages they've archived through an API. These CDX APIs can be queried by a number of fields including capture date, url, and mimetype. This notebook looks in detail at the data provided by the Internet Archive's CDX API.

Run live on Binder

Other options

Additional documentation

Getting help

Cite as

Sherratt, Tim & Jackson, Andrew. (2022). GLAM-Workbench/web-archives (version v1.1.0). Zenodo. https://doi.org/10.5281/zenodo.6450762

Section sponsor

The Web Archives section of the GLAM Workbench is sponsored by the British Library.