Catalogue entries for the artworks held and the artists represented in the Tate Gallery collection and Tate Archive.
Artwork and Archive entries include
- Date created
- Reference number / Accession number
- Medium description (e.g. Photograph)
- Tate Website URL
- Subject index terms (e.g. Self Portrait - these look to be from a hierarchical controlled vocabulary / ontology / schema)
- Thumbnail URL
- Credit line
- Movements (e.g. Pop Art)
An example entry for an Andy Warhol polaroid: https://github.com/tategallery/collection/blob/master/artworks/ar/003/ar00303-97671.json
Artist entries include
- Date of birth
- Place of birth
- Place of death
- Tate Website URL
Archive items can be anything from posters for exhibitions to items of clothing owned by an artist: the records on the Tate site are fairly dry collection index-card entries.
John Stack has published presentations about the history of and future plans for the collection.
Used in Anger
The data is released as JSON and a flattened CSV, and released on GitHub at https://github.com/tategallery/collection
The artworks are referenced by Accession Number, with each artwork in a separate JSON file. The flattened CSV can be used to index the JSON files.
The documentation there is excellent - they get bonus points for the use of the words 'grok' and 'artistic pivot tables'. Someone in there gets it.
70,000 artworks and 3,500 artists.
CC0 for data, but images are not included. Polite request for attribution in the usage guidelines. Some very limited use of Tate copyright images might be possible under the website terms and conditions (which make allowance for non commercial and educational use). However, these also explicitly ban scraping and data mining, reproduction on social media platforms or 'considerable public dissemination ... of website content via any third party platforms'.
Is this article out-of-date? Update it via GitHub.