To foster the reuse of the metadata that is published in Europeana, our offer includes compressed zip files containing the metadata of all objects in Europeana's repository readily available for bulk download. These files are generated on Sunday evening each week using our harvesting solution, which guarantees that the data is as up-to-date as possible while making sure our harvesting service is working as expected.
FTP listing and file structure
All the files are available in our FTP server at ftp://download.europeana.eu/dataset/. You can connect to an FTP server by using software programs like FileZilla, or you can connect to an FTP server as a Shared Network Location or using the Command Prompt. If you are using a Linux OS, you can run the command: wget -m ftp://download.europeana.eu/dataset/XML
Information on how to login to the FTP Server:
Host: | ftp://download.europeana.eu/dataset/ |
User: | anonymous |
Password: | [leave blank] |
Port: | 21 |
The structure in the FTP server is organised in the following way:
A directory for each available format. For the time being only two formats are available: XML for the RDF-XML format and TTL for Turtle.
Each directory then lists a compressed zip file for each Dataset in Europeana, where the name of the file is the dataset identifier (e.g. 2021672.zip). Under this directory will be a respective MD5 checksum file under the file extension .md5sum (e.g. 2021672.zip.md5sum) which can be used to validate the file upon download.
On each compressed zip file there will be a file for each Europeana metadata record where the name of the file will be the local identifier of the Record in Europeana.
Example
The data for the Girl with the Pearl Earring from the Mauritshuis encoded using the RDF-XML format will be available at the following URL ftp://download.europeana.eu/dataset/XML/2021672.zip . To find to which dataset any record belongs, you can check the URL of the record (for the Girl with the pearl earring, the Europeana item URL is https://www.europeana.eu/nl/item/2021672/resource_document_mauritshuis_670 ), or you can find the dataset name next to the field 'Collection Name' in the 'More Metadata' tab on the item page.
The FTP server will provide you with a ZIP file with the metadata for all the objects in the dataset with the dataset number '2021672' if you request the URL ftp://download.europeana.eu/dataset/XML/2021672.zip. Unzipping the ZIP File will give you an XML file for every digital cultural heritage object. You can find the metadata for the “Girl with the Pearl Earring” in the ZIP file with the ID of that object, 'resource_document_mauritshuis_670' in the XML file named "resource_document_mauritshuis_670.xml"