This partnership, established in 2015, focuses on integrating digital cultural heritage data from Europeana into CLARIN’s research infrastructure for use by scholars, students and all those who are interested in language resources. CLARIN and Europeana also work together to promote Europeana’s mission and resources to the academic community through a number of outreach activities.
CLARIN is the European Infrastructure for Language Resources and Technology. It is composed of 21 national consortium members and has been a Europeana Research partner since 2015.
The first years of the partnership were spent setting up a stable process to evaluate, select and ingest digital cultural heritage from Europeana. Since 2019, CLARIN has structurally harvested Europeana resources into the Virtual Language Observatory (VLO), allowing researchers interested in language resources to reuse them with the variety of tools available on the Language Resource Switchboard. CLARIN’s VLO now holds 275,000 Europeana records, making Europeana the content provider that contributes the highest number of individual records to this platform.
In 2022, this partnership resulted in the publication, on the Social Sciences and Humanities (SSH) Open Marketplace, of Jupyter notebooks that apply natural language processing (NLP) tools to Europeana newspaper full text.
In addition, the Europeana Newspapers full-text resources now form a resource family on the VLO, where they have been incorporated into a custom metadata aggregation with multiple levels of granularity. On this basis, the VLO offers search, filtering and access methods tailored to audiences with a specific interest in language resources and technology.
CLARIN and Europeana Research co-organise a number of outreach activities (workshops, presentations), including at the CLARIN Annual Conference.