This website uses cookies to ensure you get the best experience. By clicking or navigating the site you agree to allow our collection of information through cookies. More info

Tech

Posted on Tuesday August 1, 2017

Updated on Monday March 22, 2021

Data Quality Committee

Quality is a key priority for our whole community! The Data Quality Committee works to address key data quality issues over time.

Formally defined as a Europeana Network and EuropeanaTech Working Group, the Data Quality Committee is a standing committee that works on the various facets of the data quality challenge over time with a particular focus on reuse and discovery of cultural heritage objects. We believe it is crucial to tackle data quality issues at every level of the data exchange chain from its creation to its publication. We have gathered together experts from various backgrounds (metadata experts, software developers, search and retrieval experts) to help us capture all the issues.

Data Quality Committee Mission Statement

Work areas as well as specific tasks will be defined and prioritized as the Committee sees fits and regularly reported and submitted for ratification to the community, notably the Europeana Aggregators' Forum. Items such as mandatory elements for ingestion of EDM data, data checking and normalisation, data completeness have been already added on the menu. The Committee's mission statement is available here.

Discovery - User scenarios and their metadata requirements

We defined our main requirements in terms of discovery and information-retrieval requirements. A series of usage scenarios have been created reflecting information-access user needs (based on the Europeana user personas), listing current metadata issues for a given scenario and then proposing future actions. These scenarios focus specifically on metadata and are not tackling any challenges regarding the user interface or the user experience in Europeana Collections.

Europeana Publishing Framework - Metadata Quality Tiers

The Data Quality Committee has developed, together with aggregators, the metadata component of the Europeana Publishing Framework, based on three quality "pillars". The first one is based on the languages in which metadata records are provided, following Europeana's multilingual strategy. The second follows the recommendations of the Discovery - User Scenarios and relies on "enabling metadata elements". These enabling elements are desirable but optional - including them supports functionalities for a specific set of usage scenarios, beyond the fundamental minimum for all metadata descriptions defined by mandatory elements. The third pillar is based on the use of "contextual classes" that provide richer semantics to metadata. More information can be found here.

Representing events in EDM

Metadata on events can bring important information about the history of cultural heritage objects, especially when multiple events (e.g. creation, modification, destruction) can help to distinguish between different contexts. We have written a report that looks at existing events metadata from the community and provides recommendations about supporting events in EDM. This report can be of interest to all Europeana data partners that hold events data, but it may also benefit other non-Europeana aggregation initiatives. You can download the report and the associated documentation through the folder below. 

Folder

Representing events in EDM - documentation

Open folder

Problem patterns

Metadata problems can occur in many records. We're actively identifying patterns that reflect metadata quality issues such as redundant or incomplete information, lack of normalization and misuse of metadata fields. A work in progress list of these metadata problem patterns can be found here.

Multilingual saturation score

The multilingual saturation score is a score for multilinguality which can be applied on statement, property or record level. We defined a simplified schema which is the basis for the measurement assuming that each statement in a property can have one of the following values: a literal, a literal with a language tag, a URI (ideally to a controlled vocabulary). Learn more in this presentation. Note that the implementation work described in this presentation is still ongoing and is subject to changes

Presentations and publications

Folder

Documents

Open folder

Chairs

Filter by
NameCompanyCountryDomain
Antoine IsaacCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Adina CiocoiuCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage

Members

  • Fiona Fieldsend, DigitalNZ
  • Dulce Fontes, Biblioteca Nacional de Portugal
Filter by
NameCompanyCountryDomain
Valentine CharlesCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Agathi PapanotiCompany: National Documentation Centre, GreeceCountry: GreeceDomain: Academic Research
Kate FernieCompany: 2Culture AssociatesCountry: United KingdomDomain: Creative industries
Tom MilesCompany: The British LibraryCountry: United KingdomDomain: Cultural Heritage
Werner BailerCompany: Joanneum ResearchCountry: AustriaDomain: Other
Eleonore EmsbachCompany: German Digital LibraryCountry: GermanyDomain: Cultural Heritage
Cosmina BertaCompany: German Digital LibraryCountry: GermanyDomain: Cultural Heritage
Jonathan BlokCompany: Netherlands Institute for Sound and VisionCountry: NetherlandsDomain: Other
Henning ScholzCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Haris GeorgiadisCompany: National Documentation Centre, GreeceCountry: GreeceDomain: Academic Research
Jef MallietCompany: Flemish Government. dept. of Culture, Youth and MediaCountry: BelgiumDomain: Government
Nuno FreireCompany: NoneCountry: NetherlandsDomain: Cultural Heritage
Péter KirályCompany: GWDGCountry: GermanyDomain: Education
Marco ScarbaciCompany: FreelanceCountry: ItalyDomain: Cultural Heritage
Mónica MarreroCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Maria Teresa NataleCompany: Michael Culture AssociationCountry: BelgiumDomain: Academic Research
Hugo ManguinhasCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Eleftheria TsoupraCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Sergiu GordeaCompany: Austrian Institute of Technology GmbHCountry: AustriaDomain: Academic Research
Helena PatrícioCompany: National Library of PortugalCountry: PortugalDomain: Cultural Heritage
Marco RendinaCompany: European Fashion Heritage AssociationCountry: ItalyDomain: Cultural Heritage
Albin LarssonCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Fiona MowatCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Maša ŠkrinjarCompany: Europeana FoundationCountry: NetherlandsDomain: Cultural Heritage
Gerda KochCompany: AIT Angewandte Informationstechnik ForschungsgesellschaftCountry: AustriaDomain: Other
Anila AngjeliCompany: National Library of FranceCountry: FranceDomain: Cultural Heritage
Vladimir AlexievCompany: OntotextCountry: BulgariaDomain: Creative industries
Marie-Véronique LeroiCompany: Ministry of Culture and Communication FranceCountry: FranceDomain: Government
Juliane StillerCompany: You, We & DigitalCountry: UnknownDomain: Academic Research
Kristina RoseCompany: DFF - German Film Institute & Film MuseumCountry: GermanyDomain: Cultural Heritage

and

  • Fiona Fieldsend, DigitalNZ
  • Dulce Fontes, Biblioteca Nacional de Portugal

This page was updated in March 2021 to add the sections 'Europeana Publishing Framework - Metadata Quality Tiers', 'Representing events in EDM' and 'Problem patterns'.

top