The term full-text is meant to refer to the combination of the textual representation of the digital object plus where parts of the text is present in the original image (represented as annotations). In the EDM profile, the textual representation of the digital object is referred to as Full-text Resource while the relations between the segments of the text and the coordinates in the image are referred to as Annotations.
Annotation Pages
An Annotation Page contains all the annotations that make up the full-text of a Page (ie. image). It is referred to by the Manifest and can be accessed via the following request.
Request
https://iiif.europeana.eu/presentation/[RECORD_ID]/annopage/[PAGE_ID]
Parameter | Description |
---|
RECORD_ID | The identifier of the record which is composed of the dataset identifier plus a local identifier within the dataset in the form of "/DATASET_ID/LOCAL_ID", for more detail see Europeana ID. |
PAGE_ID | The number of the page in logical sequence starting with 1 for the first page. There can be pages that do not contain any text which will mean that the request will return a HTTP 404. |
Response
The response is a JSON-LD structure composed of the following fields:
Parameter | Datatype | Description |
---|
@context | Array of String (URL) | The URL of the JSON-LD context (always with the values "http://iiif.io/api/presentation/3/context.json" and "https://www.europeana.eu/schemas/context/edm.jsonld"). |
id | String (URI) | The canonical identifier of the Annotation Page. |
type | String | The type of the resource. Always set to "AnnotationPage". |
items | Array (Annotation) | An array containing all the Annotation that are part of this page. |
Annotation |
---|
id | String (URI) | The canonical identifier of the Annotation. |
type | String | The type of the resource. Always set to "FullTextResource". |
motivation | String | The motivation of the annotation, see reference for more information. Always set to "transcribing". |
dcType | String | Represents the granularity level of the Annotation, reflecting levels such as: Page, Block, Line and Word. |
body | Object (Body) | The reference to the transcribed text. |
target | Array (String) | A target can represent an image or just a part of it that is being annotated. For most annotations that are at a level of granularity lower than Page will point to the specific coordinates where the text is found on the image, using the Media Fragments specification. |
Body |
---|
id | String (URL) | The url of the transcribed text (ie. full-text resource) in case of a Page level annotation, or a segment of the transcribed text using in this case the URI Fragment Identifiers for the text/plain Media Type specification. |
language (optional) | String | The language of the segment of the transcription text being annotated. It is represented as a ISO 639 language code. |
Example: Requesting an Annotation Page in v2.1.
https://iiif.europeana.eu/presentation/9200396/BibliographicResource_3000118436165/annopage/1
{
"@context": [ "http://iiif.io/api/presentation/3/context.json", "https://www.europeana.eu/schemas/context/edm.jsonld" ],
"id": "https://iiif.europeana.eu/presentation/9200396/BibliographicResource_3000118436165/annopage/1"
"type": "AnnotationPage",
"items": [
// full text annotation with no language
{
"id": "http://data.europeana.eu/annotation/9200356/BibliographicResource_3000100331503/a4cbbc7a0dc6b056c7bc0",
"type": "Annotation",
"motivation": "transcribing",
"dcType": "Block",
"body": {
"id": "http://data.europeana.eu/fulltext/9200356/BibliographicResource_3000100331503/XPTO#char=0,10"
},
"target": [
"https://iiif.europeana.eu/presentation/9200356/BibliographicResource_3000100331503/canvas/p1#xywh=13,0,16,10"
]
}
, ...
]
}
Fulltext Resource
The edm:FullTextResource represents the transcription of a single page of a Newspaper. A full-text resource can be accessed separately from the Annotation List that it is associated to using the following method.
Request
https://www.europeana.eu/api/fulltext/[RECORD_ID]/[FULLTEXT_ID]
Parameter | Description |
---|
RECORD_ID | The identifier of the record which is composed of the dataset identifier plus a local identifier within the dataset in the form of "/DATASET_ID/LOCAL_ID", for more detail see Europeana ID. |
FULLTEXT_ID | The identifier of the full text resource. |
Response
The response is a JSON-LD structure composed of the following fields:
Parameter | Datatype | Description |
---|
@context | String (URL) | The URL of the JSON-LD context (always with the value "https://www.europeana.eu/schemas/context/edm.jsonld"). |
id | String (URI) | The canonical identifier of the full-text resource. |
type | String | The type of the resource. Always set to "FullTextResource". |
language | String | The more predominante language of the transcription text represented as a ISO 639 language code. Parts of the text may be written in different languages. When that is the case, the language information will be indicated as part of the full-text Annotations. |
value | String | The transcription text. |
Example: Requesting a full-text resource.
https://www.europeana.eu/api/fulltext/9200396/BibliographicResource_3000118435063/8ebb67ccf9f8a1dcc2ea119c60954111
{
"@context": "https://www.europeana.eu/schemas/context/edm.jsonld",
"id": "http://data.europeana.eu/fulltext/9200396/BibliographicResource_3000118435063/8ebb67ccf9f8a1dcc2ea119c60954111"
"type": "FullTextResource",
"language": "nl",
"value": "… De ondergeteekende sedert veie jaren drukker van het met Uit. Oetober 11. vervallene Nieuw A. H. _ E. Blad, heeft de eer te berigten, dat hij, bewogen met het lot van eene menigte huisge zinnen die daardoor plotseling bij den naderenden winter hun be staan hebben verloren, besloten heeft tot de uitgaaf eener nieuwe courant onder de benaming van: Het Amsterdamsche Handels- en Effectenblad. en dat hij daartoe de voorloopige medewerking heeft verkregen van belangstellenden, die van oordeel zijn, dat het bestaan van een dagblad als het gewezen Nieuw A. H. _- E. 81. voor het algemeen … "
}