← Home Annotations Ontology Ontology Docs KG Explorer ObliquER About Demo KB SPARQL Publications About
Annotations
Ready
Annotation Corpus
Gold-standard annotation corpus for artwork entity recognition in Vasari's The Lives, manually annotated in INCEpTION with the ObliquER annotation schema.
16
Biographies
270
Paragraphs
2438
Mentions
1107
Unique Entities
Annotation Schema

Each mention is classified into one of four types following the ObliquER annotation schema. Mentions are anchored to exact character positions within paragraphs and linked to entities via Wikidata QIDs or Viewsari-internal OOKB IRIs.

Explicit — artwork named directly
Implicit — artwork described without title
Coreferent — anaphoric reference to earlier mention
Generic — non-specific artwork reference
Sampling and Annotation Process
Paragraphs were selected via stratified sampling (random seed 42), capped at 10–25 paragraphs per biography depending on corpus size. Annotations follow the ObliquER pipeline schema with character-level offsets, entity IDs, Wikidata linking, and OOKB classification.
Annotated Biographies
Biography Vol. Paragraphs Mentions Type Distribution Wikidata OOKB
Cavallini 1 7 93
14 (15%) 26 (28%)
Cimabue 1 14 116
23 (20%) 35 (30%)
Giotto 1 15 203
44 (22%) 52 (26%)
Brunelleschi 2 16 272
28 (10%) 50 (18%)
Alberti 3 11 86
15 (17%) 20 (23%)
Baldovinetti 3 8 47
6 (13%) 16 (34%)
Botticelli 3 14 110
41 (37%) 16 (15%)
Gherardo 3 5 66
10 (15%) 12 (18%)
Ghirlandaio 3 19 235
53 (23%) 64 (27%)
Pollaiuolo 3 14 127
12 (9%) 52 (41%)
Rosselli 3 4 62
16 (26%) 17 (27%)
Verrocchio 3 17 216
31 (14%) 42 (19%)
Lippi 4 9 118
21 (18%) 50 (42%)
Pontormo 7 53 119
19 (16%) 27 (23%)
Michelangelo 9 49 395
59 (15%) 57 (14%)
Titian 9 15 173
42 (24%) 30 (17%)
Components
ObliquER Pipeline
LLM-based entity recognition and linking pipeline designed for implicit and long-tail entities. Features formal task definitions grounded in the Viewsari ontology, prompt engineering for zero-shot and few-shot extraction, dynamic chunking, entity linking with OOKB tagging, and coreference clustering.
Viewsari Knowledge Graph
The populated KG covers the full Lives corpus: person entities from the index of names, co-occurrence instances with PMI and Dice scores, Web Annotation layer for paragraph-level mention anchoring, provenance activities for each extraction run, and FRBR-level bibliographic metadata.