Annotations Ontology Ontology overview WIDOCO documentation Knowledge graph explorer Ground truth-based KG ObliquER-based KG ObliquER About Demo Knowledge base SPARQL endpoint Publications About Ready
Viewsari
A provenance-aware knowledge graph built from Giorgio Vasari's Le Vite de' piu eccellenti pittori, scultori, e architettori (1568), developed at KIT / FIZ Karlsruhe (ISE group) as part of a doctoral dissertation.
Ground truth
16
Biographies
270
Paragraphs
6
Volumes
ObliquER
825
Mentions
443
Entities
Knowledge extracted from interpretive texts is not discovered but constructed; therefore, knowledge graphs must model the construction process itself.

Cultural heritage materials, once digitized, remain largely unstructured: human-readable, but not systematically queryable, linkable, or analyzable. Large language models have improved access to such content, yet they cannot guarantee epistemic traceability. Viewsari addresses this gap by treating knowledge extraction from historical text as an interpretive activity, recording agents, prompts, software versions, and source contexts for each extracted statement.

The approach is neuro-symbolic: the Viewsari ontology (symbolic component) provides a formal schema that structures and constrains extraction tasks assigned to a large language model (neural component). Vasari's Lives (1568) serves as testbed — a founding document of art history characterized by implicit entity references, attributions presented as facts, and long-tail entities absent from standard knowledge bases.

Research Questions
RQ1 In what way can semantic technologies model the heterogeneous content, provenance, and interpretive complexity of historical texts?
The Viewsari ontology formalizes a three-layer model covering bibliographic structure, document components, and extracted entities with full PROV-O provenance. Every mention, entity, and extraction run is traceable back to its source paragraph and the agent that produced it.
RQ2 How can NLP methods, and LLMs in particular, be used to extract and link both explicitly and implicitly mentioned entities?
The ObliquER pipeline uses LLM-based named entity recognition with few-shot and ontology-guided prompting strategies. Extraction runs are recorded as per-paragraph PROV activities with full prompt text, timestamps, and model provenance. Entity linking from the ground truth maps mentions to Wikidata or OOKB identifiers.
RQ3 What are the common challenges in modeling complex, multilingual historical sources, and how can a generalizable approach be drawn?
Vasari's text features implicit references, long-tail entities absent from standard knowledge bases, and attributions presented as facts. The Viewsari approach addresses these through OOKB entity handling, coreference resolution across biographies, and a provenance-first KG design that separates extraction from interpretation — principles applicable to other cultural heritage corpora.
Knowledge Graph Preview
Person Artwork Co-occurrence Biography
Annotation Viewer
Browse annotated biographies with facsimile pages, entity mentions, provenance graphs, and Wikidata links.
16 biographies →
Viewsari Ontology — An Introduction
A guided overview of the three-layer OWL ontology: bibliographic structure, document components, and extracted entities with full provenance.
Overview →
Ontology Documentation
Formal WIDOCO-generated reference with complete class hierarchy, property definitions, and cross-references.
OWL / RDF-XML →
KG Explorer
Interactive graph visualization of the Viewsari Knowledge Graph. Explore persons, co-occurrences, and biographies as a force-directed network.
378K+ triples →
Annotations
Gold-standard annotation corpus: 16 biographies, 2,438 artwork mentions with explicit, implicit, coreferent, and generic types.
ObliquER corpus →
Publications
Peer-reviewed publications from the Viewsari project: ontology design, extraction provenance, NFDI4Memory, and more.
Dissertation + papers →
Giorgio Vasari