Sem TEI 2 - Semantic TEI 2: Semantic text publishing as open research data for a broader audience
People
(Project lead)
Abstract
This project builds upon the success of the previous Semantic TEI initiative, which established a semantic data model for publishing scholarly texts as Linked Data. While the initial project developed an ETL (extract, transform, load) pipeline for transforming TEI data into RDF, its usability remained limited to technically skilled users. The proposed project seeks to generalize the ETL pipeline for broader application and integrate the Semantic TEI model within Geovistory, a research and data publication environment designed to support Open Research Data (ORD) practices.
Key objectives include: (1) expanding ETL pipeline applicability by analyzing TEI datasets from multiple digital editions, ensuring adaptability to diverse textual corpora; (2) developing TEI-specific SKOS vocabularies to enhance interoperability and semantic structuring; (3) integrating the Semantic TEI model into the OntoME platform to facilitate alignment with Geovistory’s ontology ecosystem; and (4) creating a visualization tool within Geovistory that allows for intuitive exploration of enriched textual data.
This initiative addresses persistent challenges in digital scholarly editing, including low interoperability, static text representation, and limited reuse of TEI-based corpora. By leveraging Linked Data principles and Geovistory’s graph-based knowledge environment, the project advances FAIR (Findable, Accessible, Interoperable, Reusable) data practices in the humanities. Additionally, community engagement through dedicated workshops will ensure alignment with the needs of digital edition projects. The outcomes will be documented, published in open repositories, and disseminated via academic platforms to promote wider adoption and long-term sustainability.