HomeLREC 2026WorkshopsHTRESlrec2026-ws-htres-01
Back to HTRES 2026
LREC 2026workshop

Integrating TEI Publication, Guided Exploration, and Vector Databases for Semantic Search in the Voci dall’Inferno Project

Proceedings of The Second Workshop on Holocaust Testimonies as Language Resources (HTRes)

DOI:10.63317/2axve8uynrdb

Abstract

This paper presents recent advances toward an integrated framework that combines TEI-based digital publishing with embedding-based semantic search to support the preservation, exploration and analysis of Holocaust survivor testimonies. The corpus includes written and oral sources and preserves them within a XML-TEI model supported by an ODD customization that preserves provenance, structure and interpretability. A dedicated web application developed within the eXistdb platform provides guided access to the digital corpus and supports the management, visualization, and exploration of the encoded data. The project aims to investigate a specific research goal: to verify the presence of references to the Divine Comedy by Dante within Holocaust testimonies. To this end, we implement a semantic retrieval component based on SentenceTransformers’ embeddings and a vector database, enabling the discovery of both literal and non-literal Dantean passages within the testimonies. The paper presents the advances achieved toward this objective and the ethical constraints shaping access policies, resulting in a sustainable archive and a reproducible methodology for intertextual research in sensitive historical collections.

Details

Paper ID
lrec2026-ws-htres-01
Pages
pp. 1-11
BibKey
delgrosso-etal-2026-integrating
Editors
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of The Second Workshop on Holocaust Testimonies as Language Resources (HTRes)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • AD

    Angelo Mario Del Grosso

  • EM

    Elvira Mercatanti

  • CC

    Carla Congiu

  • MR

    Marina Riccucci

Links