Matching Cultural Heritage items to Wikipedia

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/4cj6jyq8j2q3

Abstract

Digitised Cultural Heritage (CH) items usually have short descriptions and lack rich contextual information. Wikipedia articles, by contrast, include in-depth descriptions and links to related articles, which motivates enriching CH items with information from Wikipedia. In this paper we explore the feasibility of finding matching articles in Wikipedia for a given Cultural Heritage item. We manually annotated a random sample of items from Europeana and performed a qualitative and quantitative study of the issues and problems that arise, showing that each kind of CH item is different and needs a nuanced definition of what "matching article" means. In addition, we test a well-known wikification (also known as entity linking) algorithm on the task. Our results indicate that a substantial number of items can be effectively linked to their corresponding Wikipedia article.

Details

Paper ID
lrec2012-main-609
Pages
pp. 1729-1735
BibKey
agirre-etal-2012-matching
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21-27 May 2012

Authors

  • Eneko Agirre
  • Ander Barrena
  • Oier Lopez de Lacalle
  • Aitor Soroa
  • Samuel Fernando
  • Mark Stevenson
