LREC 2012 (Main Conference)

Accessing and standardizing Wiktionary lexical entries for the translation of labels in Cultural Heritage taxonomies

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI: 10.63317/4gxaent3d4sv

Abstract

We describe the usefulness of Wiktionary, the freely available web-based lexical resource, in providing multilingual extensions to catalogues that serve content-based indexing of folktales and related narratives. We develop conversion tools between Wiktionary and TEI, using ISO standards (LMF, MAF), to make such resources available to both the Digital Humanities community and the Language Resources community. The converted data can be queried via a web interface, and the workflow tools are to be released under an open-source license. We report on the current state and functionality of our tools and analyse some shortcomings of Wiktionary, as well as potential domains of application.

Details

Paper ID: lrec2012-main-487
Pages: pp. 2511-2514
BibKey: declerck-etal-2012-accessing
Editor: N/A
Publisher: European Language Resources Association (ELRA)
ISSN: 2522-2686
ISBN: 978-2-9517408-7-7
Conference: Eighth International Conference on Language Resources and Evaluation
Location: Istanbul, Turkey
Date: 21-27 May 2012

Authors

  • Thierry Declerck
  • Karlheinz Mörth
  • Piroska Lendvai
