Using OntoLex-Lemon for Representing and Interlinking Lexicographic Collections of Bavarian Dialects

Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)

DOI:10.63317/5hji92q2i2qz

Abstract

This paper describes ongoing work on converting the lexicographic collection of a non-standard German language dataset (Bavarian Dialects) into a Linguistic Linked Open Data (LLOD) format. The collection is divided into two parts: the questionnaire dataset (DBÖ), which contains details of the questionnaires, questions, collectors, paper slips, etc., and the lexical dataset (WBÖ), which contains the lexical entries (answers) collected in response to the questions. In its current form, the lexical dataset is available in TEI/XML format, separately from the questionnaire dataset. This paper presents the mapping of the lexical entries from TEI/XML into LLOD using the OntoLex-Lemon model and shows how this transformation produces a lexicon for Bavarian Dialects. It also presents the approach used to interlink the original questions with the lexical entries. The resulting lexicon complements the questionnaire dataset, which is already in an LLOD format, by semantically interlinking the original questions with the answers and vice versa.
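
To make the kind of mapping described above concrete, the following is a minimal sketch in Python, not the authors' pipeline. It reads one TEI entry and emits OntoLex-Lemon triples in Turtle. The sample entry layout, the `ref type="question"` element, the `http://example.org/lexicon/` base URI, and the `ex:isAnswerTo` linking property are all hypothetical placeholders chosen for illustration. Only the `ontolex:` terms and the ISO 639-3 language tag `bar` (Bavarian) are taken from published standards.

```python
import xml.etree.ElementTree as ET

TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical TEI entry: the element layout and identifiers are illustrative only.
SAMPLE_TEI = """
<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="e1">
  <form type="lemma"><orth>Apfel</orth></form>
  <ref type="question" target="q42"/>
</entry>
"""


def escape_literal(text: str) -> str:
    """Escape backslashes and double quotes for a Turtle string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def entry_to_turtle(tei_xml: str, base: str = "http://example.org/lexicon/") -> str:
    """Map one TEI entry to a small OntoLex-Lemon description in Turtle."""
    entry = ET.fromstring(tei_xml.strip())
    entry_id = entry.get(XML_ID)
    lemma = entry.findtext("tei:form[@type='lemma']/tei:orth", namespaces=TEI_NS)
    question = entry.find("tei:ref[@type='question']", TEI_NS)

    lines = [
        "@prefix ontolex: <http://www.w3.org/ns/lemon/ontolex#> .",
        f"@prefix ex: <{base}> .",
        "",
        f"ex:{entry_id} a ontolex:LexicalEntry ;",
        f"    ontolex:canonicalForm ex:{entry_id}_form",
    ]
    if question is not None:
        # ex:isAnswerTo is a placeholder property, not part of OntoLex-Lemon.
        lines[-1] += " ;"
        lines.append(f"    ex:isAnswerTo ex:{question.get('target')}")
    lines[-1] += " ."
    lines += [
        f"ex:{entry_id}_form a ontolex:Form ;",
        f'    ontolex:writtenRep "{escape_literal(lemma)}"@bar .',
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(entry_to_turtle(SAMPLE_TEI))
```

In this sketch the link from the lexical entry back to the question is what would let a query move from an answer to the original question. The actual vocabulary used for that link in the published dataset may differ.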

Details

Paper ID
lrec2020-ws-ldl-09
Pages
pp. 61-69
BibKey
abgaz-2020-using
Publisher
European Language Resources Association (ELRA)
Workshop
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)
Date
11 May 2020 to 16 May 2020

Authors

    Yalemisew Abgaz

Links