Cross-Lingual Link Discovery for Under-Resourced Languages

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4qxmcm6nkezn

Abstract

In this paper, we provide an overview of current technologies for cross-lingual link discovery, and we discuss challenges, experiences and prospects of their application to under-resourced languages. We first introduce the goals of cross-lingual linking and associated technologies, and in particular, the role that the Linked Data paradigm (Bizer et al., 2011) applied to language data can play in this context. We define under-resourced languages with a specific focus on languages actively used on the internet, i.e., languages with a digitally versatile speaker community, but limited support in terms of language technology. We argue that for languages for which considerable amounts of textual data and (at least) a bilingual word list are available, techniques for cross-lingual linking can be readily applied, and that these enable the implementation of downstream applications for under-resourced languages via the localisation and adaptation of existing technologies and resources.

Details

Paper ID
lrec2022-main-020
Pages
pp. 181-192
BibKey
rosner-etal-2022-cross
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 - 25 June 2022

Authors

  • Michael Rosner
  • Sina Ahmadi
  • Elena-Simona Apostol
  • Julia Bosque-Gil
  • Christian Chiarcos
  • Milan Dojchinovski
  • Katerina Gkirtzou
  • Jorge Gracia
  • Dagmar Gromann
  • Chaya Liebeskind
  • Giedrė Valūnaitė Oleškevičienė
  • Gilles Sérasset
  • Ciprian-Octavian Truică
