Back to Main Conference 2022
LREC 2022main

Entity Linking over Nested Named Entities for Russian

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4s884u39hjm2

Abstract

In this paper, we describe entity linking annotation over nested named entities in the recently released Russian NEREL dataset for information extraction. The NEREL collection is currently the largest Russian dataset annotated with entities and relations. It includes 933 news texts with annotation of 29 entity types and 49 relation types. The paper describes the main design principles behind NEREL’s entity linking annotation, provides its statistics, and reports evaluation results for several entity linking baselines. To date, 38,152 entity mentions in 933 documents are linked to Wikidata. The NEREL dataset is publicly available.

Details

Paper ID
lrec2022-main-474
Pages
pp. 4458-4466
BibKey
loukachevitch-etal-2022-entity
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • NL

    Natalia Loukachevitch

  • PB

    Pavel Braslavski

  • VI

    Vladimir Ivanov

  • TB

    Tatiana Batura

  • SM

    Suresh Manandhar

  • AS

    Artem Shelmanov

  • ET

    Elena Tutubalina

Links