HomeLREC 2020WorkshopsGLOBALEXlrec2020-ws-globalex-09
Back to GLOBALEX 2020
LREC 2020workshop

Towards a Swedish Roget-Style Thesaurus for NLP

Proceedings of the 2020 Globalex Workshop on Linked Lexicography

DOI:10.63317/3v3ps4zkfmfm

Abstract

Bring’s thesaurus (Bring) is a Swedish counterpart of Roget, and its digitized version could make a valuable language resource for use in many and diverse natural language processing (NLP) applications. From the literature we know that Roget-style thesauruses and wordnets have complementary strengths in this context, so both kinds of lexical-semantic resource are good to have. However, Bring was published in 1930, and its lexical items are in the form of lemma–POS pairings. In order to be useful in our NLP systems, polysemous lexical items need to be disambiguated, and a large amount of modern vocabulary must be added in the proper places in Bring. The work presented here describes experiments aiming at automating these two tasks, at least in part, where we use the structure of an existing Swedish semantic lexicon – Saldo – both for disambiguation of ambiguous Bring entries and for addition of new entries to Bring.

Details

Paper ID
lrec2020-ws-globalex-09
Pages
pp. 53-60
BibKey
zechner-borin-2020-towards
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2020 Globalex Workshop on Linked Lexicography
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • NZ

    Niklas Zechner

  • LB

    Lars Borin

Links