Back to Main Conference 2016
LREC 2016main

Constructing a Norwegian Academic Wordlist

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/4r3x5whzbjxa

Abstract

We present the development of a Norwegian Academic Wordlist (AKA list) for the Norwegian Bokmäl variety. To identify specific academic vocabulary we developed a 100-million-word academic corpus based on the University of Oslo archive of digital publications. Other corpora were used for testing and developing general word lists. We tried two different methods, those of Carlund et al. (2012) and Gardner & Davies (2013), and compared them. The resulting list is presented on a web site, where the words can be inspected in different ways, and freely downloaded.

Details

Paper ID
lrec2016-main-232
Pages
pp. 1457-1462
BibKey
johannessen-etal-2016-constructing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • JJ

    Janne M Johannessen

  • AS

    Arash Saidi

  • KH

    Kristin Hagen

Links