Summary of the paper

Title Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Authors Wolodja Wentland, Johannes Knopp, Carina Silberer and Matthias Hartung
Abstract In this paper, we present HeiNER, the multilingual Heidelberg Named Entity Resource. HeiNER contains 1,547,586 disambiguated English Named Entities together with translations and transliterations to 15 languages. Our work builds on the approach described in (Bunescu and Pasca, 2006), yet extends it to a multilingual dimension. Translating Named Entities into the various target languages is carried out by exploiting crosslingual information contained in the online encyclopedia Wikipedia. In addition, HeiNER provides linguistic contexts for every NE in all target languages which makes it a valuable resource for multilingual Named Entity Recognition, Disambiguation and Classification. The results of our evaluation against the assessments of human annotators yield a high precision of 0.95 for the NEs we extract from the English Wikipedia. These source language NEs are thus very reliable seeds for our multilingual NE translation method.
Language Multiple languages
Topics Named Entity recognition, Lexicon, lexical database, Multilinguality
Full paper Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration
Slides -
Bibtex @InProceedings{WENTLAND08.816,
  author = {Wolodja Wentland, Johannes Knopp, Carina Silberer and Matthias Hartung},
  title = {Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }

Powered by ELDA © 2008 ELDA/ELRA