Summary of the paper

Title Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Authors Els Lefever, Veronique Hoste and Martine De Cock
Abstract Wikipedia pages typically contain inter-language links to the corresponding pages in other languages. These links, however, are often incomplete. This paper describes a set of experiments in which the viability of discovering such missing inter-language links for ambiguous nouns by means of a cross-lingual Word Sense Disambiguation approach is investigated. The input for the inter-language link detection system is a set of Dutch pages for a given ambiguous noun and the output of the system is a set of links to the corresponding pages in three target languages (viz. French, Spanish and Italian). The experimental results show that although it is a very challenging task, the system succeeds to detect missing inter-language links between Wikipedia documents for a manually labeled test set. The final goal of the system is to provide a human editor with a list of possible missing links that should be manually verified.
Topics Word Sense Disambiguation, Multilinguality
Full paper Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation
Bibtex @InProceedings{LEFEVER12.508,
  author = {Els Lefever and Veronique Hoste and Martine De Cock},
  title = {Discovering Missing Wikipedia Inter-language Links by means of Cross-lingual Word Sense Disambiguation},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }
Powered by ELDA © 2012 ELDA/ELRA