Visualisation and Exploration of High-Dimensional Distributional Features in Lexical Semantic Classification

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

Abstract

Vector space models and distributional information are widely used in NLP. The models typically rely on complex, high-dimensional objects. We present an interactive visualisation tool to explore salient lexical-semantic features of high-dimensional word objects and word similarities. Most visualisation tools provide only one low-dimensional map of the underlying data, so they are not capable of retaining the local and the global structure. We overcome this limitation by providing an additional trust-view to obtain a more realistic picture of the actual object distances. Additional tool options include the reference to a gold standard classification, the reference to a cluster analysis as well as listing the most salient (common) features for a selected subset of the words.

Resources

Details

Paper ID

lrec2016-main-191

Pages

pp. 1202-1206

DOI

10.63317/36cj8o4iy4h5

BibKey

koper-etal-2016-visualisation

Editors

Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis

Publisher

European Language Resources Association (ELRA)

ISSN

2522-2686

ISBN

978-2-9517408-9-1

Conference

Tenth International Conference on Language Resources and Evaluation

Location

Portorož, Slovenia

Date

23 - 28 May 2016

Authors

MK
Maximilian Köper
MZ
Melanie Zaiß
QH
Qi Han
SK
Steffen Koch
SS
Sabine Schulte im Walde

Links

URL

DOI