Title

Clips, a Multi-level Italian Computational Lexicon: a Glimpse to Data

Authors

Nilda Ruimy (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Monica Monachini (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Raffaella Distante (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Elisabetta Guazzini (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Stefano Molino (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Marisa Ulivieri (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Nicoletta Calzolari (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Antonio Zampolli (Istituto di Linguistica Computazionale, CNR, Pisa, Via Moruzzi, 1 - 56124 Pisa - ITALY)

Session

WO6: Semantic Lexicons

Abstract

CLIPS is a multi-layered Italian computational lexicon based on the PAROLE-SIMPLE model. In this paper we briefly recall the main characteristics of the model and devote our attention to issues emerging from the encoding of large quantities of data, especially in relation to those types of syntactic and semantic information specific to our lexicon and that reflect innovative features of the underlying model. At syntactic level, we show how alternating structures may be encoded in a linguistically more elegant way by using framesets. We illustrate the connection between syntactic and semantic information, and show how the SIMPLE Italian lexicon approach to predicate selection has been refined in CLIPS. At semantic level, we illustrate the richness of information types encoded in a word sense description and the way such a wealth of data can be exploited. We stress in particular the expressive power of the Extended Qualia Structure yet mentioning some of its problematic aspects. We show that queries on qualia relations allow to retrieve lexical collocates, to extract domain specific information, semantic networks, and help interpreting modifying PPs in complex nominals. Finally, we show that features, which cut across the type hierarchy, have a stronger expressive power with respect to semantic types in identifying selectional preferences.

Keywords

Multi-Functional, Multi-Layered lexicon, Lexical semantics, Syntax, Qualia structure

Full Paper

197.pdf