LREC 2014 (main conference)

Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/4m4uqkipy5un

Abstract

Recent studies in metaphor extraction across several languages (Broadwell et al., 2013; Strzalkowski et al., 2013) have shown that word imageability ratings are highly correlated with the presence of metaphors in text. Information about imageability of words can be obtained from the MRC Psycholinguistic Database (MRCPD) for English words and Léxico Informatizado del Español Programa (LEXESP) for Spanish words, which is a collection of human ratings obtained in a series of controlled surveys. Unfortunately, word imageability ratings were collected for only a limited number of words (9,240 in English and 6,233 in Spanish) and are entirely unavailable for the other two languages studied, Russian and Farsi. The present study describes an automated method for expanding the MRCPD by extending imageability ratings to the synonyms and hyponyms of existing MRCPD words, as identified in WordNet. The result is an expanded database, MRCPD+, with imageability scores for more than 100,000 words. The appropriateness of this expansion process is assessed by examining the structural coherence of the expanded set and by validating the expanded lexicon against human judgment. Finally, the performance of the metaphor extraction system is shown to improve significantly with the expanded database. This paper describes the process for the English MRCPD+ and the resulting lexical resource. The process is analogous for other languages.
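
The expansion idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the seed ratings, the toy synonym and hyponym tables, and the function name `expand_ratings` are all hypothetical, and averaging the ratings of the contributing seed words is an assumption, since the abstract does not say how scores are combined. A small in-memory dictionary stands in for WordNet so the example runs without external data.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical seed ratings (made-up values, NOT taken from the MRCPD).
SEED_RATINGS = {"dog": 600, "animal": 550, "idea": 300}

# Toy stand-ins for WordNet relations (hypothetical, for illustration only).
SYNONYMS = {"dog": ["hound"], "idea": ["notion", "thought"]}
HYPONYMS = {"dog": ["puppy", "poodle"], "animal": ["dog", "hound"]}


def expand_ratings(seed, synonyms, hyponyms):
    """Propagate seed ratings to synonyms and hyponyms of rated words.

    Assumption: a new word that is reachable from several seed words
    receives the mean of their ratings. Words that already have a human
    rating keep it unchanged.
    """
    inherited = defaultdict(list)
    for word, rating in seed.items():
        related = synonyms.get(word, []) + hyponyms.get(word, [])
        for candidate in related:
            if candidate not in seed:  # never overwrite a human rating
                inherited[candidate].append(rating)

    expanded = dict(seed)
    for word, ratings in inherited.items():
        expanded[word] = mean(ratings)
    return expanded


expanded = expand_ratings(SEED_RATINGS, SYNONYMS, HYPONYMS)
for word in sorted(expanded):
    print(word, expanded[word])
```

In this toy setup, "hound" is reachable from both "dog" (as a synonym) and "animal" (as a hyponym), so it inherits the mean of the two seed ratings, while "dog" keeps its original rating. A real expansion would read the relations from WordNet and would need the coherence and human-judgment checks mentioned in the abstract.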

Details

Paper ID
lrec2014-main-186
Pages
pp. 2800-2805
BibKey
liu-etal-2014-automatic-expansion
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 - 31 May 2014

Authors

  • Ting Liu
  • Kit Cho
  • G. Aaron Broadwell
  • Samira Shaikh
  • Tomek Strzalkowski
  • John Lien
  • Sarah Taylor
  • Laurie Feldman
  • Boris Yamrom
  • Nick Webb
  • Umit Boz
  • Ignacio Cases
  • Ching-sheng Lin

Links