LREC 2012 main

Chinese Whispers: Cooperative Paraphrase Acquisition

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/5c8jggk8j9cw

Abstract

We present a framework for the acquisition of sentential paraphrases based on crowdsourcing. The proposed method maximizes the lexical divergence between an original sentence s and its valid paraphrases by running a sequence of paraphrasing jobs carried out by a crowd of non-expert workers. Instead of collecting direct paraphrases of s, at each step of the sequence workers manipulate semantically equivalent reformulations produced in the previous round. We applied this method to paraphrase English sentences extracted from Wikipedia. Our results show that, when only the most promising paraphrases are kept at each round n (i.e. those most lexically dissimilar from the ones acquired at round n-1), the monotonic increase in divergence makes it possible to collect good-quality paraphrases in a cost-effective manner.
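
The round-based selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the divergence measure (Jaccard distance over word sets), the `paraphrase` callback standing in for a crowdsourcing job, the function names, and the `rounds` and `keep` parameters are all assumptions made for illustration. The sketch also ranks each paraphrase against the previous-round sentence it was derived from, which is one possible reading of "dissimilar from those acquired at round n-1", and it leaves out the check that a paraphrase is semantically valid.

```python
from typing import Callable, List


def lexical_divergence(a: str, b: str) -> float:
    """Jaccard distance over lowercased word sets (an assumed metric)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 0.0
    return 1.0 - len(ta & tb) / len(ta | tb)


def chinese_whispers(
    source: str,
    paraphrase: Callable[[str], List[str]],  # hypothetical stand-in for a crowd job
    rounds: int = 3,
    keep: int = 2,
) -> List[List[str]]:
    """Run up to `rounds` rounds, keeping the `keep` most divergent outputs of each."""
    pool = [source]
    history: List[List[str]] = []
    for _ in range(rounds):
        # Pair every new paraphrase with the previous-round sentence it came from.
        candidates = [(new, old) for old in pool for new in paraphrase(old)]
        if not candidates:
            break
        # Most divergent from its parent first (assumed selection criterion).
        candidates.sort(
            key=lambda pair: lexical_divergence(pair[0], pair[1]), reverse=True
        )
        pool = []
        for new, _ in candidates:
            if new not in pool:  # skip duplicate paraphrases
                pool.append(new)
            if len(pool) == keep:
                break
        history.append(pool)
    return history
```

In this sketch, only the retained paraphrases are passed on to the next round, so workers never see the original sentence after round 1. That mirrors the idea in the abstract of manipulating reformulations from the previous round rather than collecting direct paraphrases of s.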

Details

Paper ID
lrec2012-main-452
Pages
pp. 2659-2665
BibKey
negri-etal-2012-chinese
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 to 27 May 2012

Authors

  • Matteo Negri
  • Yashar Mehdad
  • Alessandro Marchetti
  • Danilo Giampiccolo
  • Luisa Bentivogli

Links