Back to Main Conference 2018
LREC 2018main

A Multilingual Test Collection for the Semantic Search of Entity Categories

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/4ccyf4ku5i8i

Abstract

Humans naturally organise and classify the world into sets and categories. These categories expressed in natural language are present in all data artefacts from structured to unstructured data and play a fundamental role as tags, dataset predicates or ontology attributes. A better understanding of the category syntactic structure and how to match them semantically is a fundamental problem in the computational linguistics domain. Despite the high popularity of entity search, entity categories have not been receiving equivalent attention. This paper aims to present the task of semantic search of entity categories by defining, developing and making publicly available a multilingual test collection comprehending English, Portuguese and German. The test collections were designed to meet the demands of the entity search community in providing more representative and semantically complex query sets. In addition, we also provide comparative baselines and a brief analysis of the results.

Details

Paper ID
lrec2018-main-398
Pages
N/A
BibKey
sales-etal-2018-multilingual
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • JS

    Juliano Efson Sales

  • SB

    Siamak Barzegar

  • WF

    Wellington Franco

  • BB

    Bernhard Bermeitinger

  • TC

    Tiago Cunha

  • BD

    Brian Davis

  • AF

    André Freitas

  • SH

    Siegfried Handschuh

Links