Back to Main Conference 2026
LREC 2026main

MultiCoS: A Multilingual Dataset of Connective Semantics with Context–Sentence Compatibility

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/28ccty7yu9hn

Abstract

We present a multilingual dataset of connective semantics. The dataset contains the semantic annotations of clausal connectives (e.g. and and or in English) from 24 languages, based on our original native-speaker elicitation data. Unlike existing lexica on connectives, the dataset includes systematic evidence for the annotations in the form of context-sentence compatibility judgments, including negative evidence. The paper describes the methodology of data collection and the format of the dataset. We also discuss its potential use cases for the validation of cross-linguistic generalizations, examinations of their potential counterexamples, and for benchmarking felicity judgments by NLU systems.

Details

Paper ID
lrec2026-main-381
Pages
pp. 4861-4871
BibKey
mucha-etal-2026-multicos
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • AM

    Anne Mucha

  • CQ

    Ciyang Qing

  • WU

    Wataru Uegaki

Links