HomeLREC 2022WorkshopsISAlrec2022-ws-isa-17
Back to ISA 2022
LREC 2022workshop

Annotating Japanese Numeral Expressions for a Logical and Pragmatic Inference Dataset

Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation within LREC2022

DOI:10.63317/4s9cf7onand4

Abstract

Numeral expressions in Japanese are characterized by the flexibility of quantifier positions and the variety of numeral suffixes. However, little work has been done to build annotated corpora focusing on these features and datasets for testing the understanding of Japanese numeral expressions. In this study, we build a corpus that annotates each numeral expression in an existing phrase structure-based Japanese treebank with its usage and numeral suffix types. We also construct an inference test set for numerical expressions based on this annotated corpus. In this test set, we particularly pay attention to inferences where the correct label differs between logical entailment and implicature and those contexts such as negations and conditionals where the entailment labels can be reversed. The baseline experiment with Japanese BERT models shows that our inference test set poses challenges for inference involving various types of numeral expressions.

Details

Paper ID
lrec2022-ws-isa-17
Pages
pp. 127-132
BibKey
koyano-etal-2022-annotating
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation within LREC2022
Location
undefined, undefined
Date
20 June 2022 25 June 2022

Authors

  • KK

    Kana Koyano

  • HY

    Hitomi Yanaka

  • KM

    Koji Mineshima

  • DB

    Daisuke Bekki

Links