Back to Main Conference 2024
LREC-COLING 2024main

JFLD: A Japanese Benchmark for Deductive Reasoning Based on Formal Logic

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/2fb93vacm2mf

Abstract

Large language models (LLMs) have proficiently solved a broad range of tasks with their rich knowledge but often struggle with logical reasoning. To foster the research on logical reasoning, many benchmarks have been proposed so far. However, most of these benchmarks are limited to English, hindering the evaluation of LLMs specialized for each language. To address this, we propose **JFLD** (**J**apanese **F**ormal **L**ogic **D**eduction), a deductive reasoning benchmark for Japanese. JFLD assess whether LLMs can generate logical steps to (dis-)prove a given hypothesis based on a given set of facts. Its key features are assessing pure logical reasoning abilities isolated from knowledge and assessing various reasoning rules. We evaluate various Japanese LLMs and see that they are still poor at logical reasoning, thus highlighting a substantial need for future research.

Details

Paper ID
lrec2024-main-0832
Pages
pp. 9526-9535
BibKey
morishita-etal-2024-jfld
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • TM

    Terufumi Morishita

  • AY

    Atsuki Yamaguchi

  • GM

    Gaku Morio

  • HT

    Hikaru Tomonari

  • OI

    Osamu Imaichi

  • YS

    Yasuhiro Sogawa

Links