HomeLREC 2026WorkshopsKGLLMlrec2026-ws-kgllm-14
Back to KGLLM 2026
LREC 2026workshop

LLM-based Atomic Propositions Help Weak Extractors: Evaluation of a Propositioner for Triplet Extraction

Proceedings of the Knowledge Graphs and Large Language Models Workshop (KG-LLM) @ LREC26

DOI:10.63317/3kna3utavhgb

Abstract

Knowledge Graph construction from natural language requires extracting structured triplets from complex, information-dense sentences. In this paper, we investigate if the decomposition of text into atomic propositions (minimal, semantically autonomous units of information) can improve the triplet extraction. We introduce MPropositionneur-V2, a small multilingual model covering six European languages trained by knowledge distillation from Qwen3-32B into a Qwen3-0.6B architecture, and we evaluate its integration into two extraction paradigms: entity-centric (GLiREL) and generative (Qwen3). Experiments on SMiLER, FewRel, DocRED and CaRB show that atomic propositions benefit weaker extractors (GLiREL, CoreNLP, 0.6B models), improving relation recall and, in the multilingual setting, overall accuracy. For stronger LLMs, a fallback combination strategy recovers entity recall losses while preserving the gains in relation extraction. These results show that atomic propositions are an interpretable intermediate data structure that complements extractors without replacing them.

Details

Paper ID
lrec2026-ws-kgllm-14
Pages
pp. 134-143
BibKey
pommeret-etal-2026-llm
Editors
Gilles Sérasset, Katerina Gkirtzou, Michael Cochez, Jan-Christoph Kalo
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Knowledge Graphs and Large Language Models Workshop (KG-LLM) @ LREC26
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • LP

    Luc Pommeret

  • TG

    Thomas Gerald

  • CS

    Christophe Servan

  • SG

    Sahar Ghannay

  • PP

    Patrick Paroubek

  • SR

    Sophie Rosset

Links