Back to Main Conference 2022
LREC 2022main

Organizing and Improving a Database of French Word Formation Using Formal Concept Analysis

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/57isk9bjz6bf

Abstract

We apply Formal Concept Analysis (FCA) to organize and to improve the quality of Démonette2, a French derivational database, through a detection of both missing and spurious derivations in the database. We represent each derivational family as a graph. Given that the subgraph relation exists among derivational families, FCA can group families and represent them in a partially ordered set (poset). This poset is also useful for improving the database. A family is regarded as a possible anomaly (meaning that it may have missing and/or spurious derivations) if its derivational graph is almost, but not completely identical to a large number of other families.

Details

Paper ID
lrec2022-main-422
Pages
pp. 3969-3976
BibKey
juniarta-etal-2022-organizing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • NJ

    Nyoman Juniarta

  • OB

    Olivier Bonami

  • NH

    Nabil Hathout

  • FN

    Fiammetta Namer

  • YT

    Yannick Toussaint

Links